Baichuan
3 models · 2023 · Up to 4K ctx
About
Baichuan 2 is a series of open-source large language models developed by Baichuan Intelligent Technology. The models were trained on a corpus of 2.6 trillion tokens and are reported to perform strongly on Chinese and English benchmarks. The family comprises 7B and 13B parameter versions, each offered in base and chat configurations, plus a 4-bit quantized chat model for more efficient deployment. The models are available for academic research and for commercial use under an official license. The implementation uses PyTorch 2.0's F.scaled_dot_product_attention to speed up inference. Intermediate training checkpoints are also provided to support research into training dynamics.
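For readers unfamiliar with the operation behind F.scaled_dot_product_attention, the following is a minimal pure-Python sketch of the computation it performs, softmax(QKᵀ / √d) · V, for a single attention head. It is an illustration only, not Baichuan's code: the function names, the list-of-lists layout, and the sample inputs are assumptions made here, and the sketch omits the masking, dropout, and fused-kernel optimizations that the PyTorch function provides.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(q, k, v):
    """Reference attention for one head (illustrative helper, not a library API).

    q: list of Lq query vectors, each of length d
    k: list of Lk key vectors, each of length d
    v: list of Lk value vectors, each of length dv
    Returns a list of Lq output vectors, each of length dv.
    """
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    out = []
    for q_row in q:
        # Scaled similarity between this query and every key.
        scores = [scale * sum(a * b for a, b in zip(q_row, k_row)) for k_row in k]
        # Attention weights over the keys; they sum to 1.
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out.append([
            sum(w * v_row[j] for w, v_row in zip(weights, v))
            for j in range(len(v[0]))
        ])
    return out


# Hypothetical toy inputs: two queries, two keys, two values.
q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
result = scaled_dot_product_attention(q, k, v)
```

Each output row is a weighted average of the value vectors, with more weight on the values whose keys best match the query. The PyTorch function computes the same quantity on tensors and can dispatch to fused kernels, which is where the inference speedup comes from.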
Specifications (3 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Baichuan 13B Chat | 2023-06 | — | 13B |
| Baichuan 13B | 2023-06 | 4K | 13B |
| Baichuan 7B | 2023-06 | 4K | 7B |
Frequently Asked Questions
- What is Baichuan?
- Baichuan 2 is a series of open-source large language models developed by Baichuan Intelligent Technology, trained on 2.6 trillion tokens and available in 7B and 13B parameter versions with base, chat, and 4-bit quantized chat variants. The models are licensed for both academic research and commercial use.
- How many models are in the Baichuan family?
- The Baichuan family contains 3 models.
- What is the latest Baichuan model?
- The latest model is Baichuan 13B Chat, released in June 2023.



