LLM ReferenceLLM Reference
3 models2023Up to 4K ctx

About

Baichuan 2, developed by Baichuan Intelligent Technology, is an advanced series of open-source large language models. It has been trained on a substantial corpus of 2.6 trillion tokens, showing remarkable performance across various Chinese and English benchmarks. The model family comprises 7B and 13B parameter versions, offered in both base and chat configurations, and includes a 4-bit quantized chat model for efficient deployment. These models are available for both academic research and commercial use under an official license. Leveraging PyTorch 2.0's F.scaled_dot_product_attention, they ensure faster inference. Additionally, intermediate checkpoints from its training are provided, supporting research into model training dynamics 23.

Specifications(3 models)

Baichuan model specifications comparison
ModelReleasedContextParameters
Baichuan 13B Chat2023-0613B
Baichuan 13B2023-064K13B
Baichuan 7B2023-064K7B

Frequently Asked Questions

What is Baichuan?
Baichuan 2, developed by Baichuan Intelligent Technology, is an advanced series of open-source large language models. It has been trained on a substantial corpus of 2.6 trillion tokens, showing remarkable performance across various Chinese and English benchmarks. The model family comprises 7B and 13B parameter versions, offered in both base and chat configurations, and includes a 4-bit quantized chat model for efficient deployment. These models are available for both academic research and commercial use under an official license. Leveraging PyTorch 2.0's F.scaled_dot_product_attention, they ensure faster inference. Additionally, intermediate checkpoints from its training are provided, supporting research into model training dynamics 23.
How many models are in the Baichuan family?
The Baichuan family contains 3 models.
What is the latest Baichuan model?
The latest model is Baichuan 13B Chat, released in 2023-06.

Models(3)