LLM Reference

Baichuan Models by Baichuan Intelligent Technology

3 models · 2023 · Up to 4k ctx

About

Baichuan 2, developed by Baichuan Intelligent Technology, is a series of open-source large language models trained on a corpus of 2.6 trillion tokens, with strong results on Chinese and English benchmarks. The family comprises 7B and 13B parameter versions, each offered in base and chat configurations, plus a 4-bit quantized chat model for efficient deployment. The models are available for academic research and commercial use under an official license. They use PyTorch 2.0's F.scaled_dot_product_attention for faster inference. Intermediate training checkpoints are also provided to support research into training dynamics.
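As a rough illustration of why a 4-bit quantized variant helps deployment, the sketch below estimates weight-only memory at 16-bit and 4-bit precision. It assumes nominal parameter counts of 7e9 and 13e9 (the exact counts differ slightly), treats 16-bit as the unquantized baseline, and ignores activations, the KV cache, and quantization metadata such as scales. The function name weight_memory_gib is hypothetical and not part of any Baichuan tooling.

```python
def weight_memory_gib(num_params: float, bits_per_weight: int) -> float:
    """Approximate memory needed for the weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 2**30


# Nominal sizes (assumption): "7B" -> 7e9 parameters, "13B" -> 13e9 parameters.
for name, params in [("7B", 7e9), ("13B", 13e9)]:
    fp16 = weight_memory_gib(params, 16)
    int4 = weight_memory_gib(params, 4)
    print(f"{name}: 16-bit ~ {fp16:.1f} GiB, 4-bit ~ {int4:.1f} GiB")
```

Under these assumptions, 4-bit weights occupy a quarter of the 16-bit footprint. Actual memory use at inference time will be somewhat higher.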

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.


Use when the workload needs 4k context and 13B parameters.

2023-06 · 4k context · 13B parameters

Use when the workload needs 4k context and 13B parameters.

2023-06 · 4k context · 13B parameters

Use when the workload needs 4k context and 7B parameters.

2023-06 · 4k context · 7B parameters

Release Timeline

1 release group

2023-06 (3 current)
Baichuan 13B · 4k context · 13B parameters · Current
Baichuan 13B Chat · 4k context · 13B parameters · Current
Baichuan 7B · 4k context · 7B parameters · Current

Specifications (3 models)

Baichuan model specifications comparison
Model              Released  Context  Parameters
Baichuan 13B Chat  2023-06   4k       13B
Baichuan 13B       2023-06   4k       13B
Baichuan 7B        2023-06   4k       7B

Frequently Asked Questions

What is Baichuan used for?
Baichuan is used for coding. The family description and listed model capabilities point to that workload as the best fit.

How does Baichuan compare to Baichuan 4?
Baichuan is strongest for coding workloads. Baichuan 4, also by Baichuan Intelligent Technology, is the closest related family to consider as an alternative. Baichuan has 3 listed variants and supports up to 4k context, so compare the specs and pricing tables before choosing a production model.

Which Baichuan model should I use?
If price is the main constraint, check the pricing table first, because the local data does not include complete provider pricing for Baichuan. For the most capable and most recent listed choice, evaluate Baichuan 13B with 4k context.

Models (3)