About
The Aquila 2 family is a series of bilingual large language models designed to handle both Chinese and English proficiently. The models range from 7 billion to 70 billion parameters, covering a broad spectrum of computational budgets. They are trained with the HeuriMentor (HM) framework, which dynamically adjusts data distributions during training to improve efficiency and model performance; its components include the Adaptive Training Engine (ATE) and the Training State Monitor (TSM). The Aquila 2 models perform consistently well on key benchmarks and have been open-sourced to encourage further innovation. Notably, the Aquila2-34B model retains its performance even when quantized to the Int4 format, highlighting its efficiency. The AquilaChat2 variants are fine-tuned for conversational applications, demonstrating the versatility of the Aquila 2 family.
