LLM Reference

Aquila Chat 2 7B

About

The AquilaChat2-7B is a sophisticated large language model (LLM) from BAAI, designed to excel in bilingual tasks, supporting both Chinese and English languages seamlessly. It's part of the Aquila2 series, featuring enhanced architectures reminiscent of GPT-3 and LLaMA, alongside a refined bilingual tokenizer and improved operator implementations. This model is tailored for conversational AI, offering impressive capabilities in generating fluent dialogue and performing various language tasks. Notable for its rapid training efficiency—nearly eight times faster than other models like Magtron+DeepSpeed ZeRO-2—it operates under the BAAI Aquila Model License Agreement, with allowances for commercial use under specific conditions. Moreover, it adheres to Chinese domestic data regulations and is open-source, fostering integration with other models and tools through special instruction specifications, while also being optimized for high performance with more compact datasets.

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Specifications

FamilyAquila 2
Parameters7B
ArchitectureDecoder Only
Specializationgeneral