abab 6.5
About
The abab 6.5 series from MiniMax comprises two LLMs, abab 6.5 and abab 6.5s, both built on a Mixture of Experts (MoE) architecture and supporting a context length of 200k tokens. abab 6.5 has one trillion parameters, while abab 6.5s is optimized for efficiency, processing roughly 30,000 characters per second. Both models perform near industry-leading levels on core capabilities such as knowledge, reasoning, mathematics, and programming. Their long context windows make them well suited to long-text processing and complex tasks such as research, data analysis, and working with large text datasets, with abab 6.5s offering a clear speed advantage. Despite these strengths, the models share common LLM limitations, including hallucinations and biases.