LLM ReferenceLLM Reference

Zamba 2

6 models2024Up to 16K ctx

About

The Zamba 2 family of large language models (LLMs) by Zyphra represents a novel integration of state-space Mamba and transformer blocks to achieve optimal performance, especially in low-resource settings. This architecture, built on a Mamba backbone, strategically alternates with transformer blocks to reduce parameters and conserve memory. Enhancements over the previous version include new Mamba2 blocks and dual interleaved attention layers, as well as LoRA projectors to tailor MLPs. The family offers models like the 2.7B and 7B ones, with the larger 7B version performing exceptionally well against peers of similar scale. Zamba2-7B-Instruct, a fine-tuned model variant, extends context length to 16,000 tokens, enhancing its prowess on instruction-following tasks. All models are open-source under the Apache 2.0 license, further promoting accessibility and innovation 235.

Specifications(6 models)

Zamba 2 model specifications comparison
ModelReleasedContextParameters
Zamba2 7B2024-067B
Zamba2 7B Instruct2024-0616K7B
Zamba2 2.7B2024-062.7B
Zamba2 2.7B Instruct2024-062.7B
Zamba2 1.2B2024-061.2B
Zamba2 1.2B Instruct2024-061.2B

Frequently Asked Questions

What is Zamba 2?
The Zamba 2 family of large language models (LLMs) by Zyphra represents a novel integration of state-space Mamba and transformer blocks to achieve optimal performance, especially in low-resource settings. This architecture, built on a Mamba backbone, strategically alternates with transformer blocks to reduce parameters and conserve memory. Enhancements over the previous version include new Mamba2 blocks and dual interleaved attention layers, as well as LoRA projectors to tailor MLPs. The family offers models like the 2.7B and 7B ones, with the larger 7B version performing exceptionally well against peers of similar scale. Zamba2-7B-Instruct, a fine-tuned model variant, extends context length to 16,000 tokens, enhancing its prowess on instruction-following tasks. All models are open-source under the Apache 2.0 license, further promoting accessibility and innovation 235.
How many models are in the Zamba 2 family?
The Zamba 2 family contains 6 models.
What is the latest Zamba 2 model?
The latest model is Zamba2 7B, released in 2024-06.

Models(6)