Zamba 2
About
The Zamba 2 family of large language models (LLMs) by Zyphra represents a novel integration of state-space Mamba and transformer blocks to achieve optimal performance, especially in low-resource settings. This architecture, built on a Mamba backbone, strategically alternates with transformer blocks to reduce parameters and conserve memory. Enhancements over the previous version include new Mamba2 blocks and dual interleaved attention layers, as well as LoRA projectors to tailor MLPs. The family offers models like the 2.7B and 7B ones, with the larger 7B version performing exceptionally well against peers of similar scale. Zamba2-7B-Instruct, a fine-tuned model variant, extends context length to 16,000 tokens, enhancing its prowess on instruction-following tasks. All models are open-source under the Apache 2.0 license, further promoting accessibility and innovation 235.
Specifications(6 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| Zamba2 7B | 2024-06 | — | 7B |
| Zamba2 7B Instruct | 2024-06 | 16K | 7B |
| Zamba2 2.7B | 2024-06 | — | 2.7B |
| Zamba2 2.7B Instruct | 2024-06 | — | 2.7B |
| Zamba2 1.2B | 2024-06 | — | 1.2B |
| Zamba2 1.2B Instruct | 2024-06 | — | 1.2B |
Frequently Asked Questions
- What is Zamba 2?
- The Zamba 2 family of large language models (LLMs) by Zyphra represents a novel integration of state-space Mamba and transformer blocks to achieve optimal performance, especially in low-resource settings. This architecture, built on a Mamba backbone, strategically alternates with transformer blocks to reduce parameters and conserve memory. Enhancements over the previous version include new Mamba2 blocks and dual interleaved attention layers, as well as LoRA projectors to tailor MLPs. The family offers models like the 2.7B and 7B ones, with the larger 7B version performing exceptionally well against peers of similar scale. Zamba2-7B-Instruct, a fine-tuned model variant, extends context length to 16,000 tokens, enhancing its prowess on instruction-following tasks. All models are open-source under the Apache 2.0 license, further promoting accessibility and innovation 235.
- How many models are in the Zamba 2 family?
- The Zamba 2 family contains 6 models.
- What is the latest Zamba 2 model?
- The latest model is Zamba2 7B, released in 2024-06.

