Falcon Mamba 7B
falcon-mamba-7b
Last refreshed 2026-05-25. Next refresh: weekly.
Falcon Mamba 7B has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Decision context: Coding task fit, 0 tracked provider routes, and research from 2026-05-25.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 8K context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Cheapest output
-
No tracked output price
Provider routes
0
No provider route in seed
Quality / dollar
Unknown
No task benchmark coverage yet
Freshness
2026-05-25
Researched today
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
About
Falcon Mamba 7B is TII's first open-weight state-space language model (SSLM), released August 12, 2024. The first competitive attention-free 7B model, verified by HuggingFace as the no.1 open-source SSLM globally at release. Built on the Mamba architecture with extra RMS normalization layers for stable training at scale. Outperforms Llama 3.1 8B and Mistral 7B on several benchmarks despite using no attention. 65K vocabulary, 64 layers, 4096 hidden dimension. Trained on approximately 5.5 trillion tokens from RefinedWeb and Fineweb-edu. Succeeded by Falcon3-Mamba-7B (December 2024).
Falcon Mamba 7B has a 8K-token context window.
Capabilities
No model capability flags are currently sourced.
Specifications
Created by
Innovative open-source AI for global impact