Nanbeige4-3B
Nanbeige4-3B is an open-weight model in the Nanbeige4 family, released 2025-12-13; evaluate it while provider pricing coverage matures.
Use it for
- Teams evaluating general LLM work
- Workloads that can use a 64k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: Nanbeige4
- Released: 2025-12-13
- Context: 64k
- Parameters: 3B
- Architecture: Decoder-only
- Openness: Open weights
- Training: Pretrained
No tracked provider token pricing is available yet.
About
Nanbeige4-3B is an open-source 3B-parameter language model from Nanbeige LLM Lab (BOSS Zhipin), released in December 2025. It was pre-trained on 23 trillion high-quality tokens and then fine-tuned with SFT on more than 30 million diverse instructions. Its context window was extended to 64K tokens via Adjusted Base Frequency (ABF). Among sub-10B models it reports state-of-the-art scores on AIME 2024 (90.4), AIME 2025 (85.6), and GPQA-Diamond (82.2), outperforming models up to 10× larger, including Qwen3-32B. arXiv: 2512.06266. Hugging Face: Nanbeige/Nanbeige4-3B-Base.
Nanbeige4-3B is an open-weight model in the Nanbeige4 family. The structured metadata tracks a 64k-token context window and a reasoning capability. No headline benchmark score is tracked in the structured metadata for Nanbeige4-3B yet.
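The ABF context extension mentioned above can be illustrated with a minimal sketch. ABF is generally understood as raising the base of rotary position embeddings (RoPE) so that the slowest-rotating dimensions cover longer sequences. The head dimension (128) and both base values (10,000 and 1,000,000) below are illustrative assumptions, not Nanbeige4-3B's published configuration, and the function names are hypothetical.

```python
import math


def rope_inv_freq(head_dim: int, base: float) -> list[float]:
    """Per-pair inverse frequencies used by rotary position embeddings."""
    return [base ** (-2 * i / head_dim) for i in range(head_dim // 2)]


def longest_wavelength(head_dim: int, base: float) -> float:
    """Positions needed for the slowest-rotating pair to complete one full turn."""
    slowest = rope_inv_freq(head_dim, base)[-1]
    return 2 * math.pi / slowest


# Assumed values for illustration only (not taken from the model's config).
HEAD_DIM = 128
ORIGINAL_BASE = 10_000.0
ADJUSTED_BASE = 1_000_000.0

if __name__ == "__main__":
    for name, base in [("original", ORIGINAL_BASE), ("adjusted", ADJUSTED_BASE)]:
        wavelength = longest_wavelength(HEAD_DIM, base)
        print(f"{name} base={base:.0f}: longest wavelength ~ {wavelength:,.0f} positions")
```

Under these assumptions, the larger base stretches the longest wavelength well past a 64k-token window, which is the intuition behind adjusting the base frequency for long-context use. The model's actual RoPE settings should be read from its published configuration.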
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
Benchmark peer bars for Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
API versions
- nanbeige4-3b-base
- nanbeige4-3b-thinking-2511