LLM Reference

Nanbeige4-3B

Released: 2025-12-13
Last refreshed: 2026-05-19
Status: Researched 28d ago
Open Weights, China

Nanbeige4-3B is an open-weight model in the Nanbeige4 family, released 2025-12-13. Evaluate it while provider pricing coverage matures.

Use it for

  • Teams evaluating general LLM work
  • Workloads that can use a 64k context window

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Specifications

Family: Nanbeige4
Released: 2025-12-13
Context: 64k
Parameters: 3B
Architecture: Decoder-only
Openness: Open weights
Training: Pretrained

Created by

Massive multilingual model, open-source focus.

Beijing, China
Founded 2023
Pricing

No tracked provider token pricing is available yet.

About

Nanbeige4-3B is an open-source 3B-parameter language model by Nanbeige LLM Lab (BOSS Zhipin), released December 2025. It was pre-trained on 23 trillion high-quality tokens and then fine-tuned (SFT) on more than 30M diverse instructions. Its context window was extended to 64K via Adjusted Base Frequency (ABF). It reports state-of-the-art results among sub-10B models on AIME 2024 (90.4), AIME 2025 (85.6), and GPQA-Diamond (82.2), outperforming models up to 10× larger, including Qwen3-32B. arXiv: 2512.06266. Hugging Face: Nanbeige/Nanbeige4-3B-Base.
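
ABF is generally understood as raising the base of the rotary position embedding (RoPE) so that the slowest-rotating dimensions cover more positions before repeating. The sketch below illustrates that general idea only. The head dimension and both base values are illustrative assumptions, not figures taken from the Nanbeige4-3B report, and the helper names are hypothetical.

```python
import math


def rope_inverse_frequencies(head_dim: int, base: float) -> list[float]:
    """Rotation frequency for each pair of dimensions in standard RoPE."""
    if head_dim % 2 != 0:
        raise ValueError("head_dim must be even")
    # Pair i rotates at base^(-2i / head_dim) radians per position.
    return [base ** (-2 * i / head_dim) for i in range(head_dim // 2)]


def longest_wavelength(head_dim: int, base: float) -> float:
    """Positions needed for the slowest-rotating pair to complete one turn."""
    slowest = rope_inverse_frequencies(head_dim, base)[-1]
    return 2 * math.pi / slowest


# All three constants are assumptions chosen for illustration only.
HEAD_DIM = 128
ORIGINAL_BASE = 10_000.0      # common RoPE default in many open models
ADJUSTED_BASE = 1_000_000.0   # hypothetical larger base after ABF

for label, base in [("original", ORIGINAL_BASE), ("adjusted", ADJUSTED_BASE)]:
    wavelength = longest_wavelength(HEAD_DIM, base)
    print(f"{label} base={base:,.0f}: longest wavelength ~ {wavelength:,.0f} positions")
```

A larger base lengthens the longest wavelength, which is why the adjustment is typically paired with further training on long sequences. The values the model actually uses should be read from its published configuration.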

Nanbeige4-3B is an open-weight model in the Nanbeige4 family. The structured metadata tracks a 64k-token context window and a reasoning capability. No headline benchmark score is tracked in the structured metadata for Nanbeige4-3B yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Capabilities

Reasoning

Benchmark peer bars for Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

API versions

nanbeige4-3b-base
nanbeige4-3b-thinking-2511