Nemotron-Labs-Diffusion 14B
Nemotron-Labs-Diffusion 14B is a released open-weight model with a 131k-token context window, suited to long-context work; evaluate it while provider pricing coverage matures.
Use it for
- Teams evaluating long context
- Workloads that can use a 131k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: Nemotron-Labs-Diffusion
- Released: 2026-05-23
- Context: 131k
- Parameters: 14B
- Architecture: Decoder-only
- Specialization: General
- Openness: Open weights
- License: NVIDIA Open Model
- Commercial use: permitted
- Training: Pretrained
No tracked provider token pricing is available yet.
About
NVIDIA Nemotron-Labs-Diffusion 14B is the largest text model in NVIDIA Research's diffusion language model family, released May 23, 2026. It uses diffusion-based parallel decoding, which enables up to 6× higher throughput than autoregressive baselines, and supports three decoding modes: autoregressive, diffusion, and self-speculation. The training code is released through the NVIDIA Megatron Bridge framework alongside the weights, enabling fine-tuning. The model is released under the NVIDIA Nemotron Open Model License (commercially usable open weights) and is available on Hugging Face at nvidia/Nemotron-Labs-Diffusion-14B.
Nemotron-Labs-Diffusion 14B is an open-weight model in the Nemotron-Labs-Diffusion family. The structured metadata tracks a 131k-token context window. No headline benchmark score is tracked for Nemotron-Labs-Diffusion 14B yet.
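As a rough illustration of what a 131k-token window means for a workload, the sketch below estimates whether a prompt plus a reserved output budget would fit. It is a minimal sketch under stated assumptions, not the model's tokenizer: the page only says "131k", so 131,000 is used as a conservative figure, and the 4-characters-per-token ratio is a generic heuristic for English text. The names `estimate_tokens` and `fits_in_context` are hypothetical helpers introduced here.

```python
# Assumption: the page states only "131k"; 131_000 is a conservative reading.
CONTEXT_WINDOW_TOKENS = 131_000

# Assumption: roughly 4 characters per token, a generic heuristic for English
# text. The model's real tokenizer may produce a different count.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Estimate the token count from the character count (rounded up)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(prompt: str, max_output_tokens: int = 1_024) -> bool:
    """Return True if the estimated prompt plus reserved output fits the window."""
    return estimate_tokens(prompt) + max_output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    short_doc = "word " * 1_000    # 5,000 characters
    long_doc = "word " * 200_000   # 1,000,000 characters
    print(fits_in_context(short_doc))
    print(fits_in_context(long_doc))
```

For a real evaluation, the character heuristic should be replaced with counts from the model's own tokenizer, since token density varies by language and content type.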
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
Benchmark peer bars for Long context
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.