LLM Reference

Nemotron-Labs-Diffusion Models by NVIDIA AI

NVIDIA AINVIDIA Open ModelOpen weights
3 models2026Up to 131k ctx

Details

ResearcherNVIDIA AI
Commercial useCommercial use: permitted
Models3
Released2026
Max context131k

About

NVIDIA Nemotron-Labs-Diffusion is a family of diffusion language models (DLMs) released by NVIDIA Research in May 2026. Unlike traditional autoregressive models, they generate text by producing multiple tokens in parallel and iteratively refining them, enabling up to 6× throughput over comparable AR models. Available in 3B, 8B, and 14B sizes with base and instruct variants; an 8B VLM is also included.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

3 in view

Use when the workload needs 131k context and 3B parameters.

2026-05131k context3B parameters

Use when the workload needs 131k context and 8B parameters.

2026-05131k context8B parameters

Use when the workload needs 131k context, 14B parameters, and fine tuning.

2026-05131k context14B parametersfine tuning

Release Timeline

1 release group
2026-05
3 current
Nemotron-Labs-Diffusion 14B
131k context14B parametersfine tuning
Current
Nemotron-Labs-Diffusion 3B
131k context3B parameters
Current
Nemotron-Labs-Diffusion 8B
131k context8B parameters
Current

Specifications(3 models)

Nemotron-Labs-Diffusion model specifications comparison
ModelReleasedContextParameters
Nemotron-Labs-Diffusion 3B2026-05131k3B
Nemotron-Labs-Diffusion 8B2026-05131k8B
Nemotron-Labs-Diffusion 14B2026-05131k14B

Frequently Asked Questions

What is Nemotron-Labs-Diffusion used for?
Nemotron-Labs-Diffusion is used for coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Nemotron-Labs-Diffusion compare to NVIDIA Nemotron Nano 12B v2 VL?
Nemotron-Labs-Diffusion by NVIDIA AI is strongest where you need coding, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron-Labs-Diffusion has 3 listed variants and reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
Which Nemotron-Labs-Diffusion model should I use?
If price is the main constraint, use the pricing table first because Nemotron-Labs-Diffusion does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Nemotron-Labs-Diffusion 3B with 131k context.