Nemotron Nano 2 Models by NVIDIA AI
6 models2025Up to 4K ctxFrom $0.04/1M input
About
Second generation Nemotron Nano with Hybrid Transformer-Mamba architecture
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
6 in view
Nemotron-Nano-12B-v2-VLCurrent
Use when the workload needs 12B parameters, structured outputs, and multimodal inputs.
2025-1012B parametersstructured outputsmultimodal inputs
Nemotron-Nano-9B-v2Current
Use when the workload needs 9B parameters and structured outputs.
2025-089B parametersstructured outputs
Use when the workload needs 4K context and 4B parameters.
2025-044K context4B parameters
Use when the workload needs 4K context and 8B parameters.
2025-034K context8B parameters
Use when the workload needs 4K context, 8B parameters, and multimodal inputs.
2025-034K context8B parametersmultimodal inputs
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Nemotron-Nano-12B-v2-VL | Use when the workload needs 12B parameters, structured outputs, and multimodal inputs. | 2025-10 | 12B parametersstructured outputsmultimodal inputs | Current |
| Nemotron-Nano-9B-v2 | Use when the workload needs 9B parameters and structured outputs. | 2025-08 | 9B parametersstructured outputs | Current |
| Nemotron-Nano-12B-v2 | Use when the workload needs 12B parameters. | 2025-08 | 12B parameters | Current |
| Llama 3.1 Nemotron Nano 4B v1.1 | Use when the workload needs 4K context and 4B parameters. | 2025-04 | 4K context4B parameters | Current |
| Llama 3.1 Nemotron Nano 8B v1 | Use when the workload needs 4K context and 8B parameters. | 2025-03 | 4K context8B parameters | Current |
| Llama 3.1 Nemotron Nano VL 8B v1 | Use when the workload needs 4K context, 8B parameters, and multimodal inputs. | 2025-03 | 4K context8B parametersmultimodal inputs | Current |
Release Timeline
4 release groups2025-10
1 current
Nemotron-Nano-12B-v2-VL
Current12B parametersstructured outputsmultimodal inputs
2025-08
2 current
Nemotron-Nano-12B-v2
Current12B parameters
Nemotron-Nano-9B-v2
Current9B parametersstructured outputs
2025-04
1 current
Llama 3.1 Nemotron Nano 4B v1.1
Current4K context4B parameters
2025-03
2 current
Llama 3.1 Nemotron Nano 8B v1
Current4K context8B parameters
Llama 3.1 Nemotron Nano VL 8B v1
Current4K context8B parametersmultimodal inputs
Specifications(6 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Structured Outputs |
|---|---|---|---|---|---|---|
| Nemotron-Nano-12B-v2-VL | 2025-10 | — | 12B | Yes | Yes | Yes |
| Nemotron-Nano-9B-v2 | 2025-08 | — | 9B | No | No | Yes |
| Nemotron-Nano-12B-v2 | 2025-08 | — | 12B | No | No | No |
| Llama 3.1 Nemotron Nano 4B v1.1 | 2025-04 | 4K | 4B | No | No | No |
| Llama 3.1 Nemotron Nano 8B v1 | 2025-03 | 4K | 8B | No | No | No |
| Llama 3.1 Nemotron Nano VL 8B v1 | 2025-03 | 4K | 8B | Yes | Yes | No |
Available From(2 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Nemotron-Nano-9B-v2 | OpenRouter | $0.04 | $0.16 | Serverless |
| Nemotron-Nano-12B-v2-VL | OpenRouter | $0.2 | $0.6 | Serverless |
Frequently Asked Questions
- What is Nemotron Nano 2 used for?
- Nemotron Nano 2 is used for vision and multimodal work and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
- How does Nemotron Nano 2 compare to NVIDIA Nemotron Nano 12B v2 VL?
- Nemotron Nano 2 by NVIDIA AI is strongest where you need vision and multimodal work, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron Nano 2 has 6 listed variants and reaches up to 4K context, so compare the specs and pricing tables before choosing a production model.
- Which Nemotron Nano 2 model should I use?
- For the lowest listed input price, start with Nemotron-Nano-9B-v2 through OpenRouter at $0.04/1M input tokens. For the most capable/latest local choice, evaluate Nemotron-Nano-12B-v2-VL with structured outputs and multimodal inputs.




