LLM ReferenceLLM Reference

Nemotron Nano 2 Models by NVIDIA AI

6 models2025Up to 4K ctxFrom $0.04/1M input

About

Second generation Nemotron Nano with Hybrid Transformer-Mamba architecture

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

6 in view

Use when the workload needs 12B parameters, structured outputs, and multimodal inputs.

2025-1012B parametersstructured outputsmultimodal inputs

Use when the workload needs 9B parameters and structured outputs.

2025-089B parametersstructured outputs

Use when the workload needs 12B parameters.

2025-0812B parameters

Use when the workload needs 4K context and 4B parameters.

2025-044K context4B parameters

Use when the workload needs 4K context and 8B parameters.

2025-034K context8B parameters

Use when the workload needs 4K context, 8B parameters, and multimodal inputs.

2025-034K context8B parametersmultimodal inputs

Release Timeline

4 release groups
2025-10
1 current
Nemotron-Nano-12B-v2-VL
12B parametersstructured outputsmultimodal inputs
Current
2025-08
2 current
Nemotron-Nano-12B-v2
12B parameters
Current
Nemotron-Nano-9B-v2
9B parametersstructured outputs
Current
2025-04
1 current
Llama 3.1 Nemotron Nano 4B v1.1
4K context4B parameters
Current
2025-03
2 current
Llama 3.1 Nemotron Nano 8B v1
4K context8B parameters
Current
Llama 3.1 Nemotron Nano VL 8B v1
4K context8B parametersmultimodal inputs
Current

Specifications(6 models)

Nemotron Nano 2 model specifications comparison
ModelReleasedContextParametersVisionMultimodalStructured Outputs
Nemotron-Nano-12B-v2-VL2025-1012BYesYesYes
Nemotron-Nano-9B-v22025-089BNoNoYes
Nemotron-Nano-12B-v22025-0812BNoNoNo
Llama 3.1 Nemotron Nano 4B v1.12025-044K4BNoNoNo
Llama 3.1 Nemotron Nano 8B v12025-034K8BNoNoNo
Llama 3.1 Nemotron Nano VL 8B v12025-034K8BYesYesNo

Available From(2 providers)

Pricing

Nemotron Nano 2 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Nemotron-Nano-9B-v2OpenRouter$0.04$0.16Serverless
Nemotron-Nano-12B-v2-VLOpenRouter$0.2$0.6Serverless

Frequently Asked Questions

What is Nemotron Nano 2 used for?
Nemotron Nano 2 is used for vision and multimodal work and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Nemotron Nano 2 compare to NVIDIA Nemotron Nano 12B v2 VL?
Nemotron Nano 2 by NVIDIA AI is strongest where you need vision and multimodal work, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron Nano 2 has 6 listed variants and reaches up to 4K context, so compare the specs and pricing tables before choosing a production model.
Which Nemotron Nano 2 model should I use?
For the lowest listed input price, start with Nemotron-Nano-9B-v2 through OpenRouter at $0.04/1M input tokens. For the most capable/latest local choice, evaluate Nemotron-Nano-12B-v2-VL with structured outputs and multimodal inputs.

Models(6)