What is Nemotron 3 used for?

Nemotron 3 is used for audio, moderation, and realtime voice. The family description and listed model capabilities point to those workloads as the best fit.

How does Nemotron 3 compare to NVIDIA Nemotron Nano 12B v2 VL?

Nemotron 3 by NVIDIA AI is strongest where you need audio, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron 3 has 9 listed variants and reaches up to 1.05m context, so compare the specs and pricing tables before choosing a production model.

Which Nemotron 3 model should I use?

For the lowest listed input price, start with Nemotron 3 Nano 30B-A3B through Vercel AI Gateway at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Nemotron 3 Nano with 256k context and tool use and function calling.

Nemotron 3 Models by NVIDIA AI

NVIDIA AINVIDIA Open ModelOpen weights

9 models2024–2026Up to 1.05m ctxFrom $0.05/1M input

Details

ResearcherNVIDIA AI

LicenseNVIDIA Open Model

Commercial useCommercial use: permitted

Models9

Released2024–2026

Max context1.05m

Capabilities

Vision2 of 9 models

Multimodal3 of 9 models

Reasoning1 of 9 models

Function Calling1 of 9 models

Tool Use1 of 9 models

Structured Outputs2 of 9 models

Links

Website HuggingFace

About

NVIDIA Nemotron 3 is the 2025-2026 open model family covering Nano 30B-A3B, Super 120B-A12B, Content Safety 4B, VoiceChat 12B, and Nano Omni variants for agentic reasoning, safety classification, and multimodal deployment.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

9 in view

Nemotron 3 UltraCurrent

Use when the workload needs 1m context, 550B parameters, and reasoning.

2026-061m context550B parametersreasoning

Nemotron 3 Nano OmniCurrent

Use when the workload needs audio, 262k context, and 30B parameters.

2026-04audio262k context30B parameters

Nemotron 3 Content SafetyCurrent

Use when the workload needs moderation, 131k context, and 4B parameters.

2026-03moderation131k context4B parameters

Nemotron 3 VoiceChatCurrent

Use when the workload needs realtime voice, 12B parameters, and multimodal inputs.

2026-03realtime voice12B parametersmultimodal inputs

Nemotron 3 Super-120B-A12BCurrent

Use when the workload needs 1.05m context, 120B parameters, and structured outputs.

2026-031.05m context120B parametersstructured outputs

Nemotron 3 NanoCurrent

Use when the workload needs 256k context, 4.0B parameters, and tool use.

2025-12256k context4.0B parameterstool use

Nemotron 3 Nano 30B-A3BCurrent

Use when the workload needs structured outputs.

2025-12structured outputs

Llama 3.3 Nemotron Super 49B v1Current

Use when the workload needs 128k context and 49B parameters.

2025-06128k context49B parameters

Llama 3.1 Nemotron 70B RewardCurrent

Use when the workload needs safety, 4k context, and 70B parameters.

2024-10safety4k context70B parameters

Current Nemotron 3 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Nemotron 3 Ultra	Use when the workload needs 1m context, 550B parameters, and reasoning.	2026-06	1m context550B parametersreasoning	Current
Nemotron 3 Nano Omni	Use when the workload needs audio, 262k context, and 30B parameters.	2026-04	audio262k context30B parameters	Current
Nemotron 3 Content Safety	Use when the workload needs moderation, 131k context, and 4B parameters.	2026-03	moderation131k context4B parameters	Current
Nemotron 3 VoiceChat	Use when the workload needs realtime voice, 12B parameters, and multimodal inputs.	2026-03	realtime voice12B parametersmultimodal inputs	Current
Nemotron 3 Super-120B-A12B	Use when the workload needs 1.05m context, 120B parameters, and structured outputs.	2026-03	1.05m context120B parametersstructured outputs	Current
Nemotron 3 Nano	Use when the workload needs 256k context, 4.0B parameters, and tool use.	2025-12	256k context4.0B parameterstool use	Current
Nemotron 3 Nano 30B-A3B	Use when the workload needs structured outputs.	2025-12	structured outputs	Current
Llama 3.3 Nemotron Super 49B v1	Use when the workload needs 128k context and 49B parameters.	2025-06	128k context49B parameters	Current
Llama 3.1 Nemotron 70B Reward	Use when the workload needs safety, 4k context, and 70B parameters.	2024-10	safety4k context70B parameters	Current

Release Timeline

6 release groups

2026-06

1 current

Nemotron 3 Ultra

1m context550B parametersreasoning

Current

2026-04

1 current

Nemotron 3 Nano Omni

audio262k context30B parameters

Current

2026-03

3 current

Nemotron 3 Content Safety

moderation131k context4B parameters

Current

Nemotron 3 Super-120B-A12B

1.05m context120B parametersstructured outputs

Current

Nemotron 3 VoiceChat

realtime voice12B parametersmultimodal inputs

Current

2025-12

2 current

Nemotron 3 Nano

256k context4.0B parameterstool use

Current

Nemotron 3 Nano 30B-A3B

structured outputs

Current

2025-06

1 current

Llama 3.3 Nemotron Super 49B v1

128k context49B parameters

Current

2024-10

1 current

Llama 3.1 Nemotron 70B Reward

safety4k context70B parameters

Current

Specifications(9 models)

Nemotron 3 model specifications comparison
Model	Released	Context	Parameters	Vision	Multimodal	Reasoning	Fn Calling	Tool Use	Structured Outputs
Nemotron 3 Ultra	2026-06	1m	550B	No	No	Yes	No	No	No
Nemotron 3 Nano Omni	2026-04	262k	30B	No	Yes	No	No	No	No
Nemotron 3 Content Safety	2026-03	131k	4B	Yes	Yes	No	No	No	No
Nemotron 3 VoiceChat	2026-03	—	12B	Yes	Yes	No	No	No	No
Nemotron 3 Super-120B-A12B	2026-03	1.05m	120B	No	No	No	No	No	Yes
Nemotron 3 Nano	2025-12	256k	3.97B	No	No	No	Yes	Yes	No
Nemotron 3 Nano 30B-A3B	2025-12	—	30B (3B active)	No	No	No	No	No	Yes
Llama 3.3 Nemotron Super 49B v1	2025-06	128k	49B	No	No	No	No	No	No
Llama 3.1 Nemotron 70B Reward	2024-10	4k	70B	No	No	No	No	No	No

Available From(7 providers)

AWS Bedrock

Cloudflare Workers AI

Pricing

Nemotron 3 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
Nemotron 3 Nano 30B-A3B	Vercel AI Gateway	$0.05	$0.24	Serverless
Nemotron 3 Nano 30B-A3B	AWS Bedrock	$0.06	$0.24	Serverless
Nemotron 3 Super-120B-A12B	OpenRouter	$0.09	$0.45	Serverless
Nemotron 3 Super-120B-A12B	DeepInfra	$0.1	$0.5	Serverless
Nemotron 3 Super-120B-A12B	NVIDIA NIM	$0.1	$0.5	Serverless
Nemotron 3 Super-120B-A12B	Vercel AI Gateway	$0.15	$0.65	Serverless
Nemotron 3 Ultra	OpenRouter	$0.5	$2.2	Serverless

Popular comparisons in this family

Frequently Asked Questions

What is Nemotron 3 used for?: Nemotron 3 is used for audio, moderation, and realtime voice. The family description and listed model capabilities point to those workloads as the best fit.
How does Nemotron 3 compare to NVIDIA Nemotron Nano 12B v2 VL?: Nemotron 3 by NVIDIA AI is strongest where you need audio, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron 3 has 9 listed variants and reaches up to 1.05m context, so compare the specs and pricing tables before choosing a production model.
Which Nemotron 3 model should I use?: For the lowest listed input price, start with Nemotron 3 Nano 30B-A3B through Vercel AI Gateway at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Nemotron 3 Nano with 256k context and tool use and function calling.

Models(9)

Nemotron 3 Ultra

2026-061m550B1 provider

ReasoningOpen Weights

Nemotron 3 Nano Omni

2026-04262k30B1 provider

MultimodalOpen Weights

Nemotron 3 Content Safety

2026-03131k4B

MultimodalOpen Weights

Nemotron 3 VoiceChat

2026-0312B

MultimodalOpen Weights

Nemotron 3 Super-120B-A12B

2026-031.05m120B6 providers

Open Weights

Nemotron 3 Nano

2025-12256k3.97B1 provider

Open Weights

Nemotron 3 Nano 30B-A3B

2025-1230B (3B active)2 providers

Open Weights

Llama 3.3 Nemotron Super 49B v1

2025-06128k49B1 provider

Open Weights

Llama 3.1 Nemotron 70B Reward

2024-104k70B1 provider

Open Weights

Nemotron 3 Models by NVIDIA AI

Details

Capabilities

Links

About

Current Variants

Release Timeline

Specifications(9 models)

Available From(7 providers)

Pricing

Popular comparisons in this family

Frequently Asked Questions

Related Model Families

Models(9)