LLM ReferenceLLM Reference

Nemotron-3

9 models2024–2026Up to 1M ctxFrom $0.09/1M input

About

The Nemotron-3 8B family of foundation models is tailored for enterprise-level AI solutions, enabling the creation of generative AI applications. This collection features types including base, chat, and Q&A models, each optimized for distinct tasks. The Nemotron-3-8B-Base model is designed for customization, supporting fine-tuning and continuous pretraining to fit specific domain needs. Chat variants—Nemotron-3-8B-Chat-SFT, Chat-RLHF, and Chat-SteerLM—are built for chatbot deployment with varying customization levels. The Nemotron-3-8B-QA model excels in question-and-answer contexts. All models integrate with the NVIDIA NeMo framework for seamless customization and deployment, supporting 53 human and 37 programming languages, and are trained on datasets with trillions of tokens 145.

Specifications(9 models)

Nemotron-3 model specifications comparison
ModelReleasedContextParametersVisionMultimodalFn CallingTool UseStructured Outputs
Nemotron 3 Nano2026-03256K3.97BNoNoYesYesNo
Nemotron 3 VoiceChat2026-0312BYesYesNoNoNo
Nemotron 3 Super-120B-A12B2026-031M120BNoNoNoNoYes
Nemotron 3 8B2026-034K8BNoNoNoNoNo
Nemotron 3 Super2026-031M120.6B (12.7B active)NoNoNoNoNo
NVIDIA Nemotron 3 Super 120B2026-03262K120B total, 12B activeNoNoNoNoYes
Llama 3.3 Nemotron Super 49B v12025-06128K49BNoNoNoNoNo
Llama 3.1 Nemotron 70B Reward2024-104K70BNoNoNoNoNo
Nemotron 3 Ultra2024-09128KNoNoNoNoNo

Available From(4 providers)

Pricing

Nemotron-3 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Nemotron 3 Super-120B-A12BOpenRouter$0.09$0.45Serverless
NVIDIA Nemotron 3 Super 120BDeepInfra$0.1$0.5Serverless
Nemotron 3 8BMicrosoft Foundry$0.37$1.1Provisioned

Frequently Asked Questions

What is Nemotron-3?
The Nemotron-3 8B family of foundation models is tailored for enterprise-level AI solutions, enabling the creation of generative AI applications. This collection features types including base, chat, and Q&A models, each optimized for distinct tasks. The Nemotron-3-8B-Base model is designed for customization, supporting fine-tuning and continuous pretraining to fit specific domain needs. Chat variants—Nemotron-3-8B-Chat-SFT, Chat-RLHF, and Chat-SteerLM—are built for chatbot deployment with varying customization levels. The Nemotron-3-8B-QA model excels in question-and-answer contexts. All models integrate with the NVIDIA NeMo framework for seamless customization and deployment, supporting 53 human and 37 programming languages, and are trained on datasets with trillions of tokens 145.
How many models are in the Nemotron-3 family?
The Nemotron-3 family contains 9 models.
What is the latest Nemotron-3 model?
The latest model is Nemotron 3 Nano, released in 2026-03.
How much does Nemotron-3 cost?
Nemotron-3 models range from $0.09/1M to $0.37/1M input tokens depending on the model and provider.
Is Nemotron-3 open source?
2 of 9 Nemotron-3 models are open source.

Models(9)