Nemotron 3 Nano

Name: Nemotron 3 Nano
Author: NVIDIA AI

Released

2025-12-15

Last refreshed

2026-05-14

Status

Researched 79d ago

Open weightsCommercial use: permittedRAGAgentsLong contextJSON / Tool use

Nemotron 3 Nano is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Use it for

Teams evaluating rag, agents, and long context
Workloads that can use a 256k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Nemotron 3
Released: 2025-12-15
Context: 256k
Parameters: 3.97B
Architecture: Mixture of Experts
Specialization: general
Openness: Open weights
License: NVIDIA Open ModelCommercial use: permitted
Weights: Available
Code: Unknown
Training: Pretrained

Created by

NVIDIA AI

Accelerated AI for enterprise solutions

Santa Clara, California, United States

Founded 2015

Website

Pricing

Output / 1M

Input / 1M

Cheapest of 1 route · NVIDIA NIM

Providers(1)

NVIDIA NIM

View 1 provider route

About

NVIDIA's lightweight 3.97B parameter model optimized for edge deployment with FP8 quantization (W8A8 mixed precision). Designed for agentic AI applications including gaming NPCs, local voice assistants, and IoT automation. Supports instruction following, tool use, and hallucination avoidance. Strong performance on BFCL, IFBench, IFEval, HaluEval, RULER, Tau2, AIME25, MATH500, GPQA-D, and LiveCodeBench.

Nemotron 3 Nano is an open-weight model in the Nemotron 3 family. The structured metadata tracks a 256k-token context window, function calling, and tool use. This page tracks provider routes through NVIDIA NIM. No headline benchmark score is tracked for Nemotron 3 Nano yet.

Top use-case fit: coding, agents, and build tasks

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
NVIDIA NIM	-	-	ServerlessPartial

Available via routers & gateways(1)

NVIDIA LLM Router Blueprint

Router

NVIDIA's open-source AI blueprint for LLM routing that selects the optimal model per prompt via intent classification or neural auto-routing; being deprecated 2026-06-20.

Free OSSNVIDIA NIM