LLM ReferenceLLM Reference
NVIDIA NIM

Nemotron 3 Nano on NVIDIA NIM

Nemotron 3 · NVIDIA AI

Serverless

Last refreshed 2026-05-14. Next refresh: weekly.

Why use Nemotron 3 Nano on NVIDIA NIM?

NVIDIA NIM offers Nemotron 3 Nano with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: nvidia/nemotron-3-nano-30b-a3b
Model ID
nvidia/nemotron-3-nano-30b-a3b

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID nvidia/nemotron-3-nano-30b-a3b.

Gotchas

  • Use provider model ID "nvidia/nemotron-3-nano-30b-a3b", not the LLMReference slug "nemotron-3-nano".

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr
GPU Config1xH100

Capabilities

Function CallingTool Use

About Nemotron 3 Nano

NVIDIA's lightweight 3.97B parameter model optimized for edge deployment with FP8 quantization (W8A8 mixed precision). Designed for agentic AI applications including gaming NPCs, local voice assistants, and IoT automation. Supports instruction following, tool use, and hallucination avoidance. Strong performance on BFCL, IFBench, IFEval, HaluEval, RULER, Tau2, AIME25, MATH500, GPQA-D, and LiveCodeBench.

FAQ

What is the context window for Nemotron 3 Nano on NVIDIA NIM?

Nemotron 3 Nano supports a 256,000 token context window on NVIDIA NIM.

What API model ID do I use for Nemotron 3 Nano on NVIDIA NIM?

Use the model ID nvidia/nemotron-3-nano-30b-a3b when calling NVIDIA NIM's API.

Who created Nemotron 3 Nano?

Nemotron 3 Nano was created by NVIDIA AI as part of the Nemotron 3 model family.

Is Nemotron 3 Nano open source?

Nemotron 3 Nano is open source under Apache 2.0 according to the seed data.

Get Started

Model Specs

Released2025-12-15
Parameters3.97B
Context256K
ArchitectureMixture of Experts