LLM Reference
NVIDIA NIM

Nemotron 3 Super-120B-A12B on NVIDIA NIM

Nemotron 3 · NVIDIA AI

ServerlessOpen Weights

Last refreshed 2026-06-01. Next refresh: weekly.

Why use Nemotron 3 Super-120B-A12B on NVIDIA NIM?

NVIDIA NIM offers Nemotron 3 Super-120B-A12B with pay-as-you-go pricing at $0.10/1M input tokens. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Compare Nemotron 3 Super-120B-A12B across 6 providers to find the best fit for your use case
Input / 1M
$0.10
Output / 1M
$0.50
Cache
read $0.10
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: nvidia/nemotron-3-super-120b-a12b
Model ID
nvidia/nemotron-3-super-120b-a12b

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID nvidia/nemotron-3-super-120b-a12b.

Gotchas

  • Use provider model ID "nvidia/nemotron-3-super-120b-a12b", not the LLMReference slug "nemotron-3-super-120b-a12b".

Compare Nemotron 3 Super-120B-A12B Across Providers

ProviderInput (per 1M)Output (per 1M)
Cloudflare Workers AI
DeepInfra$0.10$0.50
NVIDIA NIM$0.10$0.50
OpenRouter$0.09$0.45
Fireworks AI
View all 6 providers →

Pricing

TypePrice (per 1M)
Input tokens$0.10
Output tokens$0.50

Capabilities

Structured Outputs

About Nemotron 3 Super-120B-A12B

NVIDIA Nemotron 3 Super-120B-A12B is a 120B total / 12B active hybrid Latent MoE model with interleaved Mamba-2 and MoE layers for agentic, reasoning, and conversational tasks. Fireworks lists the NVFP4 variant for on-demand deployment with 262k context.

FAQ

What does Nemotron 3 Super-120B-A12B cost on NVIDIA NIM?

On NVIDIA NIM, Nemotron 3 Super-120B-A12B costs $0.10 per 1M input tokens and $0.50 per 1M output tokens.

What is the context window for Nemotron 3 Super-120B-A12B on NVIDIA NIM?

Nemotron 3 Super-120B-A12B supports a 1m token context window on NVIDIA NIM.

How does NVIDIA NIM compare to other Nemotron 3 Super-120B-A12B providers?

Nemotron 3 Super-120B-A12B is available from 6 providers. The cheapest input pricing is $0.09/1M tokens from OpenRouter.

What API model ID do I use for Nemotron 3 Super-120B-A12B on NVIDIA NIM?

Use the model ID nvidia/nemotron-3-super-120b-a12b when calling NVIDIA NIM's API.

Who created Nemotron 3 Super-120B-A12B?

Nemotron 3 Super-120B-A12B was created by NVIDIA AI as part of the Nemotron 3 model family.

Is Nemotron 3 Super-120B-A12B open source?

Nemotron 3 Super-120B-A12B has open weights under NVIDIA Open Model according to the seed data, but that does not necessarily mean an OSI-approved open-source license.

Get Started