Using Nemotron 3 Super-120B-A12B on OpenRouter
Implementation guide · Nemotron 3 · NVIDIA AI
ServerlessOpen Weights
Quick Start
- 1
- 2Use the OpenRouter SDK or REST API to call
nvidia/nemotron-3-super-120b-a12b— see the documentation for request format. - 3
Code Examples
About OpenRouter
OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.
OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.
Pricing on OpenRouter
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.09 |
| Output tokens | $0.45 |
Capabilities
Structured Outputs
About Nemotron 3 Super-120B-A12B
NVIDIA Nemotron 3 Super-120B-A12B is a 120B total / 12B active hybrid Latent MoE model with interleaved Mamba-2 and MoE layers for agentic, reasoning, and conversational tasks. Fireworks lists the NVFP4 variant for on-demand deployment with 262k context.
Model Specs
Released2026-03-11
Parameters120B
Context1.05m
ArchitectureDecoder Only