LLM Reference
OpenRouter

Using Nemotron 3 Super-120B-A12B on OpenRouter

Implementation guide · Nemotron 3 · NVIDIA AI

ServerlessOpen Weights

Quick Start

  1. 1
    Create an account at OpenRouter and generate an API key.
  2. 2
    Use the OpenRouter SDK or REST API to call nvidia/nemotron-3-super-120b-a12b — see the documentation for request format.
  3. 3
    You'll be billed $0.09/1M input, $0.45/1M output tokens. See full pricing.

Code Examples

See OpenRouter documentation for integration details.

About OpenRouter

OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.

OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.

Pricing on OpenRouter

TypePrice (per 1M)
Input tokens$0.09
Output tokens$0.45

Capabilities

Structured Outputs

About Nemotron 3 Super-120B-A12B

NVIDIA Nemotron 3 Super-120B-A12B is a 120B total / 12B active hybrid Latent MoE model with interleaved Mamba-2 and MoE layers for agentic, reasoning, and conversational tasks. Fireworks lists the NVFP4 variant for on-demand deployment with 262k context.

Model Specs

Released2026-03-11
Parameters120B
Context1.05m
ArchitectureDecoder Only

Provider

OpenRouter
OpenRouter

OpenRouter, Inc.

New York, NY, USA