Using Nemotron 3 Ultra on OpenRouter

Implementation guide · Nemotron 3 · NVIDIA AI

ServerlessOpen Weights

Quick Start

1
Create an account at OpenRouter and generate an API key.
2
Use the OpenRouter SDK or REST API to call nvidia/nemotron-3-ultra-550b-a55b — see the documentation for request format.
3
You'll be billed $0.50/1M input, $2.20/1M output tokens. See full pricing.

API Portal Documentation Pricing Model Card

Code Examples

See OpenRouter documentation for integration details.

About OpenRouter

OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.

OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.

View all models on OpenRouter →

Pricing on OpenRouter

Type	Price (per 1M)
Input tokens	$0.50
Output tokens	$2.20

Capabilities

Reasoning

About Nemotron 3 Ultra

NVIDIA's open frontier-reasoning model (550B total / 55B active MoE, hybrid Transformer-Mamba). Highest Artificial Analysis Intelligence Index for any US open model (score: 48). 300+ tokens/second. 1M-token context. Announced at Computex 2026. Pricing: ~$0.60/$2.60 per 1M tokens (provider median); free tier on some providers.

Full model details →

Model Specs

Released2026-06-04

Parameters550B

Context1m

ArchitectureMixture of Experts

More Models on OpenRouter

Nemotron 3 Super-120B-A12B Nemotron 3 Nano Omni

All models on OpenRouter →

Provider

OpenRouter

OpenRouter, Inc.

New York, NY, USA