Using Nemotron 3 Ultra on OpenRouter
Implementation guide · Nemotron 3 · NVIDIA AI
ServerlessOpen Weights
Quick Start
- 1
- 2Use the OpenRouter SDK or REST API to call
nvidia/nemotron-3-ultra-550b-a55b— see the documentation for request format. - 3
Code Examples
About OpenRouter
OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.
OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.
Pricing on OpenRouter
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.50 |
| Output tokens | $2.20 |
Capabilities
Reasoning
About Nemotron 3 Ultra
NVIDIA's open frontier-reasoning model (550B total / 55B active MoE, hybrid Transformer-Mamba). Highest Artificial Analysis Intelligence Index for any US open model (score: 48). 300+ tokens/second. 1M-token context. Announced at Computex 2026. Pricing: ~$0.60/$2.60 per 1M tokens (provider median); free tier on some providers.
Model Specs
Released2026-06-04
Parameters550B
Context1m
ArchitectureMixture of Experts