LLM Reference

Nemotron 3 Ultra

Released: 2026-06-04
Last refreshed: 2026-06-15
Status: Researched 21d ago
Open weights · Commercial use: permitted · Long context

Nemotron 3 Ultra is worth evaluating for long-context workloads when its single tracked provider route (OpenRouter) and its 1M-token context window match the workload.

Use it for

  • Teams evaluating long context
  • Workloads that can use a 1M-token context window
  • Buyers who can work with a single tracked provider route (OpenRouter)

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows

Specifications

Released: 2026-06-04
Context: 1M tokens
Parameters: 550B
Architecture: Mixture of Experts
Specialization: general
Openness: Open weights
License: NVIDIA Open Model · Commercial use: permitted

Created by

NVIDIA
Accelerated AI for enterprise solutions

Santa Clara, California, United States
Founded 1993

Pricing

Input / 1M tokens: $0.50
Output / 1M tokens: $2.20

Cheapest of 1 route · OpenRouter

About

Nemotron 3 Ultra is NVIDIA's open frontier-reasoning model: a hybrid Transformer-Mamba Mixture of Experts with 550B total and 55B active parameters and a 1M-token context window. It was announced at Computex 2026. It is reported to score 48 on the Artificial Analysis Intelligence Index, the highest for any US open model, and to generate 300+ tokens per second. The provider-median price is about $0.60 input / $2.60 output per 1M tokens, with a free tier on some providers.

Nemotron 3 Ultra is an open-weight model in the Nemotron 3 family. The structured metadata lists a 1M-token context window and a reasoning capability. This page tracks one provider route, through OpenRouter, priced at $0.50 input and $2.20 output per 1M tokens. The structured metadata does not yet track a headline benchmark score for Nemotron 3 Ultra.

Top use-case fit

Long context

Included on the basis of capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing for input and output tokens across the 1 tracked provider, plus batch and cached-read pricing when available.

Provider     Input / 1M   Output / 1M   Route
OpenRouter   $0.50        $2.20         Serverless
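
To turn the per-million rates above into a per-request figure, a minimal sketch follows. It assumes the listed OpenRouter rates ($0.50 input, $2.20 output per 1M tokens) apply linearly with no batch or cached-read discount; the function name `estimate_cost_usd` and the example token counts are hypothetical, chosen only for illustration.

```python
# Rates copied from the tracked OpenRouter route on this page (USD per 1M tokens).
# Assumption: billing is linear in tokens, with no batch or cached-read discount.
INPUT_USD_PER_MILLION = 0.50
OUTPUT_USD_PER_MILLION = 2.20


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_USD_PER_MILLION / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_MILLION / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical long-context request: 800k prompt tokens, 4k completion tokens.
    print(f"${estimate_cost_usd(800_000, 4_000):.4f}")
```

At these rates a long-context request is dominated by input cost: the 800,000-token prompt in the example accounts for $0.40 of the roughly $0.41 total.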

Capabilities

Reasoning

Benchmark peer bars for Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Nemotron 3 Ultra?

Nemotron 3 Ultra has a context window of 1M tokens.
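
A quick way to check whether a workload fits that window is sketched below. It rests on two assumptions this page does not confirm: that "1m" means exactly 1,000,000 tokens, and that the prompt and the generated output share the same window. The function name `fits_context` is hypothetical, and token counts should come from the tokenizer of whichever provider serves the model.

```python
# Assumption: "1m" on this page means exactly 1,000,000 tokens.
CONTEXT_WINDOW_TOKENS = 1_000_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if the prompt plus the reserved output budget fits the window.

    Assumption: input and output tokens count against one shared window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


if __name__ == "__main__":
    # Hypothetical workloads: the first leaves headroom, the second overflows.
    print(fits_context(900_000, 50_000))
    print(fits_context(990_000, 20_000))
```

Reserving the output budget up front avoids a request that is accepted at submission but truncated during generation.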

How much does Nemotron 3 Ultra cost?

Nemotron 3 Ultra is available through OpenRouter at $0.50 per 1M input tokens and $2.20 per 1M output tokens.

When was Nemotron 3 Ultra released?

Nemotron 3 Ultra was released on 2026-06-04.

Which providers offer Nemotron 3 Ultra?

Nemotron 3 Ultra is available from 1 provider: OpenRouter.