Llama 3.3 Nemotron Super 49B v1
llama-3.3-nemotron-super-49b-v1
Last refreshed 2026-05-14. Next refresh: weekly.
Llama 3.3 Nemotron Super 49B v1 is worth evaluating for long-context workloads when its single tracked provider route (NVIDIA NIM) and 128K-token context window match the workload.
Decision context: Long context task fit, 1 tracked provider route, and research from 2026-01-01.
Use it for
- Teams evaluating long context
- Workloads that can use a 128K context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Cheapest output
-
NVIDIA NIM per 1M tokens
Provider routes
1
Tracked API hosts
Quality / dollar
Unknown
No task benchmark coverage yet
Freshness
2026-01-01
Researched 137d ago
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| NVIDIA NIM | - | - | Serverless (Partial) |
Benchmark peer bars for Long context
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
About
First version of NVIDIA Nemotron Super 49B, a pruned and distilled Llama 3.3 model.
Llama 3.3 Nemotron Super 49B v1 has a 128K-token context window.
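A quick pre-flight check can show whether a workload plausibly fits the 128K-token window before any request is sent. The sketch below is illustrative only: it assumes "128K" means 128 × 1024 tokens (the page does not say which convention applies), estimates tokens with a crude 4-characters-per-token heuristic rather than the model's real tokenizer, and assumes the window is shared between the prompt and the generated output. The function names and the default output reserve are hypothetical.

```python
# Assumption: "128K" is read as 128 * 1024 tokens; the page does not specify.
CONTEXT_WINDOW_TOKENS = 128 * 1024

# Assumption: roughly 4 characters per token, a crude heuristic for English text.
# The model's actual tokenizer will give different counts.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Rough token estimate from character count (ceiling division)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_context(prompt: str, reserved_output_tokens: int = 4096) -> bool:
    """True if the estimated prompt plus the reserved output budget fits the window."""
    return estimate_tokens(prompt) + reserved_output_tokens <= CONTEXT_WINDOW_TOKENS
```

For a real decision, swap the heuristic for a count from the model's own tokenizer and leave some headroom, since the estimate can be off by a wide margin for code or non-English text.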
Capabilities
No model capability flags are currently sourced.
Specifications
Created by
NVIDIA
Accelerated AI for enterprise solutions