Jet-Nemotron Models by NVIDIA AI
2 models · 2025
About
Efficient hybrid language models optimized for cost and latency
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Jet-Nemotron 2B | Use when the workload fits a 2B-parameter model. | 2025-08 | 2B parameters | Current |
| Jet-Nemotron 4B | Use when the workload fits a 4B-parameter model. | 2025-08 | 4B parameters | Current |
Release Timeline
2025-08 (1 release group, 2 current)
- Jet-Nemotron 2B: Current, 2B parameters
- Jet-Nemotron 4B: Current, 4B parameters
Specifications (2 models)
| Model | Released | Parameters |
|---|---|---|
| Jet-Nemotron 2B | 2025-08 | 2B |
| Jet-Nemotron 4B | 2025-08 | 4B |
Frequently Asked Questions
- What is Jet-Nemotron used for?
- Jet-Nemotron is a family of efficient hybrid language models optimized for cost and latency.
- How does Jet-Nemotron compare to NVIDIA Nemotron Nano 12B v2 VL?
- Jet-Nemotron is strongest for its listed use case: efficient language modeling where cost and latency matter. NVIDIA Nemotron Nano 12B v2 VL, also from NVIDIA AI, is the closest related family and the one to check if you need structured outputs. Jet-Nemotron has 2 listed variants, so compare their specifications and any available pricing before choosing a production model.
- Which Jet-Nemotron model should I use?
- If price is the main constraint, check provider pricing first, because the local data does not include complete provider pricing for Jet-Nemotron. For the most capable or latest choice in the local data, evaluate Jet-Nemotron 2B.




