LLM ReferenceLLM Reference

Jet-Nemotron Models by NVIDIA AI

2 models2025

About

Efficient hybrid language models optimized for cost and latency

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

2 in view

Use when the workload needs 2B parameters.

2025-082B parameters

Use when the workload needs 4B parameters.

2025-084B parameters

Release Timeline

1 release group
2025-08
2 current
Jet-Nemotron 2B
2B parameters
Current
Jet-Nemotron 4B
4B parameters
Current

Specifications(2 models)

Jet-Nemotron model specifications comparison
ModelReleasedParameters
Jet-Nemotron 2B2025-082B
Jet-Nemotron 4B2025-084B

Frequently Asked Questions

What is Jet-Nemotron used for?
Efficient hybrid language models optimized for cost and latency
How does Jet-Nemotron compare to NVIDIA Nemotron Nano 12B v2 VL?
Jet-Nemotron by NVIDIA AI is strongest where you need its listed use cases, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Jet-Nemotron has 2 listed variants, so compare the specs and pricing tables before choosing a production model.
Which Jet-Nemotron model should I use?
If price is the main constraint, use the pricing table first because Jet-Nemotron does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Jet-Nemotron 2B.

Models(2)