LLM Reference

Nemotron-4 Models by NVIDIA AI

NVIDIA AINVIDIA Open ModelOpen weights
3 models2024–2025Up to 4k ctxFrom $4.2/1M input

Details

ResearcherNVIDIA AI
Commercial useCommercial use allowed
Models3
Released2024–2025
Max context4k

Capabilities

Structured Outputs1 of 3 models

About

The Nemotron-4 340B family consists of large language models (LLMs) that are openly accessible and tailored for synthetic data generation, crucial for training other LLMs 34. This innovative suite includes a base model, instruct model, and reward model, each serving unique purposes. The base model, trained on an extensive 9 trillion token dataset, supports the instruct model in producing diverse synthetic data that emulates real-world scenarios, while the reward model focuses on evaluating and refining outputs for helpfulness and coherence 12. Optimized for NVIDIA's NeMo framework and TensorRT-LLM for inference, these models are designed for both research and commercial use due to their open licensing. Moreover, the fully open-sourced pipeline encourages AI community collaboration and innovation 9.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

2 in view1 retired

Use when the workload needs 4k context and 4B parameters.

2024-094k context4B parameters

Use when the workload needs 4k context and 4B parameters.

2024-084k context4B parameters

Release Timeline

3 release groups
2025-02
1 retired
Nemotron 4 340B
4k context340B parametersstructured outputs
Archived
2024-09
1 current
Nemotron Mini Hindi 4B Instruct
4k context4B parameters
Current
2024-08
1 current
Nemotron Mini 4B Instruct
4k context4B parameters
Current

Specifications(3 models)

Nemotron-4 model specifications comparison
ModelReleasedContextParametersStructured Outputs
Nemotron Mini Hindi 4B Instruct2024-094k4BNo
Nemotron Mini 4B Instruct2024-084k4BNo

Available From(2 providers)

Pricing

Frequently Asked Questions

What is Nemotron-4 used for?
Nemotron-4 is used for structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Nemotron-4 compare to NVIDIA Nemotron Nano 12B v2 VL?
Nemotron-4 by NVIDIA AI is strongest where you need structured outputs, while NVIDIA Nemotron Nano 12B v2 VL by NVIDIA AI is the closest related family to check for structured outputs. Nemotron-4 has 3 listed variants and reaches up to 4k context, so compare the specs and pricing tables before choosing a production model.
Which Nemotron-4 model should I use?
For the lowest listed input price, start with Nemotron 4 340B through DeepInfra at $4.2/1M input tokens. For the most capable/latest local choice, evaluate Nemotron Mini Hindi 4B Instruct with 4k context.