Nemotron 4 340B
Nemotron 4 340B is a legacy integration reference; keep it only while you identify a current replacement.
Use it for
- Teams maintaining an existing integration
- Workloads that can use a 4k context window
- Buyers comparing 2 tracked provider routes
Do not use it for
- New production launches
- Vision or document-understanding workloads
- Family
- Nemotron-4
- Released
- 2025-02-27
- Context
- 4k
- Parameters
- 340B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
Cheapest of 2 routes · DeepInfra
About
Nemotron 4 340B is NVIDIA AI's Nemotron-4 model. It is deprecated (originally released 2025-02-27); use it only for reproducing earlier results or evaluating drift over time.
Nemotron 4 340B is a model in the Nemotron-4 family. The structured metadata tracks a 4k-token context window and structured outputs. This page tracks provider routes through NVIDIA NIM and DeepInfra, with the cheapest tracked route listed at $4.2 input and $4.2 output per 1M tokens. No headline benchmark score is tracked for Nemotron 4 340B yet.
Top use-case fit
Classification
Included by capability and metadata signals in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare all 2Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| DeepInfra | $4.20 | $4.20 | Serverless |
| NVIDIA NIM | - | - | ProvisionedPartial |
Capabilities
Benchmark peer barsfor Classification
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.