LLM Reference

Nemotron 4 340B

Released: 2025-02-27
Last refreshed: 2026-05-19
Status: Researched 16d ago
Deprecated · Classification · JSON / Tool use

Nemotron 4 340B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 4k context window
  • Buyers comparing 2 tracked provider routes
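
To judge whether a workload fits the 4k context window, a rough budget check can help. The following is a minimal sketch, not part of any provider SDK: it assumes "4k" means 4,096 tokens and that the prompt and the completion share the same window, neither of which this page confirms. The names `CONTEXT_WINDOW_TOKENS`, `fits_context`, and `max_output_budget` are hypothetical, and token counts are expected to come from whatever tokenizer your integration already uses.

```python
# Assumption: "4k" is taken to mean 4,096 tokens; the page only says "4k".
CONTEXT_WINDOW_TOKENS = 4096


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if prompt plus requested output fit in the window.

    Assumes prompt and completion tokens share one window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


def max_output_budget(prompt_tokens: int,
                      window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Return how many output tokens remain after the prompt (never negative)."""
    if prompt_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return max(0, window - prompt_tokens)
```

Under these assumptions, a 3,000-token prompt leaves room for a 1,000-token reply, while a 3,500-token prompt does not.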

Do not use it for

  • New production launches
  • Vision or document-understanding workloads

Specifications

Released: 2025-02-27
Context: 4k
Parameters: 340B
Architecture: Decoder-only
Specialization: General
Training: Fine-tuned

Created by

NVIDIA AI
Accelerated AI for enterprise solutions
Santa Clara, California, United States
Founded 2015

Pricing

Input / 1M: $4.20
Output / 1M: $4.20

Cheapest of 2 routes · DeepInfra

About

Nemotron 4 340B is a model in NVIDIA AI's Nemotron-4 family. It is deprecated (originally released 2025-02-27); use it only to reproduce earlier results or to evaluate drift over time.

The structured metadata tracks a 4k-token context window and structured outputs. This page tracks provider routes through NVIDIA NIM and DeepInfra; the cheapest tracked route is listed at $4.20 input and $4.20 output per 1M tokens. No headline benchmark score is tracked for Nemotron 4 340B yet.
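
The per-1M-token prices above translate into a per-request cost with simple arithmetic. The sketch below uses the $4.20 input and $4.20 output figures listed for the DeepInfra route on this page; the function name `estimate_cost_usd` and the constant `PRICE_PER_MILLION_USD` are hypothetical, and actual billing may round or meter differently.

```python
# Prices in USD per 1M tokens, taken from the cheapest tracked route on this page.
PRICE_PER_MILLION_USD = {"input": 4.20, "output": 4.20}


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * PRICE_PER_MILLION_USD["input"]
             + output_tokens * PRICE_PER_MILLION_USD["output"])
    return total / 1_000_000
```

For example, a request with 3,000 input tokens and 500 output tokens comes to roughly $0.0147 at these rates.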

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

Provider      Input / 1M   Output / 1M   Route
DeepInfra     $4.20        $4.20         Serverless
NVIDIA NIM    -            -             Provisioned (Partial)
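
The "cheapest of 2 routes" label can be reproduced from the ladder by skipping routes without a listed price. This is a minimal sketch of that selection, assuming an even blend of input and output tokens; the `Route` type, `ROUTES` list, and `cheapest_route` function are hypothetical names for illustration, not how this page actually computes the ranking.

```python
from typing import NamedTuple, Optional, Sequence


class Route(NamedTuple):
    provider: str
    input_per_million: Optional[float]   # USD per 1M input tokens, None if unlisted
    output_per_million: Optional[float]  # USD per 1M output tokens, None if unlisted
    deployment: str


# Values copied from the price ladder; NVIDIA NIM has no listed price.
ROUTES = [
    Route("DeepInfra", 4.20, 4.20, "Serverless"),
    Route("NVIDIA NIM", None, None, "Provisioned"),
]


def cheapest_route(routes: Sequence[Route],
                   input_share: float = 0.5) -> Optional[Route]:
    """Return the priced route with the lowest blended price, or None.

    input_share is the assumed fraction of tokens that are input tokens.
    """
    priced = [r for r in routes
              if r.input_per_million is not None
              and r.output_per_million is not None]
    if not priced:
        return None
    return min(
        priced,
        key=lambda r: input_share * r.input_per_million
        + (1 - input_share) * r.output_per_million,
    )
```

Routes with no listed price are excluded rather than treated as free, so an unpriced provisioned route never wins the comparison by default.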

Capabilities

Structured Outputs

Benchmark peer bars for Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.
