Nemotron 4 340B

Name: Nemotron 4 340B
Author: NVIDIA AI

Released: 2025-02-27
Last refreshed: 2026-06-15
Status: Researched 60d ago

Deprecated · Open weights · Commercial use: permitted · Classification · JSON / Tool use

Nemotron 4 340B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

Teams maintaining an existing integration
Workloads that can use a 4k context window
Buyers comparing 2 tracked provider routes

Do not use it for

New production launches
Vision or document-understanding workloads

Specifications

Family: Nemotron-4
Released: 2025-02-27
Context: 4k
Parameters: 340B
Architecture: Decoder Only
Specialization: general
Openness: Open weights
License: NVIDIA Open Model · Commercial use: permitted
Weights: Unknown
Code: Unknown
Training: Fine-tuned

Created by

NVIDIA AI

Accelerated AI for enterprise solutions

Santa Clara, California, United States

Founded 2015

Pricing

Output / 1M: $4.20
Input / 1M: $4.20

Cheapest of 2 routes · DeepInfra

Providers (2)

NVIDIA NIM, DeepInfra

About

Nemotron 4 340B is NVIDIA AI's Nemotron-4 model. It is deprecated (originally released 2025-02-27); use it only for reproducing earlier results or evaluating drift over time.

Nemotron 4 340B is an open-weight model in the Nemotron-4 family. Its tracked metadata lists a 4k-token context window and support for structured outputs. This page tracks two provider routes, NVIDIA NIM and DeepInfra; the cheapest tracked route (DeepInfra) is listed at $4.20 input and $4.20 output per 1M tokens. No headline benchmark score is tracked for Nemotron 4 340B yet.
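As a rough illustration of what the listed per-token prices mean for a single request, the arithmetic can be sketched as below. The prices are the DeepInfra figures tracked on this page; the function name `estimate_cost`, the reading of "4k" as 4,096 tokens, and the assumption that input and output tokens share one context window are illustrative choices, not details confirmed by either provider.

```python
# Prices from the tracked DeepInfra route on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 4.20
OUTPUT_PRICE_PER_M = 4.20

# Assumption: "4k" is read as 4,096 tokens, shared by prompt and completion.
CONTEXT_WINDOW = 4096


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request would exceed the assumed 4,096-token window")
    total = (input_tokens * INPUT_PRICE_PER_M
             + output_tokens * OUTPUT_PRICE_PER_M)
    return total / 1_000_000


if __name__ == "__main__":
    # A request that nearly fills the window: 3,000 prompt + 1,000 completion tokens.
    print(f"${estimate_cost(3000, 1000):.4f}")  # prints "$0.0168"
```

Because input and output are priced identically on this route, the cost depends only on the total token count; on a route with asymmetric pricing the two terms would need to be kept separate.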

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
DeepInfra	$4.20	$4.20	Serverless
NVIDIA NIM	-	-	Provisioned · Partial

Available via routers & gateways (1)

NVIDIA LLM Router Blueprint

Router

NVIDIA's open-source AI blueprint for LLM routing, which selects a model for each prompt via intent classification or neural auto-routing. It is scheduled for deprecation on 2026-06-20.

Free OSS · NVIDIA NIM