LLM Reference

DeepSeek V4 Models by DeepSeek

DeepSeekMITOpen source
2 models2026Up to 1m ctxFrom $0.0983/1M input

Details

ResearcherDeepSeek
LicenseMITOSI-approved
Commercial useCommercial use: permitted
Models2
Released2026
Max context1m

Capabilities

ReasoningAll models
Function CallingAll models
Tool UseAll models
Structured OutputsAll models

Links

Website

About

DeepSeek V4 is the April 2026 DeepSeek release for long-context reasoning and coding. The family has two models: DeepSeek V4 Pro, a 1.6T-parameter MoE with 49B active parameters, a 1M-token context window, and the family's highest SWE-bench Verified score at 80.6, and DeepSeek V4 Flash, a 284B MoE with 13B active parameters for lower-cost inference. Direct DeepSeek pricing starts at $0.435 / $0.87 per million input / output tokens for Pro and $0.14 / $0.28 for Flash.

Compare Against Top Competitors

Long-context coding and reasoning set for comparing DeepSeek V4 Pro and Flash against Kimi, Claude, GLM, and GPT-5.Scores come from existing benchmark seed data; "-" means this site has no local score for that benchmark yet.

DeepSeek V4 flagship benchmark comparison against top competitor models
ModelContextInput / 1MChatbot ArenaSWE-bench VerifiedLiveCodeBenchGPQA
DeepSeek V4 Profamily pick1m$0.435/1M1,46080.693.590.1
DeepSeek V4 Flash1m$0.0983/1M-7991.688.1
Kimi K2.6262k-1,46280.289.690.5
Claude Sonnet 4.61m-1,45979.68089.9
GLM-5.1200k-1,472--86.2
GPT-5.51.05m-1,48882.6-93.6

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

2 in view

Use when the workload needs 1m context, 284B parameters, and reasoning.

2026-041m context284B parametersreasoning

Use when the workload needs 1m context, 1600B parameters, and reasoning.

2026-041m context1600B parametersreasoning

Release Timeline

1 release group
2026-04
2 current
DeepSeek V4 Flash
1m context284B parametersreasoning
Current
DeepSeek V4 Pro
1m context1600B parametersreasoning
Current

Specifications(2 models)

DeepSeek V4 model specifications comparison
ModelReleasedContextParametersReasoningFn CallingTool UseStructured Outputs
DeepSeek V4 Flash2026-041m284BYesYesYesYes
DeepSeek V4 Pro2026-041m1.6TYesYesYesYes

Pricing

DeepSeek V4 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
DeepSeek V4 FlashOpenRouter$0.0983$0.1966Serverless
DeepSeek V4 FlashDeepSeek Platform$0.14$0.28Serverless
DeepSeek V4 FlashVercel AI Gateway$0.14$0.28Serverless
DeepSeek V4 FlashNovita AI$0.14$0.28Serverless
DeepSeek V4 ProDeepSeek Platform$0.435$0.87Serverless
DeepSeek V4 ProVercel AI Gateway$0.435$0.87Serverless
DeepSeek V4 ProOpenRouter$0.44$0.87Serverless
DeepSeek V4 ProNovita AI$1.64$3.38Serverless
DeepSeek V4 ProFireworks AI$1.74$3.48Serverless

Frequently Asked Questions

What is DeepSeek V4 used for?
DeepSeek V4 is used for reasoning, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does DeepSeek V4 compare to Janus?
DeepSeek V4 by DeepSeek is strongest where you need reasoning, while Janus by DeepSeek is the closest related family to check for image generation. DeepSeek V4 has 2 listed variants and reaches up to 1m context, so compare the specs and pricing tables before choosing a production model.
Which DeepSeek V4 model should I use?
For the lowest listed input price, start with DeepSeek V4 Flash through OpenRouter at $0.0983/1M input tokens. For the most capable/latest local choice, evaluate DeepSeek V4 Flash with 1m context and reasoning, tool use, function calling, and structured outputs.