DeepSeek V3
DeepSeek V3 is worth evaluating for coding, agents, and classification when its provider route and context window match the workload.
Use it for
- Teams evaluating coding, agents, and classification
- Workloads that can use a 64k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Family
- DeepSeek V3
- Released
- 2024-12-26
- Context
- 64k
- Parameters
- 671B
- Architecture
- Mixture of Experts
- Knowledge cutoff
- 2024-04
- Specialization
- general
- Training
- finetuned
Cheapest of 13 routes · DeepSeek Platform
About
DeepSeek V3: Latest flagship model. 685B total with MoE. 128K context. Open-source.
DeepSeek V3 is an open-source model. The structured metadata tracks a 64k-token context window, function calling, tool use, and structured outputs. This page tracks provider routes through DeepInfra, Fireworks AI, DeepSeek Platform, and 10 more, with the cheapest tracked route listed at $0.1 input and $0.3 output per 1M tokens. Headline tracked benchmarks include HellaSwag 95.7, HumanEval 85.5, and Massive Multitask Language Understanding 88.5.
Top use-case fit: coding, agents, and build tasks
Coding
Q/$ B4 relevant benchmarks in the decision map.
Agents
Included by capability and metadata signals in the decision map.
Classification
Q/$ B3 relevant benchmarks in the decision map.
Provider price ladder
Compare all 13Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| DeepSeek Platform | $0.140 | $0.280 | Serverless |
| Bitdeer AI | $0.100 | $0.300 | Serverless |
| OpenRouter | $0.252 | $0.378 | Serverless |
| SiliconFlow | $0.150 | $0.500 | Serverless |
Capabilities
Benchmark peer barsfor Coding
Benchmark scores(9)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| HellaSwag | 95.7 | 10-shot | https://arxiv.org/abs/2412.19437 |
| HumanEval | 85.5 | pass@1 | https://arxiv.org/abs/2412.19437 |
| Massive Multitask Language Understanding | 88.5 | 5-shot | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard |
| LiveCodeBench | 49.6 | 2026-04 | https://livecodebench.github.io/performances_generation.json |
| Aider Polyglot | 48.4 | 2026-04 | https://aider.chat/docs/leaderboards |
| BigCodeBench | 50.0 | 2025-01 (Instruct Pass@1) | https://bigcode-bench.github.io/results.json |
| Chatbot Arena | 1302.0 | — | https://lmarena.ai |
| MMLU PRO | 75.9 | — | https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro |
| Mostly Basic Programming Problems+ | 76.0 | — | https://evalplus.github.io/leaderboard.html |
Migration checks
No linked migration route is available for this model yet.