DeepSeek V3 vs Qwen2.5 72B

Side-by-side comparison of specifications, capabilities, and pricing.

	DeepSeek V3 DeepSeek	Qwen2.5 72B Alibaba
Released	2024-12-26	2024-06-07
Context window	64k	128K
Parameters	671B	72.7B
Architecture	mixture of experts	decoder only
License	Open Source	Apache 2.0
Knowledge cutoff	2024-04	—
Capabilities
Vision
Multimodal
Reasoning
Function calling
Tool use
Structured Outputs
Code execution
Availability
Providers	DeepInfra Fireworks AI DeepSeek Platform Microsoft Foundry OpenRouter NVIDIA NIM AWS Bedrock Together AI Bitdeer AI SiliconFlow Replicate API	Fireworks AI Bitdeer AI

Benchmarks

HellaSwag95.795.6HumanEval85.592.7Massive Multitask Language Understanding88.588.2LiveCodeBench49.6—Aider Polyglot48.4—BigCodeBench50.0—Chatbot Arena1302.0—MMLU PRO75.9—Google-Proof Q&A—65.4

Full DeepSeek V3 specs →Full Qwen2.5 72B specs →