DeepSeek V3 vs DeepSeek R1

Side-by-side comparison of specifications, capabilities, and pricing.

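Released: 2024-12-26 (DeepSeek V3), 2025-01-20 (DeepSeek R1)
Context window: 128K
Parameters: 671B, 37B active
Architecture: mixture of experts (DeepSeek V3), decoder only (DeepSeek R1)
License: Unknown (DeepSeek V3), Unknown (DeepSeek R1)
Knowledge cutoff: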
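
To make the parameter figures above concrete, the short sketch below works out what share of the total parameters is active per token, taking the listed 671B total and 37B active at face value. The function name `active_fraction` is hypothetical and introduced only for illustration; the listing does not say which model column the figures belong to, so the sketch does not attribute them either.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Return the share of parameters active per token (both inputs in billions)."""
    if total_params_b <= 0:
        raise ValueError("total parameter count must be positive")
    return active_params_b / total_params_b


# Figures as listed in the specification rows (assumed to be in billions).
share = active_fraction(671.0, 37.0)
print(f"Active share per token: {share:.1%}")
```

In other words, only a small fraction of the listed parameters is used for any single token, which is the usual motivation for a mixture-of-experts design.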

Capabilities

Multimodal
Function calling
Tool use
JSON mode

Availability

Providers

Benchmarks

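HellaSwag: 95.7 (DeepSeek V3), not listed (DeepSeek R1)
HumanEval: 85.5 (DeepSeek V3), not listed (DeepSeek R1)
Massive Multitask Language Understanding (MMLU): 88.5 (DeepSeek V3)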