All comparisons
HellaSwag 95.7 — HumanEval 85.5 — Massive Multitask Language Understanding 88.5 —
DeepSeek V3 vs DeepSeek R1
Side-by-side comparison of specifications, capabilities, and pricing.
| Released | 2024-12-26 | 2025-01-20 |
| Context window | — | 128K |
| Parameters | — | 671B, 37B Active |
| Architecture | mixture of experts | decoder only |
| License | Unknown | Unknown |
| Knowledge cutoff | — | — |
Capabilities | ||
| Multimodal | ||
| Function calling | ||
| Tool use | ||
| JSON mode | ||
Availability | ||
| Providers | ||
Benchmarks