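| Benchmark | DeepSeek V3 | GPT-4 Turbo |
| --- | --- | --- |
| HellaSwag | 95.7 | — |
| HumanEval | 85.5 | — |
| Massive Multitask Language Understanding (MMLU) | 88.5 | 86.5 |
| LiveCodeBench | 49.6 | — |
| Aider Polyglot | 48.4 | — |
| BigCodeBench | 50.0 | — |
| Chatbot Arena | 1302.0 | 1242.0 |
| MMLU-Pro | 75.9 | 69.4 |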
DeepSeek V3 vs GPT-4 Turbo
Side-by-side comparison of specifications, capabilities, and pricing.
| Specification | DeepSeek V3 | GPT-4 Turbo |
| --- | --- | --- |
| Released | 2024-12-26 | 2024-04-09 |
| Context window | 64K | 128K |
| Parameters | 671B | 1.76T (8x222B MoE)* |
| Architecture | Mixture of experts | Mixture of experts |
| License | Open Source | Proprietary |
| Knowledge cutoff | 2024-04 | 2023-12 |
| Capabilities | | |
| Vision | | |
| Multimodal | | |
| Reasoning | | |
| Function calling | | |
| Tool use | | |
| Structured outputs | | |
| Code execution | | |
| Availability | | |
| Providers | | |
Benchmarks