All comparisons
HellaSwag — 96.2 HumanEval — 92.0 Massive Multitask Language Understanding — 88.7
GPT-4o (08-06) vs Claude 3.5 Sonnet
Side-by-side comparison of specifications, capabilities, and pricing.
| Released | 2024-08-06 | 2024-06-20 |
| Context window | 128K | 200K |
| Parameters | 1.76T (8x222B MoE)* | 70B |
| Architecture | mixture of experts | decoder only |
| License | Proprietary | Unknown |
| Knowledge cutoff | 2023-10 | 2024-04 |
Capabilities | ||
| Multimodal | ||
| Function calling | ||
| Tool use | ||
| JSON mode | ||
Availability | ||
| Providers | ||
Benchmarks