All comparisons
HellaSwag 96.2 — HumanEval 92.0 88.4 Massive Multitask Language Understanding 88.7 87.5
Claude 3.5 Sonnet vs Grok-2
Side-by-side comparison of specifications, capabilities, and pricing.
| Released | 2024-06-20 | 2024-08-01 |
| Context window | 200K | — |
| Parameters | 70B | — |
| Architecture | decoder only | decoder only |
| License | Unknown | Unknown |
| Knowledge cutoff | 2024-04 | — |
Capabilities | ||
| Multimodal | ||
| Function calling | ||
| Tool use | ||
| JSON mode | ||
Availability | ||
| Providers | — | |
Benchmarks