HellaSwag — 96.2 HumanEval — 92.0 Massive Multitask Language Understanding — 88.7
Claude Sonnet 4.6 vs Claude 3.5 Sonnet
Side-by-side comparison of specifications, capabilities, and pricing.
| Specification | Claude Sonnet 4.6 | Claude 3.5 Sonnet |
| --- | --- | --- |
| Released | 2026-02-17 | 2024-06-20 |
| Context window | 200K | 200K |
| Parameters | — | 70B |
| Architecture | Decoder-only | Decoder-only |
| License | Proprietary | Unknown |
| Knowledge cutoff | 2025-12 | 2024-04 |
| Capabilities | | |
| Multimodal | | |
| Function calling | | |
| Tool use | | |
| JSON mode | | |
| Availability | | |
| Providers | | |
Benchmarks