LLM ReferenceLLM Reference
All comparisons

Grok-2 vs o3

Side-by-side comparison of specifications, capabilities, and pricing.

Released2024-08-012025-03-31
Context window128K128K
Parameters
Architecturedecoder onlydecoder only
LicenseUnknownUnknown
Knowledge cutoff

Capabilities

Vision
Multimodal
Reasoning
Function calling
Tool use
Structured Outputs
Code execution

Availability

Providers

Benchmarks

HumanEval88.496.7Massive Multitask Language Understanding87.5—Chatbot Arena1255.01412.0SWE-bench Verified—71.7LiveCodeBench—79.1Aider Polyglot—81.3Google-Proof Q&A—87.7Massive Multi-discipline Multimodal Understanding—82.9