Grok 4 Heavy
Grok 4 Heavy is a released coding, long context, and vision model with 256k context; evaluate it while provider pricing coverage matures.
Use it for
- Teams evaluating coding, long context, and vision
- Workloads that can use a 256k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Strict JSON or tool-calling flows
- Teams that need a tracked hosted API route today
- Family
- Grok 4
- Released
- 2025-07-09
- Context
- 256k
- Knowledge cutoff
- 2024-11
- Openness
- Proprietary
- License
- ProprietaryCommercial use: conditional
- Weights
- Not released
- Code
- Unknown
No tracked provider token pricing is available yet.
About
Grok 4 Heavy is xAI's Grok 4 model with multimodal text and image input. It offers a 256K-token context window.
Grok 4 Heavy is a proprietary model in the Grok 4 family. The structured metadata tracks a 256k-token context window and multimodal input. Headline tracked benchmarks include SWE-bench Pro 39.8.
Top use-case fit: coding, agents, and build tasks
Coding
1 relevant benchmark in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Vision
Included by capability and metadata signals in the decision map.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
Benchmark peer barsfor Coding
Benchmark scores(1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| SWE-bench Pro | 39.8 | — | https://labs.scale.com/leaderboard/swe_bench_pro_public |
Migration checks
No linked migration route is available for this model yet.
Compare Grok 4 Heavy with other models
Frequently asked questions
What is the context window of Grok 4 Heavy?
Grok 4 Heavy has a context window of 256k tokens.
When was Grok 4 Heavy released?
Grok 4 Heavy was released on 2025-07-09.
What benchmarks has Grok 4 Heavy been tested on?
Grok 4 Heavy has been evaluated on 1 benchmark, including SWE-bench Pro.