LLM Reference

Grok 4 Heavy

grok-4-heavy

Researched 1d ago

Last refreshed 2026-05-20. Next refresh: weekly.

ProprietaryMultimodalCodingLong contextVision

Grok 4 Heavy has model metadata, but missing tracked provider pricing keeps it from being a default production pick.

Decision context: Coding task fit, 0 tracked provider routes, and research from 2026-05-20.

Use it for

  • Teams evaluating coding, long context, and vision
  • Workloads that can use a 256k context window

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Strict JSON or tool-calling flows
  • Teams that need a tracked hosted API route today

Cheapest output

-

No tracked output price

Provider routes

0

No provider route in seed

Quality / dollar

Unknown

No output-token price in the ladder

Freshness

2026-05-20

Researched 1d ago

fresh

Top use-case fit

Coding

1 relevant benchmark in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Benchmark peer barsfor Coding

Migration checks

No linked migration route is available for this model yet.

About

Grok 4 Heavy is xAI's Grok 4 model with multimodal text and image input. It offers a 256K-token context window.

Grok 4 Heavy has a 256K-token context window.

Capabilities

Multimodal

Benchmark Scores(1)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
SWE-bench Pro39.8DAT-1778

Rankings

Specifications

FamilyGrok 4
Released2025-07-09
Context256k
Knowledge cutoff2024-11

Created by

Ethical AI for universal truth-seeking

San Francisco, California, United States
Founded 2023
Website