Grok 4.20 Reasoning

Name: Grok 4.20 Reasoning
Author: xAI

Released

2026-03-10

Last refreshed

2026-06-29

Status

Researched 41d ago

ProprietaryCommercial use: conditionalMultimodalRAGAgentsLong contextVisionJSON / Tool use

Grok 4.20 Reasoning is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Use it for

Teams evaluating rag, agents, and long context
Workloads that can use a 1m context window
Buyers comparing 2 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Grok 4
Released: 2026-03-10
Context: 1m
Knowledge cutoff: 2024-11
Specialization: reasoning
Openness: Proprietary
License: ProprietaryCommercial use: conditional

Created by

xAI

Ethical AI for universal truth-seeking

San Francisco, California, United States

Founded 2023

Website

Pricing

Output / 1M

$2.50

Input / 1M

$1.25

Cheapest of 2 routes · Vercel AI Gateway · cache read $0.200

Providers(2)

xAI Console Vercel AI Gateway

View 2 provider routes

Links

Website

About

Grok 4.20 Reasoning is the xAI API reasoning variant launched around March 10, 2026 as grok-4.20-0309-reasoning. The prior May 2026 seed date was a placeholder; this model was already available months earlier and remains active.

Grok 4.20 Reasoning is a proprietary model in the Grok 4 family. The structured metadata tracks a 1m-token context window, multimodal input, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through xAI Console and Vercel AI Gateway, with the cheapest tracked route listed at $1.25 input and $2.5 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 88.5 and Humanity's Last Exam 30.0.

Top use-case fit: coding, agents, and build tasks

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Compare API pricing across 2 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Cache	Route
Vercel AI Gateway	$1.25	$2.50	read $0.200	Serverless
xAI Console	$1.25	$2.50	-	Serverless

Available via routers & gateways(1)

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughxAI Console

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured Outputs

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Benchmark scores(2)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
Google-Proof Q&A	88.5	GPQA Diamond (accuracy)	https://designforonline.com/ai-models/xai-grok-4-20-0309-reasoning/
Humanity's Last Exam	30.0	HLE for Grok 4 (accuracy)	https://designforonline.com/ai-models/xai-grok-4-20-0309-reasoning/