Grok-3

Name: Grok-3
Author: xAI

Released: 2025-02-17
Last refreshed: 2026-06-30
Status: Researched 43d ago

Deprecated · Proprietary · Commercial use: conditional · Multimodal · Coding · RAG · Agents · Long context · Vision · Classification · JSON / Tool use

Grok-3 is a legacy integration reference; evaluate Grok 4.3 before starting new work.

Use it for

Teams maintaining an existing integration
Workloads that can use a 131k context window
Buyers comparing 3 tracked provider routes

Do not use it for

New production launches

Specifications

Family: Grok 3
Released: 2025-02-17
Context: 131k
Knowledge cutoff: 2025-04
Openness: Proprietary
License: Proprietary
Commercial use: conditional

Created by

xAI

Ethical AI for universal truth-seeking

San Francisco, California, United States

Founded 2023

Pricing

Input / 1M: $0.80
Output / 1M: $2.40

Cheapest of 4 routes · Chutes AI

Providers (4)

OpenRouter, xAI Console, Chutes AI, Microsoft Foundry

This model is deprecated. xAI recommends switching to Grok 4.3.

About

Grok-3 is a 2025 xAI model that is confirmed for retirement on May 15, 2026; use Grok 4.3 for current API work.

Grok-3 is a proprietary model in the Grok 3 family. The structured metadata tracks a 131k-token context window, multimodal input, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through OpenRouter, xAI Console, Chutes AI, and Microsoft Foundry, with the cheapest tracked route listed at $0.80 input and $2.40 output per 1M tokens. Headline tracked benchmarks include Aider Polyglot 53.3, Chatbot Arena 1405.0, and Massive Multi-discipline Multimodal Understanding (MMMU) 78.0.

Top use-case fit: coding, agents, and build tasks

Coding

3 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing for input and output tokens, plus batch and cached reads when available. The table lists the 3 providers with a tracked price; the fourth provider, xAI Console, has no price listed here.

Provider	Input / 1M	Output / 1M	Route
Chutes AI	$0.80	$2.40	Serverless
Microsoft Foundry	$3.00	$15.00	Serverless
OpenRouter	$3.00	$15.00	Serverless
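
As a rough illustration of how the per-1M prices above turn into a bill, the sketch below estimates the cost of a workload on each priced route and picks the cheapest. The function names and the sample token counts are hypothetical, and the prices are copied from the table above, so they may be out of date.

```python
# Per-1M-token prices in USD, copied from the provider price ladder above.
# Format: provider -> (input price, output price)
PRICES = {
    "Chutes AI": (0.80, 2.40),
    "Microsoft Foundry": (3.00, 15.00),
    "OpenRouter": (3.00, 15.00),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload on one provider route."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_route(input_tokens: int, output_tokens: int) -> str:
    """Return the provider with the lowest estimated cost for this workload."""
    return min(PRICES, key=lambda p: estimate_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workload: 100k input tokens and 20k output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 100_000, 20_000):.2f}")
    print("Cheapest:", cheapest_route(100_000, 20_000))
```

The estimate ignores batch discounts, cached reads, and any gateway fees, so treat it as a lower-bound comparison rather than a quote.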

Available via routers & gateways (6)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSS · Microsoft Foundry

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

Passthrough · xAI Console

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

Subscription · Microsoft Foundry

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

Passthrough · Microsoft Foundry

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

Subscription · Microsoft Foundry

Kong AI Gateway

Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.

Subscription · Microsoft Foundry

Capabilities

Vision · Multimodal · Reasoning · Function Calling · Tool Use · Structured Outputs

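Benchmark peer bars for Coding

HumanEval (Rank 4 of 96)

Model	Score
Claude Sonnet 4.6	98.0
(model name not captured)	96.7
Claude Opus 4.6	95.0
Grok-3 (current)	94.5
GPT-5.5	94.2

LiveCodeBench (Rank 32 of 57)

Model	Score
DeepSeek V4 Pro	93.5
Fugu Ultra	93.2
Fugu	92.9
Gemini 3.1 Pro Preview	91.7
Grok-3 (current)	79.4

Aider Polyglot (Rank 22 of 33)

Peer model names were not captured for this chart. Scores shown: 88.0, 88.0, 84.9, 83.1, and 53.3 (Grok-3's tracked score).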

Benchmark scores (8)

Scores are benchmark-specific and direction-aware: each suite has its own scale, so the same numeric gap can mean very different outcomes from one suite to the next. Use the leaderboard context and this model's provider route to decide whether a margin is meaningful for your workload.

Benchmark	Score	Version	Source
Aider Polyglot	53.3	2026-04	https://aider.chat/docs/leaderboards
Chatbot Arena	1405.0	—	https://lmarena.ai
Massive Multi-discipline Multimodal Understanding	78.0	—	https://mmmu-benchmark.github.io/
MMLU PRO	79.9	—	https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro
Google-Proof Q&A	84.6	—	https://x.ai/blog/grok-3
AIME 2025	93.3	—	https://x.ai/blog/grok-3
LiveCodeBench	79.4	—	https://x.ai/blog/grok-3
HumanEval	94.5	—	https://x.ai/blog/grok-3
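
One way to see why the same point gap can mean different things across suites is to compare error rates instead of raw scores. The sketch below is an illustration only: it assumes each score is a 0-100 pass rate where higher is better (which does not hold for rating-style scores such as Chatbot Arena), the helper names are hypothetical, and the peer scores are the top entries from the peer bars on this page.

```python
def error_rate(score_pct: float) -> float:
    """Share of tasks failed, assuming the score is a 0-100 pass rate."""
    return 100.0 - score_pct


def error_ratio(model_score: float, peer_score: float) -> float:
    """How many times more often the model fails than the peer.

    Undefined (division by zero) if the peer scores a perfect 100.
    """
    return error_rate(model_score) / error_rate(peer_score)


if __name__ == "__main__":
    # Grok-3 vs the top listed peer on each suite, using scores from this page.
    pairs = {
        "HumanEval": (94.5, 98.0),      # 3.5-point gap
        "LiveCodeBench": (79.4, 93.5),  # 14.1-point gap
    }
    for suite, (grok3, peer) in pairs.items():
        gap = peer - grok3
        print(f"{suite}: gap {gap:.1f} points, error ratio {error_ratio(grok3, peer):.2f}x")
```

Under these assumptions, the HumanEval gap is about a quarter the size of the LiveCodeBench gap in points, yet the two error ratios land close together (roughly 2.75x and 3.17x), which is the sense in which raw margins are not comparable across suites.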

Migration checks

No linked migration route is available for this model yet.

Popular comparisons (59), sorted by 7-day search impressions

Comparison	7-day search impressions
Grok-3 vs Qwen3.6-27B	127
Grok-3 vs Hunyuan Hy3 Preview	124
Grok-3 vs GLM-5	122
Grok-3 vs Gemini 2.5 Pro	112
Grok-3 vs Qwen3.6-35B-A3B	90
Grok-3 vs o3	79
Grok-3 vs Claude Opus 4.5	79
Grok-3 vs Llama 3.2 1B	72
Grok-3 vs o3 Mini	70
Grok-3 vs Qwen2.5-72B-Instruct	68
Grok-3 vs DeepSeek V4 Flash	64
Grok-3 vs Trinity-Large-Thinking	61
Grok-3 vs Together AI - Llama 3 8B Lite	61
Grok-3 vs Kimi K2.5	58
Grok-3 vs DeepSeek R1 0528	58
Grok-3 vs Llama 3 70B Instruct	52
Grok-3 vs Gemini 2.5 Flash	46
Grok-3 vs Mistral Large 3 675B Instruct	45
Grok-3 vs Qwen3.5-27B	40
Grok-3 vs Llama 3 8B Instruct	37
Grok-3 vs Llama 2 70B Chat	37
Grok-3 vs Qwen3.5-35B-A3B	37
Grok-3 vs DeepSeek V3	36
Grok-3 vs Llama 3.1 405B Instruct	34
Grok-3 vs Claude Opus 4.6	30
Grok-3 vs GPT-5.3-Codex	28
Grok-3 vs GPT-5.4	25
Grok-3 vs Tencent Hunyuan Turbo S	25
Grok-3 vs GPT-5.4-Cyber	22
Grok-3 vs Qwen3.5-122B-A10B	20
Grok-3 vs Claude Sonnet 4.5	20
Grok-3 vs Phi-3 Mini 4k	18
Grok-3 vs GPT-5.2	18
Grok-3 vs Llama 3.2 1B Instruct	17
Grok-3 vs DeepSeek V3.2	15
Grok-3 vs Gemma 7B Instruct	14
Grok-3 vs DeepSeek R1 Lite	14
Grok-3 vs Mixtral 8x7B	13
Grok-3 vs Mixtral 8x22B Instruct v0.3	13
Grok-3 vs Mixtral 8x22B v0.1	13
Grok-3 vs Phi-4 Mini Reasoning	13
Grok-3 vs Code Cushman 001	12
Grok-3 vs Qwen3.6 Max Preview	11
Grok-3 vs o3 Deep Research	11
Grok-3 vs Together AI Qwen2-7B-Instruct	10
Grok-3 vs Code Davinci 001	10
Grok-3 vs Qwen3-Max	5
Grok-3 vs Qwen3-235B-A22B	5
Grok-3 vs Gemini 2.5 Flash Live API	4
Grok-3 vs Code Cushman 002	3
Grok-3 vs Qwen3.5-397B-A17B	3
Grok-3 vs Phi-4 Mini Flash Reasoning	3
Grok-3 vs DeepSeek V3 Base	2
Grok-3 vs Gemini 2.5 Pro Computer Use Preview	2
Grok-3 vs Mistral Nemotron	2
Grok-3 vs Qwen3.5-4B	2
Grok-3 vs GPT-5.4 Pro	1
Grok-3 vs Qwen2-7B-Instruct	1
Grok-3 vs GLM-5V-Turbo	1

Frequently asked questions

What is the context window of Grok-3?

Grok-3 has a context window of 131k tokens.
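
To make the 131k figure concrete, here is a minimal sketch of a pre-flight budget check. It assumes the window is exactly 131,000 tokens (the page only says "131k") and that prompt and output tokens share the same window; the function names are hypothetical, and token counts must come from whatever tokenizer your provider route uses.

```python
# "131k" as listed on this page; the exact limit is an assumption.
CONTEXT_WINDOW = 131_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW) -> bool:
    """Return True if the prompt plus the requested output fits in the window."""
    return prompt_tokens + max_output_tokens <= window


def max_output_budget(prompt_tokens: int, window: int = CONTEXT_WINDOW) -> int:
    """Return how many output tokens remain after the prompt (never negative)."""
    return max(window - prompt_tokens, 0)


if __name__ == "__main__":
    print(fits_context(120_000, 8_000))   # 128,000 tokens requested in total
    print(max_output_budget(130_000))     # tokens left for the response
```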

How much does Grok-3 cost?

Grok-3 input pricing ranges from $0.80 to $3.00 per 1M tokens depending on the provider; output pricing ranges from $2.40 to $15.00 per 1M tokens.

When was Grok-3 released?

Grok-3 was released on 2025-02-17.

Which providers offer Grok-3?

Grok-3 is available from 4 providers: OpenRouter, xAI Console, Chutes AI, and Microsoft Foundry.

What benchmarks has Grok-3 been tested on?

Grok-3 has been evaluated on 8 benchmarks, including Aider Polyglot, Chatbot Arena, Massive Multi-discipline Multimodal Understanding, MMLU PRO, and Google-Proof Q&A.
