Claude Sonnet 4.6

Name: Claude Sonnet 4.6
Author: Anthropic

Released: 2026-02-17
Last refreshed: 2026-07-26
Status: Researched 50d ago

Proprietary · Commercial use: conditional · Multimodal · Coding · RAG · Agents · Long context · Vision · Classification · JSON / Tool use · Highlight

Claude Sonnet 4.6 is worth evaluating for coding, RAG, and agent workloads when its provider route and context window match the workload.

Use it for

Teams evaluating coding, RAG, and agents
Workloads that can use a 1M-token context window
Buyers comparing the 6 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Claude 4.6
Released: 2026-02-17
Context: 1M tokens
Max output: 64,000 tokens
Architecture: Decoder-only
Knowledge cutoff: 2025-08
Specialization: general
Openness: Proprietary
License: Proprietary (commercial use: conditional)
Weights: Not released
Code: Unknown
Training: Fine-tuned
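
The context and max-output limits above lend themselves to a simple pre-flight check before a request is sent. The sketch below is illustrative only: the function name `check_request` is hypothetical, reading "1m" as exactly 1,000,000 tokens is an assumption, and so is the rule that prompt and output tokens share one context window. Token counts are expected to come from whatever tokenizer or counting endpoint the chosen provider route offers.

```python
# Limits copied from the specifications above. Reading "1m" as exactly
# 1,000,000 tokens is an assumption; the listing does not spell it out.
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 64_000


def check_request(prompt_tokens: int, requested_output_tokens: int) -> list[str]:
    """Return a list of problems; an empty list means the request fits."""
    problems = []
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        problems.append(
            f"requested output {requested_output_tokens} exceeds {MAX_OUTPUT_TOKENS}"
        )
    # Assumption: prompt and generated output count against the same window.
    total = prompt_tokens + requested_output_tokens
    if total > CONTEXT_WINDOW_TOKENS:
        problems.append(f"prompt + output {total} exceeds {CONTEXT_WINDOW_TOKENS}")
    return problems


if __name__ == "__main__":
    print(check_request(900_000, 64_000))  # fits under both assumed limits
    print(check_request(950_000, 64_000))  # overflows the assumed window
```

Returning a list of problems rather than a boolean keeps the check usable in logs and error messages when more than one limit is exceeded.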

Created by

Anthropic

Developing safe and ethical AI systems.

San Francisco, California, United States

Founded 2021


Pricing

Output / 1M

$15.00

Input / 1M

$3.00

Cheapest of 6 routes · Anthropic · cache read $0.300

Providers (6)

OpenRouter, Anthropic, AWS Bedrock, GCP Vertex AI, Microsoft Foundry, Vercel AI Gateway



About

Claude Sonnet 4.6 is Anthropic's best combination of speed and intelligence. It is a proprietary decoder-only model with a 1M-token context window, a 64K-token maximum output, multimodal vision, extended thinking, and function calling. It is available via the Anthropic API, AWS Bedrock, GCP Vertex AI, and OpenRouter at $3 per 1M input tokens and $15 per 1M output tokens.

Claude Sonnet 4.6 is a proprietary model in the Claude 4.6 family. The structured metadata tracks a 1M-token context window, multimodal input, reasoning, function calling, tool use, structured outputs, and code execution. This page tracks provider routes through OpenRouter, Anthropic, AWS Bedrock, and 3 more, with the cheapest tracked route listed at $3 input and $15 output per 1M tokens. Headline tracked benchmarks include SWE-bench Verified (79.6), Terminal-Bench 2.0 (59.1), and SWE-bench Multilingual (75.9).

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

3 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ D

3 relevant benchmarks in the decision map.

Provider price ladder


Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Batch in / out	Cache	Route
Anthropic	$3.00	$15.00	$1.50 / $7.50	read $0.300 / 5m $3.75 / 1h $6.00	Serverless
AWS Bedrock	$3.00	$15.00	-	-	Serverless
GCP Vertex AI	$3.00	$15.00	-	-	Serverless
Microsoft Foundry	$3.00	$15.00	-	read $0.300 / 5m $3.75 / 1h $6.00	Serverless
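
The prices in the table can be turned into a rough per-request cost estimate. The sketch below uses only the Anthropic row; the function names `estimate_cost` and `break_even_reads` are hypothetical, and two readings of the table are assumptions rather than facts stated on this page: that the "5m" and "1h" figures are cache-write prices for 5-minute and 1-hour cache lifetimes, and that cached-read tokens are billed separately from regular input tokens. How batch and cache prices combine is not stated, so the sketch refuses that combination instead of guessing.

```python
import math

# USD per 1M tokens, copied from the Anthropic row of the table above.
INPUT, OUTPUT = 3.00, 15.00
BATCH_INPUT, BATCH_OUTPUT = 1.50, 7.50
# Assumption: "5m" and "1h" are cache-write prices for two cache lifetimes.
CACHE_READ, CACHE_WRITE_5M, CACHE_WRITE_1H = 0.30, 3.75, 6.00


def estimate_cost(input_tokens, output_tokens, cached_read_tokens=0, batch=False):
    """Estimate the USD cost of one request.

    Assumption: cached_read_tokens are counted separately from input_tokens.
    """
    if batch and cached_read_tokens:
        raise ValueError("the table does not say how batch and cache prices combine")
    in_rate, out_rate = (BATCH_INPUT, BATCH_OUTPUT) if batch else (INPUT, OUTPUT)
    total = (
        input_tokens * in_rate
        + output_tokens * out_rate
        + cached_read_tokens * CACHE_READ
    )
    return total / 1_000_000


def break_even_reads(write_price, read_price=CACHE_READ, base_price=INPUT):
    """Smallest number of later cache reads that makes caching a prefix cheaper.

    Caching costs write + n * read; not caching costs (n + 1) * base.
    """
    return math.floor((write_price - base_price) / (base_price - read_price)) + 1


if __name__ == "__main__":
    print(estimate_cost(1_000_000, 100_000))
    print(estimate_cost(1_000_000, 100_000, batch=True))
    print(break_even_reads(CACHE_WRITE_5M), break_even_reads(CACHE_WRITE_1H))
```

Under these assumptions, a 5m cache write pays for itself after one later read of the same prefix (3.75 + 0.30 versus 2 × 3.00), and a 1h write after two reads (6.00 + 2 × 0.30 versus 3 × 3.00).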

Available via routers & gateways (16)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSS · Anthropic · GCP Vertex AI · Microsoft Foundry

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

Passthrough · Anthropic · GCP Vertex AI

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

Subscription · Anthropic · GCP Vertex AI · Microsoft Foundry

AIRouter

Router

Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.

Passthrough + fee · Anthropic · GCP Vertex AI

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

Passthrough · AWS Bedrock

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

Passthrough · Microsoft Foundry

Capabilities

Vision · Multimodal · Reasoning · Function Calling · Tool Use · Structured Outputs · Code Execution · Prompt Caching · Batch API

Benchmark peer bars for Coding

SWE-bench Verified (Rank 19 of 81)

Model	Score
Claude Fable 5	96.0
Claude Opus 5	96.0
Claude Mythos Preview	93.9
Claude Opus 4.8	88.6
Claude Sonnet 4.6 (current)	79.6

HumanEval (Rank 1 of 97)

Model	Score
Claude Sonnet 4.6 (current)	98.0
-	96.7
Claude Opus 4.6	95.0
Grok-3	94.5
GPT-5.5	94.2

LiveCodeBench (Rank 27 of 55)

Model	Score
DeepSeek V4 Pro	93.5
Gemini 3.1 Pro Preview	91.7
DeepSeek V4 Flash	91.6
Qwen3.7-Max	91.6
Claude Sonnet 4.6 (current)	80.0
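
The three ranks above sit on leaderboards of different sizes, so they are easier to compare as relative positions. The sketch below is a minimal illustration using only the rank and leaderboard-size figures from the peer bars; the helper name `top_percent` is hypothetical, and "top percent" is computed simply as rank divided by leaderboard size, which is one of several reasonable conventions.

```python
# (rank, leaderboard size) for Claude Sonnet 4.6, copied from the peer bars above.
RANKS = {
    "SWE-bench Verified": (19, 81),
    "HumanEval": (1, 97),
    "LiveCodeBench": (27, 55),
}


def top_percent(rank: int, total: int) -> float:
    """Relative position on a leaderboard, as rank / total in percent (lower is better)."""
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return 100.0 * rank / total


if __name__ == "__main__":
    for name, (rank, total) in RANKS.items():
        print(f"{name}: rank {rank} of {total}, top {top_percent(rank, total):.1f}%")
```

By this measure the model is at the very top of the HumanEval board, inside the top quarter on SWE-bench Verified, and near the middle on LiveCodeBench.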

Benchmark scores (19)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Observed	Evaluation	Source
SWE-bench Verified	79.6	SWE-bench Verified	2026-02-17	-	Source
Terminal-Bench 2.0	59.1	Terminal-Bench 2.0	2026-02-17	-	Source
SWE-bench Multilingual	75.9	SWE-bench Multilingual	2026-05-21	-	Source
Google-Proof Q&A	89.9	diamond	2026-04-28	-	Source
MMLU PRO	87.3	-	2026-04-19	-	Source
τ-bench	87.5	τ-bench	2026-04-24	-	Source
MultiChallenge	57.1	MultiChallenge	2026-04-26	-	Source
Chatbot Arena	1459.0	-	2026-04-28	-	Source
MMMU Pro	75.6	official Anthropic system card, adaptive thinking, max effort, with image cropping tool	2026-02-17	-	Source
SWE-rebench	60.7	pass@1 (best of 5 runs)	2026-05-28	-	Source
AIME 2025	94.0	AIME 2025 (accuracy)	2026-06-07	-	Source
ARC-AGI-2	58.3	llm-stats shows 0 (accuracy%)	2026-06-07	-	Source
Humanity's Last Exam	33.2	HLE without tools (accuracy)	2026-06-07	-	Source
HumanEval	98.0	HumanEval (pass@1)	2026-06-07	-	Source
LiveCodeBench	80.0	LiveCodeBench score (accuracy)	2026-06-07	-	Source
MCP-Atlas	61.3	llm-stats shows 0 (accuracy%)	2026-06-07	-	Source
Massive Multitask Language Understanding	89.3	MMLU (accuracy)	2026-06-07	-	Source
Massive Multi-discipline Multimodal Understanding	83.6	MMMU (accuracy)	2026-06-07	-	Source
CursorBench	49.0	CursorBench 3.1	2026-06-30	Configuration: Sonnet 4.6 Max; Harness: CursorBench 3.1; Evaluator: Cursor; Confidence: confirmed; Notes: Highest CursorBench 3.1 score across Cursor's published effort configurations for this base model.	Source

Migration checks

Coming from Claude 3.5 Sonnet?

Rankings & picks (10)

Best for Coding: Solid
Best for Agents: Solid
Best for Tool use: Strong
Best for Writing: Solid
Best for Summarization: Solid
Best for Docs Q&A: Editor's Choice
Best for Translation: Solid
Best for Data & SQL: Strong
Best Multimodal / Vision LLMs: Listed
Best LLMs for Function Calling & Tool Use: Listed


Frequently asked questions

What is the context window of Claude Sonnet 4.6?

Claude Sonnet 4.6 has a context window of 1M tokens.

What is the max output of Claude Sonnet 4.6?

Claude Sonnet 4.6 can generate up to 64,000 output tokens.

How much does Claude Sonnet 4.6 cost?

Claude Sonnet 4.6 costs $3.00 per 1M input tokens and $15.00 per 1M output tokens on the cheapest tracked route (Anthropic).

When was Claude Sonnet 4.6 released?

Claude Sonnet 4.6 was released on 2026-02-17.

Which providers offer Claude Sonnet 4.6?

Claude Sonnet 4.6 is available from 6 providers: OpenRouter, Anthropic, AWS Bedrock, GCP Vertex AI, Microsoft Foundry, and Vercel AI Gateway.

What benchmarks has Claude Sonnet 4.6 been tested on?

Claude Sonnet 4.6 has been evaluated on 19 benchmarks, including SWE-bench Verified, Terminal-Bench 2.0, SWE-bench Multilingual, Google-Proof Q&A, and MMLU PRO.
