LLM Reference

Claude Sonnet 5

Released
2026-06-30
Last refreshed
2026-06-30
Status
Researched today
ProprietaryCommercial use: conditionalMultimodalCodingRAGAgentsLong contextVisionJSON / Tool useHighlight

Claude Sonnet 5 is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding, rag, and agents
  • Workloads that can use a 1m context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • Workloads where another current model has stronger sourced task evidence
Specifications
Family
Claude 5
Released
2026-06-30
Context
1m
Max output
128,000
Architecture
Decoder Only
Knowledge cutoff
2026-01
Specialization
general
Openness
Proprietary
License
ProprietaryCommercial use: conditional
Training
Fine-tuned
Created by

Developing safe and ethical AI systems.

San Francisco, California, United States
Founded 2021
Website
Pricing
Output / 1M
$10.00
Input / 1M
$2.00

Cheapest of 5 routes · OpenRouter · cache read $0.200

About

Claude Sonnet 5 is Anthropic's next-generation Sonnet model for agentic coding, tool use, computer use, and professional work. It is a proprietary decoder-only model with a 1M-token context window, 128K max output, multimodal vision, adaptive thinking, function calling, structured outputs, prompt caching, and Batch API support. It is available through the Claude API, AWS Bedrock, Google Cloud Vertex AI, Microsoft Foundry preview, and OpenRouter. Anthropic lists durable standard pricing at $3/1M input and $15/1M output tokens, with introductory $2/$10 pricing through 2026-08-31.

Claude Sonnet 5 is a proprietary model in the Claude 5 family. The structured metadata tracks a 1m-token context window, multimodal input, reasoning, function calling, tool use, structured outputs, and code execution. This page tracks provider routes through Anthropic, AWS Bedrock, GCP Vertex AI, and 2 more, with the cheapest tracked route listed at $2 input and $10 output per 1M tokens. Headline tracked benchmarks include SWE-bench Verified 85.2, SWE-bench Pro 63.2, and SWE-bench Multilingual 78.3.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

2 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ D

1 relevant benchmark in the decision map.

Provider price ladder

Compare all 5

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MBatch in / outCacheRoute
OpenRouter$2.00$10.00-read $0.200 / 5m $2.50 / 1h $4.00
Serverless
Anthropic$3.00$15.00$1.50 / $7.50read $0.300 / 5m $3.75 / 1h $6.00
Serverless
GCP Vertex AI$3.00$15.00$1.50 / $7.50read $0.300 / 5m $3.75 / 1h $6.00
Serverless
AWS Bedrock----
ServerlessPartial

Available via routers & gateways(16)

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode ExecutionPrompt CachingBatch API

Benchmark peer barsfor Coding

Benchmark scores(9)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
SWE-bench Verified85.2SWE-bench Verified; Anthropic standard configuration, adaptive thinking at max effort, default sampling, average over 5 trialshttps://www.anthropic.com/claude-sonnet-5-system-card
SWE-bench Pro63.2SWE-bench Pro; Anthropic standard configuration, adaptive thinking at max effort, default sampling, average over 5 trialshttps://www.anthropic.com/claude-sonnet-5-system-card
SWE-bench Multilingual78.3SWE-bench Multilingual; Anthropic standard configuration, adaptive thinking at max effort, default sampling, average over 5 trialshttps://www.anthropic.com/claude-sonnet-5-system-card
Terminal-Bench 2.180.4Terminal-Bench 2.1; mini-SWE-agent harness on GKE, 1x timeout rate, 3x memory ceiling, xhigh effort, 445 trialshttps://www.anthropic.com/claude-sonnet-5-system-card
Humanity's Last Exam — With Tools57.4Humanity's Last Exam with web search, web fetch, programmatic tool calling, and code execution; Claude Opus 4.6 grader; 1M total-token caphttps://www.anthropic.com/claude-sonnet-5-system-card
BrowseComp — Multi-Agent86.6BrowseComp multi-agent; web search, web fetch, programmatic tool calling, code execution; 10M-token limit with context compactionhttps://www.anthropic.com/claude-sonnet-5-system-card
BrowseComp84.7BrowseComp single-agent; adaptive thinking at maximum effort, 10M-token limit with context compaction triggered at 200khttps://www.anthropic.com/claude-sonnet-5-system-card
OSWorld-Verified81.2OSWorld-Verified; 361 tasks, 1080p Ubuntu VM, 100 action steps, adaptive thinking at max effort, pass@1 averaged over 5 runshttps://www.anthropic.com/claude-sonnet-5-system-card
GDP.pdf81.6GDP.pdf with Python tools and image cropping tool; internal harness, Opus 4.7 judge, mean criteria pass rate averaged over 5 runshttps://www.anthropic.com/claude-sonnet-5-system-card

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Claude Sonnet 5?

Claude Sonnet 5 has a context window of 1m tokens.

What is the max output of Claude Sonnet 5?

Claude Sonnet 5 can generate up to 128,000 output tokens.

How much does Claude Sonnet 5 cost?

Claude Sonnet 5 pricing ranges from $2/1M to $3.00/1M input tokens depending on the provider.

When was Claude Sonnet 5 released?

Claude Sonnet 5 was released on 2026-06-30.

Which providers offer Claude Sonnet 5?

Claude Sonnet 5 is available from 5 providers: Anthropic, AWS Bedrock, GCP Vertex AI, OpenRouter, Microsoft Foundry.

What benchmarks has Claude Sonnet 5 been tested on?

Claude Sonnet 5 has been evaluated on 9 benchmarks, including SWE-bench Verified, SWE-bench Pro, SWE-bench Multilingual, Terminal-Bench 2.1, Humanity's Last Exam — With Tools.