Claude 3.5 Sonnet

Name: Claude 3.5 Sonnet
Author: Anthropic

Released

2024-06-20

Last refreshed

2026-06-15

Status

Researched 90d ago

ProprietaryCommercial use: conditionalMultimodalCodingRAGAgentsLong contextVisionClassificationJSON / Tool use

Claude 3.5 Sonnet is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and agents
Workloads that can use a 200k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Claude 3.5
Released: 2024-06-20
Context: 200k
Parameters: 70B
Architecture: Decoder Only
Knowledge cutoff: 2024-04
Specialization: general
Openness: Proprietary
License: ProprietaryCommercial use: conditional
Weights: Not released
Code: Unknown
Training: Fine-tuned

Created by

Anthropic

Developing safe and ethical AI systems.

San Francisco, California, United States

Founded 2021

Website

Pricing

Output / 1M

$15.00

Input / 1M

$3.00

Cheapest of 6 routes · Anthropic · cache read $0.300

Providers(6)

GCP Vertex AI AWS Bedrock Anthropic OpenRouter Microsoft Foundry Replicate API

View 6 provider routes

About

Claude 3.5 Sonnet, the latest in Anthropic's line of large language models, merges state-of-the-art reasoning, coding, and natural language understanding capabilities with advanced multi-modal processing. Released in October 2024, it excels in benchmarks against previous models and competitors, thanks to its scalable attention mechanisms and massive neural network architecture. Its dynamic routing enables specialization in various tasks, supporting applications from software development and data analysis to customer support and content creation. Users benefit from its "Artifacts" feature for real-time collaborative workflows and can access the model through platforms like Claude.ai and APIs at competitive pricing rates.

Claude 3.5 Sonnet is a proprietary model in the Claude 3.5 family. The structured metadata tracks a 200k-token context window, multimodal input, reasoning, function calling, structured outputs, and code execution. This page tracks provider routes through GCP Vertex AI, AWS Bedrock, Anthropic, and 3 more, with the cheapest tracked route listed at $3 input and $15 output per 1M tokens. Headline tracked benchmarks include HellaSwag 96.2, HumanEval 92.0, and Massive Multitask Language Understanding 88.7.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

5 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ D

1 relevant benchmark in the decision map.

Provider price ladder

Compare all 6

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Cache	Route
Anthropic	$3.00	$15.00	read $0.300 / 5m $3.75 / 1h $6.00	Serverless
AWS Bedrock	$3.00	$15.00	-	Serverless
GCP Vertex AI	$3.00	$15.00	-	Serverless
Replicate API	$3.00	$15.00	-	Serverless

Available via routers & gateways(16)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSAnthropicGCP Vertex AIMicrosoft Foundry

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughAnthropicGCP Vertex AI

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

SubscriptionAnthropicGCP Vertex AIMicrosoft Foundry

AIRouter

Router

Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.

Passthrough + feeAnthropicGCP Vertex AI

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

PassthroughAWS Bedrock

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

PassthroughMicrosoft Foundry

Capabilities

VisionMultimodalReasoningFunction CallingStructured OutputsCode Execution

Benchmark peer barsfor Coding

SWE-bench VerifiedRank 78 of 80

Claude Fable 5

96.0

Claude Mythos Preview

93.9

Claude Opus 4.8

88.6

Claude Opus 4.7

87.6

Claude 3.5 Sonnetcurrent

49.0

HumanEvalRank 12 of 97

Claude Sonnet 4.6

98.0

96.7

Claude Opus 4.6

95.0

Grok-3

94.5

Claude 3.5 Sonnetcurrent

92.0

LiveCodeBenchRank 52 of 55

DeepSeek V4 Pro

93.5

Gemini 3.1 Pro Preview

91.7

DeepSeek V4 Flash

91.6

Qwen3.7-Max

91.6

Claude 3.5 Sonnetcurrent

48.7

Benchmark scores(12)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Evaluation	Source
HellaSwag	96.2	10-shotObserved 2026-03-06	—	Source
HumanEval	92.0	pass@1Observed 2026-03-06	—	Source
Massive Multitask Language Understanding	88.7	5-shotObserved 2026-03-06	—	Source
SWE-bench Verified	49.0	2024-10Observed 2026-04-09	—	Source
LiveCodeBench	48.7	2026-04Observed 2026-04-14	—	Source
Aider Polyglot	51.6	2026-04Observed 2026-04-14	—	Source
BigCodeBench	44.6	2025-01 (Instruct Pass@1)Observed 2026-04-14	—	Source
Chatbot Arena	1340.0	—Observed 2026-04-15	—	Source
Massive Multi-discipline Multimodal Understanding	68.3	—Observed 2026-04-15	—	Source
MMLU PRO	77.2	—Observed 2026-04-14	—	Source
MMMU Pro	63.7	standard 4-option (original paper harness)Observed 2024-09-04	—	Source
Mostly Basic Programming Problems+	78.8	—Observed 2026-05-28	—	Source

Migration checks

No linked migration route is available for this model yet.

Compare Claude 3.5 Sonnet with other models

Comparison and alternatives

Browse all comparisons →

Show all 3 popular comparisonssorted by 7-day search impressions

Claude 3.5 Sonnet vs Step 3.7 Flash5 Claude 3.5 Sonnet vs Together AI Qwen2-7B-Instruct4 Claude 3.5 Sonnet vs Gemini 2.5 Pro Computer Use Preview2

Frequently asked questions

What is the context window of Claude 3.5 Sonnet?

Claude 3.5 Sonnet has a context window of 200k tokens.

How much does Claude 3.5 Sonnet cost?

Claude 3.5 Sonnet is available at $3/1M input tokens through GCP Vertex AI.

When was Claude 3.5 Sonnet released?

Claude 3.5 Sonnet was released on 2024-06-20.

Which providers offer Claude 3.5 Sonnet?

Claude 3.5 Sonnet is available from 6 providers: GCP Vertex AI, AWS Bedrock, Anthropic, OpenRouter, Microsoft Foundry, Replicate API.

What benchmarks has Claude 3.5 Sonnet been tested on?

Claude 3.5 Sonnet has been evaluated on 12 benchmarks, including HellaSwag, HumanEval, Massive Multitask Language Understanding, SWE-bench Verified, LiveCodeBench.

Created by

Anthropic

Developing safe and ethical AI systems.

San Francisco, California, United States

Founded 2021

Website

Pricing

Output / 1M

$15.00

Input / 1M

$3.00

Cheapest of 6 routes · Anthropic · cache read $0.300

Providers(6)

GCP Vertex AI AWS Bedrock Anthropic OpenRouter Microsoft Foundry Replicate API

View 6 provider routes