Claude Opus 4.8

Name: Claude Opus 4.8
Author: Anthropic

Released: 2026-05-28
Last refreshed: 2026-06-29
Status: Researched 17d ago

Proprietary · Commercial use: conditional · Multimodal · Coding · RAG · Agents · Long context · Vision · JSON / Tool use

Highlight

Claude Opus 4.8 is worth evaluating for coding, RAG, and agent workloads when its provider route and context window match the workload.

Use it for

Teams evaluating coding, RAG, and agents
Workloads that can use a 1M-token context window
Buyers comparing 4 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Claude 4.8
Released: 2026-05-28
Context: 1M tokens
Max output: 128,000 tokens
Architecture: Decoder-only
Knowledge cutoff: 2026-01
Specialization: general
Openness: Proprietary
License: Proprietary · Commercial use: conditional
Weights: Not released
Code: Not released
Training: Fine-tuned

Created by

Anthropic

Developing safe and ethical AI systems.

San Francisco, California, United States

Founded 2021

Website

Pricing

Output / 1M

$25.00

Input / 1M

$5.00

Cheapest of 6 routes · Anthropic · cache read $0.500

Providers(6)

Anthropic AWS Bedrock GCP Vertex AI Microsoft Foundry OpenRouter Vercel AI Gateway

View 6 provider routes

Links

Website

About

Claude Opus 4.8 is Anthropic's flagship Claude 4.8 model, released May 28, 2026, for agentic coding, long-horizon reasoning, computer use, and professional knowledge work. It supports text and image inputs, adaptive reasoning, tool use, structured outputs, computer-use tools, prompt caching, the Batch API, Dynamic Workflows parallel subagents, a 1M-token context window on Anthropic API/Bedrock/Vertex, and 128K max output. Key benchmark results: SWE-bench Pro 69.2%, SWE-bench Verified 88.6%, Terminal-Bench 2.1 74.6%, HLE with tools 57.9%, OSWorld-Verified 83.4%, GDPval-AA 1890 Elo, and MCP-Atlas 82.2%. Standard Anthropic API pricing is $5/M input and $25/M output.

Claude Opus 4.8 is a proprietary model in the Claude 4.8 family. The structured metadata tracks a 1M-token context window, multimodal input, reasoning, function calling, tool use, structured outputs, and code execution. This page tracks provider routes through Anthropic, AWS Bedrock, GCP Vertex AI, and 3 more, with the cheapest tracked route listed at $5 input and $25 output per 1M tokens. Headline tracked benchmarks include Finance Agent v2 53.9, SWE-bench Verified 88.6, and SWE-bench Pro 69.2.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

3 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ D

1 relevant benchmark in the decision map.

Provider price ladder

Compare all 6

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Batch in / out	Cache	Route
Anthropic	$5.00	$25.00	$2.50 / $12.50	read $0.500 / 5m write $6.25 / 1h write $10.00	Serverless
OpenRouter	$5.00	$25.00	-	-	Serverless
AWS Bedrock	-	-	-	-	Serverless (partial)
GCP Vertex AI	-	-	-	-	Serverless (partial)
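
As a rough illustration of how the Anthropic row translates into per-request cost, here is a minimal Python sketch. The rates are copied from the table above; the function name `estimate_cost` and its parameters are hypothetical helpers made up for this example, not part of any provider SDK. It assumes cached tokens are billed at the cache-read rate instead of the input rate, and it ignores cache-write charges and any provider-specific surcharges.

```python
# USD per 1M tokens, copied from the Anthropic row of the price ladder.
INPUT_RATE, OUTPUT_RATE = 5.00, 25.00
BATCH_INPUT_RATE, BATCH_OUTPUT_RATE = 2.50, 12.50
CACHE_READ_RATE = 0.50


def estimate_cost(input_tokens, output_tokens, cached_tokens=0, batch=False):
    """Estimate the USD cost of one request (hypothetical helper).

    cached_tokens is the portion of input_tokens served from the prompt
    cache. Cache-write charges are not modeled here.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    if batch and cached_tokens:
        # The table lists no cache rate for batch requests, so refuse to guess.
        raise ValueError("no tracked cache-read rate for batch requests")

    if batch:
        in_rate, out_rate = BATCH_INPUT_RATE, BATCH_OUTPUT_RATE
    else:
        in_rate, out_rate = INPUT_RATE, OUTPUT_RATE

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * in_rate
        + cached_tokens * CACHE_READ_RATE
        + output_tokens * out_rate
    )
    return total / 1_000_000


if __name__ == "__main__":
    # A 200K-token prompt with a 10K-token answer, priced three ways.
    print(f"standard: ${estimate_cost(200_000, 10_000):.3f}")
    print(f"batch:    ${estimate_cost(200_000, 10_000, batch=True):.3f}")
    print(f"cached:   ${estimate_cost(200_000, 10_000, cached_tokens=150_000):.3f}")
```

Because output is priced at five times input, long generations tend to dominate the bill even for large prompts, so it is worth estimating both sides before choosing a route.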

Available via routers & gateways(16)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSS · Anthropic · GCP Vertex AI · Microsoft Foundry

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

Passthrough · Anthropic · GCP Vertex AI

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

Subscription · Anthropic · GCP Vertex AI · Microsoft Foundry

AIRouter

Router

Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.

Passthrough + fee · Anthropic · GCP Vertex AI

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

Passthrough · AWS Bedrock

Azure AI Foundry Model Router

Router

Microsoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.

Passthrough · Microsoft Foundry

Capabilities

Vision · Multimodal · Reasoning · Function Calling · Tool Use · Structured Outputs · Code Execution · Prompt Caching · Batch API

Benchmark peer bars for Coding

SWE-bench Pro (Rank 2 of 41)

Model	Score
Claude Fable 5	80.3
Claude Opus 4.8 (current)	69.2
Grok 4.5	64.7
GPT-5.6 Sol	64.6
Claude Opus 4.7	64.3

SWE-bench Verified (Rank 3 of 80)

Model	Score
Claude Fable 5	96.0
Claude Mythos Preview	93.9
Claude Opus 4.8 (current)	88.6
Claude Opus 4.7	87.6
Claude Sonnet 5	85.2

LiveCodeBench (Rank 7 of 55)

Model	Score
DeepSeek V4 Pro	93.5
Gemini 3.1 Pro Preview	91.7
DeepSeek V4 Flash	91.6
Qwen3.7-Max	91.6
Claude Opus 4.8 (current)	88.8

Benchmark scores(16)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
Finance Agent v2	53.9	—	https://www.anthropic.com/news/claude-opus-4-8
SWE-bench Verified	88.6	SWE-bench Verified	https://www.anthropic.com/news/claude-opus-4-8
SWE-bench Pro	69.2	SWE-bench Pro	https://www.anthropic.com/news/claude-opus-4-8
Terminal-Bench 2.1	74.6	Terminal-Bench 2.1	https://www.anthropic.com/news/claude-opus-4-8
Google-Proof Q&A	93.6	GPQA Diamond	https://www.anthropic.com/news/claude-opus-4-8
Humanity's Last Exam — No Tools	49.8	HLE no tools	https://www.anthropic.com/news/claude-opus-4-8
Humanity's Last Exam — With Tools	57.9	HLE with tools	https://www.anthropic.com/news/claude-opus-4-8
BrowseComp — Single Agent	84.3	BrowseComp single-agent	https://codingfleet.com/blog/claude-opus-4-8-vs-gpt-5-5-comparison/
BrowseComp — Multi-Agent	88.5	BrowseComp multi-agent Dynamic Workflows	https://www.anthropic.com/news/claude-opus-4-8
OSWorld-Verified	83.4	OSWorld-Verified	https://www.anthropic.com/news/claude-opus-4-8
GDPval-AA	1890.0	GDPval-AA ELO	https://www.anthropic.com/news/claude-opus-4-8
MCP-Atlas	82.2	MCP-Atlas	https://codingfleet.com/blog/claude-opus-4-8-vs-gpt-5-5-comparison/
ARC-AGI-2 — High Effort	72.1	ARC-AGI-2 high effort	https://codingfleet.com/blog/claude-opus-4-8-vs-gpt-5-5-comparison/
LiveCodeBench	88.8	LiveCodeBench	https://codingfleet.com/blog/claude-opus-4-8-vs-gpt-5-5-comparison/
CursorBench	63.8	CursorBench 3.1	https://cursor.com/evals
GeneBench-Pro	16.0	max	https://cdn.openai.com/pdf/21938268-21af-442f-af93-3b2249afb241/genebench-pro.pdf
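
One way to put a benchmark margin in context is to express the gap to the top-ranked model both in points and relative to the leader's score. The sketch below does that for the three Coding peer-bar suites listed earlier on this page. The helper name `gap_to_leader` is hypothetical, and the calculation assumes that higher is better on all three suites; it says nothing about whether a given gap matters for a particular workload.

```python
# Leader and current-model scores copied from the Coding peer bars on this page.
# Assumption: higher is better on all three suites.
PEER_BARS = {
    "SWE-bench Pro": {"leader": 80.3, "current": 69.2},
    "SWE-bench Verified": {"leader": 96.0, "current": 88.6},
    "LiveCodeBench": {"leader": 93.5, "current": 88.8},
}


def gap_to_leader(leader: float, current: float) -> tuple[float, float]:
    """Return (gap in points, gap as a percentage of the leader's score)."""
    gap = leader - current
    return round(gap, 1), round(100 * gap / leader, 1)


if __name__ == "__main__":
    for suite, scores in PEER_BARS.items():
        points, percent = gap_to_leader(scores["leader"], scores["current"])
        print(f"{suite}: {points} points behind the leader ({percent}% of its score)")
```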

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(4)

Best for Coding: Strong
Best AI Agent Models 2026: SWE-bench Ranked: Listed
Best LLMs for Reasoning & Math: Listed
Best Mainstream LLM APIs, Ranked: Listed

Compare Claude Opus 4.8 with other models

Comparison and alternatives

Claude Opus 4.8 vs GLM-5.2
Claude Opus 4.8 vs Qwen3.7-Plus
Claude Opus 4.8 vs Claude Fable 5
Claude Opus 4.8 vs Claude Sonnet 5
Claude Opus 4.8 vs Claude Opus 4.7
Claude Opus 4.8 vs GPT-5.3-Codex
Claude Opus 4.8 vs GPT-5.5
Claude Opus 4.8 vs Claude Sonnet 4.6
Claude Opus 4.8 vs Grok 4
Claude Opus 4.8 vs GPT-5.5 Pro
Claude Opus 4.8 vs Gemini 3.1 Pro Preview
Claude Opus 4.8 vs Gemini 3.5 Pro
Claude Opus 4.8 vs DeepSeek V4 Pro
Claude Opus 4.8 vs Kimi K2.6
Claude Opus 4.8 vs Unisound U2
Claude Opus 4.8 vs Grok 4 Heavy

Popular comparisons, sorted by 7-day search impressions

Comparison	Impressions
Claude Opus 4.8 vs Composer 2.5	17
Claude Opus 4.8 vs Gemini 3.1 Flash-Lite	4
Claude Opus 4.8 vs GPT-5.4	2
Claude Opus 4.8 vs GPT-5.4-Cyber	1
Claude Opus 4.8 vs o3-pro	1

Frequently asked questions

What is the context window of Claude Opus 4.8?

Claude Opus 4.8 has a context window of 1M tokens.

What is the max output of Claude Opus 4.8?

Claude Opus 4.8 can generate up to 128,000 output tokens.

How much does Claude Opus 4.8 cost?

Through Anthropic, Claude Opus 4.8 costs $5.00 per 1M input tokens and $25.00 per 1M output tokens.

When was Claude Opus 4.8 released?

Claude Opus 4.8 was released on 2026-05-28.

Which providers offer Claude Opus 4.8?

Claude Opus 4.8 is available from 6 providers: Anthropic, AWS Bedrock, GCP Vertex AI, Microsoft Foundry, OpenRouter, Vercel AI Gateway.

What benchmarks has Claude Opus 4.8 been tested on?

Claude Opus 4.8 has been evaluated on 16 benchmarks, including Finance Agent v2, SWE-bench Verified, SWE-bench Pro, Terminal-Bench 2.1, and Google-Proof Q&A.
