Qwen3-Coder-480B-A35B-Instruct

Name: Qwen3-Coder-480B-A35B-Instruct
Author: Alibaba

Released

2025-07-22

Last refreshed

2026-06-29

Status

Researched 13d ago

Open sourceCommercial use: permittedCodingRAGAgentsLong contextClassificationJSON / Tool use

Qwen3-Coder-480B-A35B-Instruct is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and agents
Workloads that can use a 262k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Qwen3-Coder
Released: 2025-07-22
Context: 262k
Parameters: 480B total, 35B active
Architecture: Mixture of Experts
Specialization: code
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Training: Pretrained

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$1.55

Input / 1M

$0.380

Cheapest of 6 routes · Novita AI

Providers(6)

Fireworks AI GCP Vertex AI NVIDIA NIM AWS Bedrock Vercel AI Gateway Novita AI

View 6 provider routes

Links

Website HuggingFace

About

Qwen3-Coder-480B-A35B-Instruct is Alibaba's flagship open-source code generation and agentic model, released July 22, 2025 under the Apache 2.0 license. The model has 480 billion total parameters with 35 billion active parameters per token, organized across 62 transformer layers with 160 specialized expert networks and 8 experts activated per token. It uses Grouped Query Attention (GQA) with 96 query heads and 8 key-value heads and supports a native context window of 262,144 tokens, extendable to 1 million tokens via YaRN position scaling. The model is purpose-built for software engineering tasks and agentic workflows: code generation, code review, test writing, multi-step debugging, and browser-based agentic task execution. On release, it achieved state-of-the-art results among open models on Agentic Coding, Agentic Browser-Use, and Agentic Tool-Use benchmarks, with performance comparable to Claude Sonnet 4 on these tasks. Available via Fireworks AI, Google Vertex AI, NVIDIA NIM, AWS Bedrock, Novita AI, and the Vercel AI Gateway.

The model is purpose-built for software engineering tasks and agentic workflows: code generation, code review, test writing, multi-step debugging, and browser-based agentic task execution. On release, it achieved state-of-the-art results among open models on Agentic Coding, Agentic Browser-Use, and Agentic Tool-Use benchmarks, with performance comparable to Claude Sonnet 4 on these tasks. The instruction-tuned variant supports function calling and structured output natively.

Qwen3-Coder-480B-A35B-Instruct is available via Fireworks AI, Google Vertex AI, NVIDIA NIM, AWS Bedrock, Novita AI, and the Vercel AI Gateway. It is the largest model in the Qwen3 Coder family and represents the top open-source coding capability from Alibaba as of mid-2025. The 256K native context window accommodates large codebases, multi-file sessions, and long agentic task traces within a single context.

Qwen3-Coder-480B-A35B-Instruct has a 262k-token context window.

Qwen3-Coder-480B-A35B-Instruct input tokens at $0.22/1M, output at $1.8/1M.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

2 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ B

2 relevant benchmarks in the decision map.

Provider price ladder

Compare all 6

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Cache	Route
Novita AI	$0.380	$1.55	-	Serverless
GCP Vertex AI	$0.220	$1.80	-	Serverless
Vercel AI Gateway	$1.50	$7.50	read $0.300	Serverless
AWS Bedrock	-	-	-	ServerlessPartial

Available via routers & gateways(15)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSGCP Vertex AI

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughGCP Vertex AIFireworks AI

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

SubscriptionGCP Vertex AI

AIRouter

Router

Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.

Passthrough + feeGCP Vertex AI

Amazon Bedrock Intelligent Prompt Routing

Router

AWS Bedrock's native intelligent prompt router that routes prompts between Anthropic Claude model tiers (Haiku/Sonnet) based on predicted task complexity, with no extra per-routing charge.

PassthroughAWS Bedrock

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

SubscriptionGCP Vertex AI

Capabilities

Function CallingTool UseStructured OutputsCode Execution

Benchmark peer barsfor Coding

SWE-bench ProRank 41 of 41

80.3

73.7

69.2

64.3

Qwen3-Coder-480B-A35B-Instructcurrent

38.7

SWE-bench VerifiedRank 69 of 80

Claude Fable 5

96.0

Claude Mythos Preview

93.9

Claude Opus 4.8

88.6

Claude Opus 4.7

87.6

Qwen3-Coder-480B-A35B-Instructcurrent

66.5

Benchmark scores(3)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
Berkeley Function Calling Leaderboard v3	68.7	Berkeley Function Calling Leaderboard (BFCL v3)	https://gorilla.cs.berkeley.edu/leaderboard.html
SWE-bench Pro	38.7	Scale AI standardized SWE-bench Pro	https://labs.scale.com/leaderboard/swe_bench_pro_public
SWE-bench Verified	66.5	Nebius/OpenHands independent SWE-bench Verified	https://nebius.com/blog/posts/openhands-trajectories-with-qwen3-coder-480b

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(1)

Best AI Agent Models 2026: SWE-bench RankedListed

Frequently asked questions

What is the context window of Qwen3-Coder-480B-A35B-Instruct?

Qwen3-Coder-480B-A35B-Instruct has a context window of 262k tokens.

How much does Qwen3-Coder-480B-A35B-Instruct cost?

Qwen3-Coder-480B-A35B-Instruct pricing ranges from $0.22/1M to $1.5/1M input tokens depending on the provider.

When was Qwen3-Coder-480B-A35B-Instruct released?

Qwen3-Coder-480B-A35B-Instruct was released on 2025-07-22.

Which providers offer Qwen3-Coder-480B-A35B-Instruct?

Qwen3-Coder-480B-A35B-Instruct is available from 6 providers: Fireworks AI, GCP Vertex AI, NVIDIA NIM, AWS Bedrock, Vercel AI Gateway, Novita AI.

What benchmarks has Qwen3-Coder-480B-A35B-Instruct been tested on?

Qwen3-Coder-480B-A35B-Instruct has been evaluated on 3 benchmarks, including Berkeley Function Calling Leaderboard v3, SWE-bench Pro, SWE-bench Verified.

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$1.55

Input / 1M

$0.380

Cheapest of 6 routes · Novita AI

Providers(6)

Fireworks AI GCP Vertex AI NVIDIA NIM AWS Bedrock Vercel AI Gateway Novita AI

View 6 provider routes

Links

Website HuggingFace