Qwen3-Max

Name: Qwen3-Max
Author: Alibaba

Released

2025-04-28

Last refreshed

2026-06-29

Status

Researched 92d ago

Open sourceCommercial use: permittedMultimodalCodingRAGAgentsLong contextVisionJSON / Tool use

Qwen3-Max is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and agents
Workloads that can use a 262k context window
Buyers comparing 3 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Qwen3
Released: 2025-04-28
Context: 262k
Architecture: Decoder Only
Knowledge cutoff: 2025-12
Specialization: general
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Weights: Unknown
Code: Unknown
Training: Fine-tuned

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$3.90

Input / 1M

$0.780

Cheapest of 3 routes · OpenRouter

Providers(3)

OpenRouter Vercel AI Gateway Novita AI

View 3 provider routes

Links

Website

About

Alibaba's Qwen3-Max, flagship model with improved multilingual and reasoning capabilities.

Qwen3-Max is an open-source model in the Qwen3 family. The structured metadata tracks a 262k-token context window, multimodal input, function calling, tool use, and structured outputs. This page tracks provider routes through OpenRouter, Vercel AI Gateway, and Novita AI, with the cheapest tracked route listed at $0.78 input and $3.9 output per 1M tokens. Headline tracked benchmarks include SWE-bench Verified 78.8, τ-bench 76.8, and Berkeley Function Calling Leaderboard v3 71.9.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ D

1 relevant benchmark in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ C

4 relevant benchmarks in the decision map.

Provider price ladder

Compare all 3

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Cache	Route
OpenRouter	$0.780	$3.90	-	Serverless
Vercel AI Gateway	$1.20	$6.00	read $0.240	Serverless
Novita AI	$2.11	$8.45	-	Serverless

Capabilities

VisionMultimodalFunction CallingTool UseStructured Outputs

Benchmark peer barsfor Coding

SWE-bench VerifiedRank 21 of 80

Claude Fable 5

96.0

Claude Mythos Preview

93.9

Claude Opus 4.8

88.6

Claude Opus 4.7

87.6

Qwen3-Maxcurrent

78.8

Benchmark scores(4)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Evaluation	Source
SWE-bench Verified	78.8	SWE-bench VerifiedObserved 2026-04-24	—	Source
τ-bench	76.8	τ-benchObserved 2026-04-24	—	Source
Berkeley Function Calling Leaderboard v3	71.9	Berkeley Function Calling Leaderboard (BFCL v3)Observed 2026-04-12	—	Source
MultiChallenge	41.2	MultiChallengeObserved 2026-04-26	—	Source