Qwen3.5-397B-A17B

Name: Qwen3.5-397B-A17B
Author: Alibaba

Released

2026-02-16

Last refreshed

2026-06-29

Status

Researched 43d ago

Open sourceCommercial use: permittedMultimodalCodingRAGAgentsLong contextVisionClassificationJSON / Tool use

Qwen3.5-397B-A17B is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and agents
Workloads that can use a 262k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Specifications

Family: Qwen3.5
Released: 2026-02-16
Context: 262k
Parameters: 397B
Architecture: Mixture of Experts
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Weights: Available
Code: Unknown

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$2.34

Input / 1M

$0.390

Cheapest of 4 routes · Alibaba Cloud PAI-EAS

Providers(4)

OpenRouter Together AI Alibaba Cloud PAI-EAS Novita AI

View 4 provider routes

About

Alibaba's largest Qwen3.5 model, featuring a Mixture-of-Experts architecture with 397B total parameters and 17B active per token (using 512 total experts with 10 routed + 1 shared active). Supports 201 languages with a native 262K token context window extensible to 1M tokens via YaRN. Includes a thinking/reasoning mode, tool calling with MCP integration, and unified vision-language capabilities through early fusion training.

Qwen3.5-397B-A17B is an open-source model in the Qwen3.5 family. The structured metadata tracks a 262k-token context window, multimodal input, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through OpenRouter, Together AI, Alibaba Cloud PAI-EAS, and 1 more, with the cheapest tracked route listed at $0.39 input and $2.34 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 89.3, MMLU PRO 87.8, and Massive Multi-discipline Multimodal Understanding 85.0.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ C

2 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ C

4 relevant benchmarks in the decision map.

Provider price ladder

Compare all 4

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Alibaba Cloud PAI-EAS	$0.390	$2.34	Serverless
OpenRouter	$0.390	$2.34	Serverless
Novita AI	$0.600	$3.60	Serverless
Together AI	$0.600	$3.60	Serverless

Available via routers & gateways(1)

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughTogether AI

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured Outputs

Benchmark peer barsfor Coding

SWE-bench VerifiedRank 38 of 80

Claude Fable 5

96.0

Claude Mythos Preview

93.9

Claude Opus 4.8

88.6

Claude Opus 4.7

87.6

Qwen3.5-397B-A17Bcurrent

76.2

LiveCodeBenchRank 19 of 55

DeepSeek V4 Pro

93.5

Gemini 3.1 Pro Preview

91.7

DeepSeek V4 Flash

91.6

Qwen3.7-Max

91.6

Qwen3.5-397B-A17Bcurrent

83.6

Benchmark scores(13)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
Google-Proof Q&A	89.3	diamond	Artificial Analysis
MMLU PRO	87.8	From official HuggingFace model card (accuracy)	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
Massive Multi-discipline Multimodal Understanding	85.0	—	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
Instruction-Following Evaluation	92.6	—	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
BFCL	72.9	v4	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
SWE-bench Verified	76.2	SWE-bench Verified	https://benchlm.ai/benchmarks/sweVerified
AIME 2026	91.3	AIME 2026 (accuracy)	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
Berkeley Function Calling Leaderboard v3	72.9	BFCL-V4, from official model card (accuracy)	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
Humanity's Last Exam	28.7	HLE with CoT, no tools, from official model card (accuracy)	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
LiveCodeBench	83.6	LiveCodeBench v6 (pass@1)	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
MultiChallenge	67.6	Multi-Challenge leaderboard rank 2 of 28 (accuracy%)	https://llm-stats.com/benchmarks/multichallenge
τ-bench	86.7	TAU2-Bench, from official model card (accuracy)	https://huggingface.co/Qwen/Qwen3.5-397B-A17B
Terminal-Bench 2.0	52.5	Terminal-Bench 2.0 (accuracy%)	https://llm-stats.com/benchmarks/terminal-bench-2

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(8)

Best for Tool useSolid Best for Open weightsSolid Best for TranslationStrong Best AI Agent Models 2026: SWE-bench RankedListed Best Open Source LLMsListed Best Multimodal / Vision LLMsListed Best Free LLMs You Can Use Right NowListed Best LLMs for Customer SupportListed

Compare Qwen3.5-397B-A17B with other models

Comparison and alternatives

Browse all comparisons →

Show all 53 popular comparisonssorted by 7-day search impressions

Frequently asked questions

What is the context window of Qwen3.5-397B-A17B?

Qwen3.5-397B-A17B has a context window of 262k tokens.

How much does Qwen3.5-397B-A17B cost?

Qwen3.5-397B-A17B pricing ranges from $0.39/1M to $0.6/1M input tokens depending on the provider.

When was Qwen3.5-397B-A17B released?

Qwen3.5-397B-A17B was released on 2026-02-16.

Which providers offer Qwen3.5-397B-A17B?

Qwen3.5-397B-A17B is available from 4 providers: OpenRouter, Together AI, Alibaba Cloud PAI-EAS, Novita AI.

What benchmarks has Qwen3.5-397B-A17B been tested on?

Qwen3.5-397B-A17B has been evaluated on 13 benchmarks, including Google-Proof Q&A, MMLU PRO, Massive Multi-discipline Multimodal Understanding, Instruction-Following Evaluation, BFCL.