LLM Reference

Qwen3.5-397B-A17B

Researched 28d ago

Last refreshed 2026-05-22. Next refresh: weekly.

Open SourceMultimodalCodingRAGAgentsLong contextVisionClassificationJSON / Tool use

Qwen3.5-397B-A17B is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Decision context: Coding task fit, 4 tracked provider routes, and research from 2026-05-05.

Use it for

  • Teams evaluating coding, rag, and agents
  • Workloads that can use a 262k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • Workloads where another current model has stronger sourced task evidence

Featured in Picks

Cheapest output

$2.34

Alibaba Cloud PAI-EAS per 1M tokens

Provider routes

4

Tracked API hosts

Quality / dollar

Grade C

Ranked by benchmark score divided by cheapest output price

Freshness

2026-05-05

Researched 28d ago

fresh

Top use-case fit

Coding

Q/$ C

1 relevant benchmark in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ B

1 relevant benchmark in the decision map.

Provider price ladder

Compare all 4
ProviderInput / 1MOutput / 1MRoute
Alibaba Cloud PAI-EAS$0.390$2.34
Serverless
OpenRouter$0.390$2.34
Serverless
Novita AI$0.600$3.60
Serverless
Together AI$0.600$3.60
Serverless

Benchmark peer barsfor Coding

Migration checks

No linked migration route is available for this model yet.

About

Alibaba's largest Qwen3.5 model, featuring a Mixture-of-Experts architecture with 397B total parameters and 17B active per token (using 512 total experts with 10 routed + 1 shared active). Supports 201 languages with a native 262K token context window extensible to 1M tokens via YaRN. Includes a thinking/reasoning mode, tool calling with MCP integration, and unified vision-language capabilities through early fusion training.

Qwen3.5-397B-A17B is an open-source model in the Qwen3.5 family. The structured metadata tracks a 262k-token context window, multimodal input, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through OpenRouter, Together AI, Alibaba Cloud PAI-EAS, and 1 more, with the cheapest tracked route listed at $0.39 input and $2.34 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 89.3, MMLU PRO 87.8, and Massive Multi-discipline Multimodal Understanding 85.0.

Capabilities

MultimodalReasoningFunction CallingTool UseStructured Outputs

Benchmark Scores(6)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
Google-Proof Q&A89.3diamondArtificial Analysis
MMLU PRO87.8https://huggingface.co/Qwen/Qwen3.5-397B-A17B
Massive Multi-discipline Multimodal Understanding85.0https://huggingface.co/Qwen/Qwen3.5-397B-A17B
Instruction-Following Evaluation92.6https://huggingface.co/Qwen/Qwen3.5-397B-A17B
BFCL72.9v4https://huggingface.co/Qwen/Qwen3.5-397B-A17B
SWE-bench Verified76.2SWE-bench Verifiedhttps://benchlm.ai/benchmarks/sweVerified

Rankings

Show all 53 popular comparisonssorted by 7-day search impressions
Qwen3.5-397B-A17B vs Claude Haiku 4.534Qwen3.5-397B-A17B vs GLM-5 Turbo30Qwen3.5-397B-A17B vs Llama 3 8B Instruct30Qwen3.5-397B-A17B vs GPT-4o29Qwen3.5-397B-A17B vs Claude 3.7 Sonnet27Qwen3.5-397B-A17B vs Mistral Medium 3.526Qwen3.5-397B-A17B vs DeepSeek V3.125Qwen3.5-397B-A17B vs Grok 424Qwen3.5-397B-A17B vs Mistral Large 221Qwen3.5-397B-A17B vs Qwen2.5-72B-Instruct20Qwen3.5-397B-A17B vs Llama 3 70B Instruct20Qwen3.5-397B-A17B vs GLM-4V 9B18Qwen3.5-397B-A17B vs Llama 3.2 11B Vision16Qwen3.5-397B-A17B vs Qwen2-VL-72B-Instruct15Qwen3.5-397B-A17B vs Gemma 7B Instruct15Qwen3.5-397B-A17B vs Grok 4.315Qwen3.5-397B-A17B vs Claude Opus 4.513Qwen3.5-397B-A17B vs o313Qwen3.5-397B-A17B vs DeepSeek V312Qwen3.5-397B-A17B vs Phi-3 Mini 4k12Qwen3.5-397B-A17B vs Llama 3.1 70B Instruct12Qwen3.5-397B-A17B vs Qwen3.5-Flash12Qwen3.5-397B-A17B vs Gemini 3 Pro12Qwen3.5-397B-A17B vs GPT-5.3-Codex-Spark12Qwen3.5-397B-A17B vs DeepSeek R111Qwen3.5-397B-A17B vs GPT-5.211Qwen3.5-397B-A17B vs GPT-5.510Qwen3.5-397B-A17B vs Phi 3.5 Vision Instruct9Qwen3.5-397B-A17B vs Gemini 2.5 Flash9Qwen3.5-397B-A17B vs DeepSeek R1 05288Qwen3.5-397B-A17B vs GPT Realtime 27Qwen3.5-397B-A17B vs gpt-realtime-1.57Qwen3.5-397B-A17B vs Gemini 3.1 Flash-Lite7Qwen3.5-397B-A17B vs Qwen3.5-35B-A3B6Qwen3.5-397B-A17B vs GPT-5.5-Cyber6Qwen3.5-397B-A17B vs Llama 3.2 1B Instruct6Qwen3.5-397B-A17B vs GPT-5.5 Instant6Qwen3.5-397B-A17B vs Llama 3.2 90B Vision5Qwen3.5-397B-A17B vs GPT Realtime Translate4Qwen3.5-397B-A17B vs Claude Mythos Preview4Qwen3.5-397B-A17B vs Llama 2 13B Chat4Qwen3.5-397B-A17B vs Trinity-Large-Thinking3Qwen3.5-397B-A17B vs o3 Mini3Qwen3.5-397B-A17B vs Mistral Medium 3 Instruct3Qwen3.5-397B-A17B vs GPT-5.4 Pro3Qwen3.5-397B-A17B vs Gemini 2.5 Pro3Qwen3.5-397B-A17B vs Grok-33Qwen3.5-397B-A17B vs gpt-realtime3Qwen3.5-397B-A17B vs Mistral Large 2 (2407)2Qwen3.5-397B-A17B vs Mistral Medium 31Qwen3.5-397B-A17B vs Phi-3 Silica1Qwen3.5-397B-A17B vs DeepSeek R1 Lite1Qwen3.5-397B-A17B vs Mixtral 8x7B0

Specifications

FamilyQwen3.5
Released2026-02-16
Parameters397B
Context262k
ArchitectureMoE
LicenseApache 2.0

Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website