LLM ReferenceLLM Reference

Qwen3.5-397B-A17B

Open SourceMultimodal

About

Alibaba's largest Qwen 3.5 model, featuring a Mixture-of-Experts architecture with 397B total parameters and 17B active per token (using 512 total experts with 10 routed + 1 shared active). Supports 201 languages with a native 262K token context window extensible to 1M tokens via YaRN. Includes a thinking/reasoning mode, tool calling with MCP integration, and unified vision-language capabilities through early fusion training.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Benchmark Scores(1)

BenchmarkScoreVersionSource
Google-Proof Q&A89.3diamondArtificial Analysis

Rankings

Specifications

FamilyQwen 3.5
Released2026-02-16
Parameters397B
Context262K
ArchitectureMoE
LicenseApache 2.0

Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website