Qwen3-VL Models by Alibaba
Details
Capabilities
About
Qwen3-VL is a family of 6 AI models by Alibaba, released in 2025.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
Use when the workload needs 128k context, structured outputs, and multimodal inputs.
Use when the workload needs 128k context, 32B parameters, and tool use.
Use when the workload needs 128k context, 8B parameters, and tool use.
Use when the workload needs 128k context, 30B parameters, and tool use.
Use when the workload needs 256k context, 235B parameters, and tool use.
Use when the workload needs multimodal, 128k context, and 30B parameters.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Qwen3-VL-235B-A22B | Use when the workload needs 128k context, structured outputs, and multimodal inputs. | 2025-12 | 128k contextstructured outputsmultimodal inputs | Current |
| Qwen3 VL 32B Instruct | Use when the workload needs 128k context, 32B parameters, and tool use. | 2025-09 | 128k context32B parameterstool use | Current |
| Qwen3 VL 8B Instruct | Use when the workload needs 128k context, 8B parameters, and tool use. | 2025-09 | 128k context8B parameterstool use | Current |
| Qwen3 VL 30B A3B Instruct | Use when the workload needs 128k context, 30B parameters, and tool use. | 2025-09 | 128k context30B parameterstool use | Current |
| Qwen3 VL 235B A22B Instruct | Use when the workload needs 256k context, 235B parameters, and tool use. | 2025-09 | 256k context235B parameterstool use | Current |
| Qwen3-VL-30B-A3B | Use when the workload needs multimodal, 128k context, and 30B parameters. | 2025-01 | multimodal128k context30B parameters | Current |
Release Timeline
3 release groupsSpecifications(6 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Fn Calling | Tool Use | Structured Outputs |
|---|---|---|---|---|---|---|---|---|
| Qwen3-VL-235B-A22B | 2025-12 | 128k | 235B (22B active) | Yes | Yes | No | No | Yes |
| Qwen3 VL 32B Instruct | 2025-09 | 128k | 32B | Yes | Yes | Yes | Yes | Yes |
| Qwen3 VL 8B Instruct | 2025-09 | 128k | 8B | Yes | Yes | Yes | Yes | Yes |
| Qwen3 VL 30B A3B Instruct | 2025-09 | 128k | 30B | Yes | Yes | Yes | Yes | Yes |
| Qwen3 VL 235B A22B Instruct | 2025-09 | 256k | 235B | Yes | Yes | Yes | Yes | Yes |
| Qwen3-VL-30B-A3B | 2025-01 | 128k | 30B | Yes | Yes | No | No | Yes |
Available From(5 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Qwen3 VL 8B Instruct | Novita AI | $0.08 | $0.5 | Serverless |
| Qwen3-VL-30B-A3B | OpenRouter | $0.13 | $1.56 | Serverless |
| Qwen3-VL-30B-A3B | Fireworks AI | $0.15 | $0.6 | Serverless |
| Qwen3 VL 30B A3B Instruct | Novita AI | $0.2 | $0.7 | Serverless |
| Qwen3-VL-235B-A22B | OpenRouter | $0.26 | $2.6 | Serverless |
| Qwen3 VL 235B A22B Instruct | Novita AI | $0.3 | $1.5 | Serverless |
| Qwen3 VL 235B A22B Instruct | Vercel AI Gateway | $0.4 | $1.6 | Serverless |
| Qwen3-VL-235B-A22B | AWS Bedrock | $0.53 | $2.66 | Serverless |
| Qwen3-VL-235B-A22B | Novita AI | $0.98 | $3.95 | Serverless |
Frequently Asked Questions
- What is Qwen3-VL used for?
- Qwen3-VL is used for multimodal, vision and multimodal work, and agent workflows and tool use. The family description and listed model capabilities point to those workloads as the best fit.
- How does Qwen3-VL compare to Tongyi DeepResearch?
- Qwen3-VL by Alibaba is strongest where you need multimodal, while Tongyi DeepResearch by Alibaba is the closest related family to check for adjacent model selection. Qwen3-VL has 6 listed variants and reaches up to 256k context, while Tongyi DeepResearch reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
- Which Qwen3-VL model should I use?
- For the lowest listed input price, start with Qwen3 VL 8B Instruct through Novita AI at $0.08/1M input tokens. For the most capable/latest local choice, evaluate Qwen3 VL 235B A22B Instruct with 256k context and tool use, function calling, structured outputs, and multimodal inputs.
Models(6)
Qwen3-VL-235B-A22B
Qwen3 VL 32B Instruct
Qwen3 VL 8B Instruct
Qwen3 VL 30B A3B Instruct
Qwen3 VL 235B A22B Instruct
Qwen3-VL-30B-A3B






