Qwen3 VL 32B Instruct
Qwen3 VL 32B Instruct has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating rag, agents, and long context
- Workloads that can use a 128k context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Teams that need a tracked hosted API route today
- Family
- Qwen3-VL
- Released
- 2025-09-18
- Context
- 128k
- Parameters
- 32B
- Architecture
- Decoder Only
- Specialization
- general
- Openness
- Open source
- License
- Apache 2.0(OSI)Commercial use allowed
- Training
- pretrained
No tracked provider token pricing is available yet.
About
Qwen3-VL-32B-Instruct is a 32B multimodal vision-language model from Alibaba's Qwen3-VL series, delivering high-precision image understanding and reasoning at 128K context.
Qwen3 VL 32B Instruct is an open-source model in the Qwen3-VL family. The structured metadata tracks a 128k-token context window, multimodal input, function calling, tool use, and structured outputs. Headline tracked benchmarks include MMMU Pro 68.1.
Top use-case fit: coding, agents, and build tasks
RAG
Included by capability and metadata signals in the decision map.
Agents
Included by capability and metadata signals in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
Benchmark peer barsfor RAG
No task-mapped benchmark peers are available for this model yet.
Benchmark scores(1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| MMMU Pro | 68.1 | LLM-Stats aggregator, thinking mode (highest) | https://llm-stats.com/benchmarks/mmmu-pro |
Migration checks
No linked migration route is available for this model yet.
No tracked provider token pricing is available yet.