LLM Reference

Qwen3-VL Models by Alibaba

6 models, released 2025, up to 256K context, from $0.13 per 1M input tokens

About

Qwen3-VL is a family of 6 AI models by Alibaba, released in 2025.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Qwen3-VL-235B-A22B (2025-12): use when the workload needs structured outputs and multimodal inputs.

Qwen3 VL 32B Instruct (2025-09): use when the workload needs 128K context, 32B parameters, and tool use.

Qwen3 VL 8B Instruct (2025-09): use when the workload needs 128K context, 8B parameters, and tool use.

Qwen3 VL 30B A3B Instruct (2025-09): use when the workload needs 128K context, 30B parameters, and tool use.

Qwen3 VL 235B A22B Instruct (2025-09): use when the workload needs 256K context, 235B parameters, and tool use.

Qwen3-VL-30B-A3B (2025-01): use when the workload needs multimodal inputs, 30B parameters, and structured outputs.

Release Timeline

3 release groups

2025-12 (1 current)
Qwen3-VL-235B-A22B: structured outputs, multimodal inputs (Current)

2025-09 (4 current)
Qwen3 VL 235B A22B Instruct: 256K context, 235B parameters, tool use (Current)
Qwen3 VL 30B A3B Instruct: 128K context, 30B parameters, tool use (Current)
Qwen3 VL 32B Instruct: 128K context, 32B parameters, tool use (Current)
Qwen3 VL 8B Instruct: 128K context, 8B parameters, tool use (Current)

2025-01 (1 current)
Qwen3-VL-30B-A3B: multimodal, 30B parameters, structured outputs (Current)

Specifications (6 models)

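Qwen3-VL model specifications comparison

| Model | Released | Context | Parameters | Vision | Multimodal | Fn Calling | Tool Use | Structured Outputs |
|---|---|---|---|---|---|---|---|---|
| Qwen3-VL-235B-A22B | 2025-12 | not listed | not listed | No | Yes | No | No | Yes |
| Qwen3 VL 32B Instruct | 2025-09 | 128K | 32B | Yes | Yes | Yes | Yes | Yes |
| Qwen3 VL 8B Instruct | 2025-09 | 128K | 8B | Yes | Yes | Yes | Yes | Yes |
| Qwen3 VL 30B A3B Instruct | 2025-09 | 128K | 30B | Yes | Yes | Yes | Yes | Yes |
| Qwen3 VL 235B A22B Instruct | 2025-09 | 256K | 235B | Yes | Yes | Yes | Yes | Yes |
| Qwen3-VL-30B-A3B | 2025-01 | not listed | 30B | Yes | Yes | No | No | Yes |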
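As a minimal sketch of how the specification rows above could be used to shortlist variants, the following Python snippet encodes a subset of the table and filters it by context window and tool use. The `Spec` class and `shortlist` function are hypothetical names introduced for illustration; the values are copied from the table, with `None` standing in for a context size the table does not list.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Spec:
    name: str
    released: str
    context_k: Optional[int]  # context window in thousands of tokens; None if not listed
    tool_use: bool
    structured_outputs: bool


# Rows copied from the specifications table (subset of columns).
SPECS = [
    Spec("Qwen3-VL-235B-A22B", "2025-12", None, False, True),
    Spec("Qwen3 VL 32B Instruct", "2025-09", 128, True, True),
    Spec("Qwen3 VL 8B Instruct", "2025-09", 128, True, True),
    Spec("Qwen3 VL 30B A3B Instruct", "2025-09", 128, True, True),
    Spec("Qwen3 VL 235B A22B Instruct", "2025-09", 256, True, True),
    Spec("Qwen3-VL-30B-A3B", "2025-01", None, False, True),
]


def shortlist(specs: List[Spec], min_context_k: int = 0,
              need_tool_use: bool = False) -> List[str]:
    """Return names of models that meet the stated requirements.

    A model with no listed context is excluded whenever a minimum
    context is requested, because the requirement cannot be verified.
    """
    names = []
    for s in specs:
        if need_tool_use and not s.tool_use:
            continue
        if min_context_k and (s.context_k is None or s.context_k < min_context_k):
            continue
        names.append(s.name)
    return names


if __name__ == "__main__":
    print(shortlist(SPECS, min_context_k=200, need_tool_use=True))
```

With the rows as listed, only Qwen3 VL 235B A22B Instruct satisfies a 200K-token minimum together with tool use, and four variants remain if only tool use is required.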

Available From (3 providers)

Pricing

Qwen3-VL model pricing by provider

| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Qwen3-VL-30B-A3B | OpenRouter | $0.13 | $1.56 | Serverless |
| Qwen3-VL-30B-A3B | Fireworks AI | $0.15 | $0.60 | Serverless |
| Qwen3-VL-235B-A22B | OpenRouter | $0.26 | $2.60 | Serverless |
| Qwen3-VL-235B-A22B | AWS Bedrock | $0.53 | $2.66 | Serverless |
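Because input and output tokens are priced separately, the provider with the lowest input price is not always the cheapest overall. The sketch below estimates cost from the listed per-1M-token prices. The function names `estimate_cost` and `cheapest_provider` are hypothetical, and the calculation assumes billing is purely per token, ignoring any image-specific or minimum charges a provider may apply.

```python
# (model, provider) -> (input USD per 1M tokens, output USD per 1M tokens),
# copied from the pricing table.
PRICES = {
    ("Qwen3-VL-30B-A3B", "OpenRouter"): (0.13, 1.56),
    ("Qwen3-VL-30B-A3B", "Fireworks AI"): (0.15, 0.60),
    ("Qwen3-VL-235B-A22B", "OpenRouter"): (0.26, 2.60),
    ("Qwen3-VL-235B-A22B", "AWS Bedrock"): (0.53, 2.66),
}


def estimate_cost(model: str, provider: str,
                  input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost, assuming pure per-token billing."""
    in_price, out_price = PRICES[(model, provider)]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def cheapest_provider(model: str, input_tokens: int, output_tokens: int) -> str:
    """Provider with the lowest estimated cost for this token mix."""
    costs = {
        provider: estimate_cost(model, provider, input_tokens, output_tokens)
        for (m, provider) in PRICES
        if m == model
    }
    return min(costs, key=costs.get)


if __name__ == "__main__":
    model = "Qwen3-VL-30B-A3B"
    for provider in ("OpenRouter", "Fireworks AI"):
        cost = estimate_cost(model, provider, 10_000_000, 2_000_000)
        print(f"{provider}: ${cost:.2f}")
    print(cheapest_provider(model, 10_000_000, 2_000_000))
```

For a workload of 10M input and 2M output tokens on Qwen3-VL-30B-A3B, this gives about $4.42 on OpenRouter and $2.70 on Fireworks AI. At the listed prices, Fireworks AI becomes the cheaper option once output tokens exceed roughly 1/48 (about 2%) of input tokens.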

Frequently Asked Questions

What is Qwen3-VL used for?
Qwen3-VL is used for vision and multimodal work, and for agent workflows that rely on tool use. The family description and listed model capabilities point to those workloads as the best fit.
How does Qwen3-VL compare to Tongyi DeepResearch?
Qwen3-VL is the stronger fit where you need multimodal inputs; Tongyi DeepResearch, also by Alibaba, is the closest related family to consider as an alternative. Qwen3-VL has 6 listed variants and reaches up to 256K context, while Tongyi DeepResearch reaches up to 131K context. Compare the specs and pricing tables before choosing a production model.
Which Qwen3-VL model should I use?
For the lowest listed input price, start with Qwen3-VL-30B-A3B through OpenRouter at $0.13 per 1M input tokens. For the most capable choice among the listed variants, evaluate Qwen3 VL 235B A22B Instruct, which offers 256K context, tool use, function calling, structured outputs, and multimodal inputs.

Models (6)