Qwen3 VL 32B Instruct
qwen3-vl-32b-instruct
Open SourceMultimodal
About
Qwen3-VL-32B-Instruct is a 32B multimodal vision-language model from Alibaba's Qwen3-VL series, delivering high-precision image understanding and reasoning at 128K context.
Qwen3 VL 32B Instruct has a 128K-token context window.
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution