Qwen3 VL 8B Instruct
qwen3-vl-8b-instruct
Open SourceMultimodal
About
Qwen3-VL-8B-Instruct is a compact 8B multimodal vision-language model from Alibaba, delivering high-fidelity image understanding and grounding at 128K context.
Qwen3 VL 8B Instruct has a 128K-token context window.
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution