LLM ReferenceLLM Reference

Qwen3 VL 8B Instruct

qwen3-vl-8b-instruct

Open SourceMultimodal

About

Qwen3-VL-8B-Instruct is a compact 8B multimodal vision-language model from Alibaba, delivering high-fidelity image understanding and grounding at 128K context.

Qwen3 VL 8B Instruct has a 128K-token context window.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Rankings

Specifications

FamilyQwen3-VL
Released2026-02-01
Parameters8B
Context128K
ArchitectureDecoder Only
Specializationgeneral
LicenseApache 2.0
Trainingpretrained

Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website