LLM ReferenceLLM Reference

Qwen VL

About

Multimodal vision-language model processing images and text for visual understanding.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
Replicate API$0.05$0.25Serverless

Rankings

Specifications

FamilyQwen VL
Released2023-11-30
Parameters7B
Context32K
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning

Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website

Providers(1)