LLM ReferenceLLM Reference

Qwen3.5-Omni Flash

qwen3.5-omni-flash

ProprietaryMultimodal

About

Qwen3.5-Omni Flash is Alibaba's lower-latency omnimodal API model, released March 30, 2026. It keeps the Qwen3.5-Omni text, image, audio, and video input surface while reducing cost and latency for short video analysis and high-throughput multimodal workloads. API model ID: qwen3.5-omni-flash.

Qwen3.5-Omni Flash has a 256K-token context window.

Qwen3.5-Omni Flash input tokens at $0.1/1M, output at $0.8/1M.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode ExecutionPrompt CachingBatch APIAudioFine-tuning

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
Alibaba Cloud PAI-EAS$0.10$0.80Serverless

Rankings

Specifications

Released2026-03-30
Context262K
Specializationgeneral
LicenseProprietary
Trainingpretrained

Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website