ERNIE 4.5 Turbo VL
ernie-4.5-turbo-vl
ProprietaryMultimodal
About
ERNIE 4.5 Turbo VL is the vision-language variant of ERNIE 4.5 Turbo, launched April 24, 2025. Supports image and text inputs with strong visual understanding and multimodal reasoning at reduced cost. API model IDs: ernie-4.5-turbo-vl (primary, 128K context); ernie-4.5-turbo-vl-32k (32K context).
ERNIE 4.5 Turbo VL has a 128K-token context window.
ERNIE 4.5 Turbo VL input tokens at $0.44/1M, output at $1.33/1M.
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode ExecutionPrompt CachingBatch APIAudioFine-tuning
Providers(1)
| Provider | Input (per 1M) | Output (per 1M) | Type | |
|---|---|---|---|---|
| Baidu Qianfan | $0.44 | $1.33 | Serverless |