LLM ReferenceLLM Reference

ERNIE 5.0

ernie-5.0

ProprietaryMultimodal

About

ERNIE 5.0 is Baidu's fifth-generation flagship foundation model, officially launched January 22, 2026 (preview at Baidu World November 13, 2025). It is a fully native multimodal model supporting text, image, audio, and video understanding and generation under a unified autoregressive framework, trained simultaneously across modalities from scratch. With 2.4 trillion total parameters and ultra-sparse MoE activation engaging <3% of parameters per inference, it delivers frontier performance at high efficiency. Available on Baidu AI Cloud's Qianfan MaaS platform. API model IDs: ernie-5.0; ernie-5.0-thinking-preview (thinking mode).

ERNIE 5.0 has a 128K-token context window.

ERNIE 5.0 input tokens at $0.89/1M, output at $3.54/1M.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode ExecutionPrompt CachingBatch APIAudioFine-tuning

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
Baidu Qianfan$0.89$3.54Serverless

Rankings

Specifications

FamilyERNIE 5
Released2026-01-22
Parameters2.4T
Context128K
Max output65,536
Architecturemoe
Specializationgeneral
LicenseProprietary
Trainingpretrained

Created by

Innovative text-to-video and app builder

Beijing, China
Founded 2010
Website

Providers(1)