LLM Reference

ERNIE Models by Baidu AI

This model family is considered obsolete. Consider newer alternatives in Related Model Families below.
8 models · 2023–2025 · Up to 128k context · From $0.03/1M input

About

ERNIE is a family of large language models (LLMs) developed by Baidu and known for its strength on Chinese language tasks. The family spans several iterations, including ERNIE 1.0, ERNIE 2.0, ERNIE 3.0, ERNIE-Gram, and ERNIE-health, which differ in capabilities and model size. The models use a transformer-based architecture and are pre-trained on large text corpora combined with knowledge graphs, which supports a broad range of natural language processing tasks covering both language understanding and language generation.

ERNIE 3.0 uses knowledge graphs to improve comprehension and reasoning, while ERNIE Bot, powered by ERNIE 3.5, integrates plugins for real-time data retrieval and long text processing. ERNIE models are distributed on multiple platforms, including Hugging Face's model hub, and have reported state-of-the-art results on various benchmarks.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

ERNIE X1.1 (Current)

Use when the workload needs reasoning, 64k context, and tool use.

2025-09 · reasoning · 64k context · tool use

ERNIE X1 Turbo (Current)

Use when the workload needs reasoning, 32k context, and tool use.

2025-04 · reasoning · 32k context · tool use

ERNIE 4.5 (Current)

Use when the workload needs 8k context.

2025-03 · 8k context

ERNIE X1 (Current)

Use when provider availability and model metadata match the workload.

2025-03

ERNIE Lite Pro (Current)

Use when the workload needs 128k context.

2025-01 · 128k context

ERNIE Speed Pro (Current)

Use when the workload needs 128k context.

2025-01 · 128k context

ERNIE-4.0-8K (Current)

Use when the workload needs 8B parameters.

2023-10 · 8B parameters

ERNIE Bot (Current)

Use when provider availability and model metadata match the workload.

2023-08

Release Timeline

6 release groups
2025-09 (1 current)
    ERNIE X1.1 (Current) · reasoning · 64k context · tool use

2025-04 (1 current)
    ERNIE X1 Turbo (Current) · reasoning · 32k context · tool use

2025-03 (2 current)
    ERNIE 4.5 (Current) · 8k context
    ERNIE X1 (Current)

2025-01 (2 current)
    ERNIE Lite Pro (Current) · 128k context
    ERNIE Speed Pro (Current) · 128k context

2023-10 (1 current)
    ERNIE-4.0-8K (Current) · 8B parameters

2023-08 (1 current)
    ERNIE Bot (Current)

Specifications (8 models)

ERNIE model specifications comparison
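Model           | Released | Context | Parameters | Vision | Multimodal | Reasoning | Fn Calling | Tool Use
ERNIE X1.1      | 2025-09  | 64k     | -          | Yes    | Yes        | Yes       | Yes        | Yes
ERNIE X1 Turbo  | 2025-04  | 32k     | -          | Yes    | Yes        | Yes       | Yes        | Yes
ERNIE 4.5       | 2025-03  | 8k      | -          | No     | No         | No        | No         | No
ERNIE X1        | 2025-03  | -       | -          | No     | No         | No        | No         | No
ERNIE Speed Pro | 2025-01  | 128k    | -          | No     | No         | No        | No         | No
ERNIE Lite Pro  | 2025-01  | 128k    | -          | No     | No         | No        | No         | No
ERNIE-4.0-8K    | 2023-10  | -       | 8B         | No     | No         | No        | No         | No
ERNIE Bot       | 2023-08  | -       | -          | No     | No         | No        | No         | No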

Available From (2 providers)

Pricing

ERNIE model pricing by provider
Model           | Provider      | Input / 1M | Output / 1M | Type
ERNIE Lite Pro  | Baidu Qianfan | $0.03      | $0.059      | Serverless
ERNIE Speed Pro | Baidu Qianfan | $0.044     | $0.089      | Serverless
ERNIE X1 Turbo  | Baidu Qianfan | $0.15      | $0.59       | Serverless
ERNIE X1.1      | Baidu Qianfan | $0.15      | $0.59       | Serverless
ERNIE X1        | Baidu Qianfan | $0.3       | $1.18       | Serverless
ERNIE 4.5       | Baidu Qianfan | $0.59      | $2.36       | Serverless
ERNIE 4.5       | Fireworks AI  | $1.2       | $1.2        | Serverless
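
To illustrate how per-million-token prices translate into a bill, here is a minimal Python sketch. The prices are copied from the pricing table above; the names `PRICES` and `estimate_cost` are hypothetical and are not part of any provider SDK. Actual billing may differ from this simple arithmetic (for example through rounding or tiered pricing).

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
# Keys are (model, provider); values are (input_price, output_price).
# PRICES and estimate_cost are illustrative names, not a provider API.
PRICES = {
    ("ERNIE Lite Pro", "Baidu Qianfan"): (0.03, 0.059),
    ("ERNIE Speed Pro", "Baidu Qianfan"): (0.044, 0.089),
    ("ERNIE X1 Turbo", "Baidu Qianfan"): (0.15, 0.59),
    ("ERNIE X1.1", "Baidu Qianfan"): (0.15, 0.59),
    ("ERNIE X1", "Baidu Qianfan"): (0.3, 1.18),
    ("ERNIE 4.5", "Baidu Qianfan"): (0.59, 2.36),
    ("ERNIE 4.5", "Fireworks AI"): (1.2, 1.2),
}


def estimate_cost(model: str, provider: str,
                  input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload."""
    input_price, output_price = PRICES[(model, provider)]
    # Prices are quoted per 1M tokens, so scale the token counts down.
    return (input_tokens * input_price
            + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500k output tokens.
    for (model, provider) in PRICES:
        cost = estimate_cost(model, provider, 2_000_000, 500_000)
        print(f"{model} via {provider}: ${cost:.4f}")
```

Running the same workload through every row makes the spread between the cheapest and most expensive listings easy to see before committing to a model.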

Frequently Asked Questions

What is ERNIE used for?
ERNIE models are used for reasoning, vision and multimodal work, and agent workflows that rely on tool use. These are the workloads that the family description and the listed model capabilities point to as the best fit.
How does ERNIE compare to ERNIE 4.5?
ERNIE by Baidu AI is strongest where you need reasoning, while ERNIE 4.5 by Baidu AI is the closest related family to check for vision and multimodal work. ERNIE has 8 listed variants, and both families reach up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which ERNIE model should I use?
For the lowest listed input price, start with ERNIE Lite Pro through Baidu Qianfan at $0.03/1M input tokens. For the most capable and most recent model in this family, evaluate ERNIE X1.1, which lists 64k context, reasoning, tool use, function calling, and multimodal inputs.
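
The selection advice above can be sketched as a small filter over the listed data. This is an illustrative Python sketch only: the `Variant` class, `VARIANTS` list, and `cheapest_match` function are hypothetical names, the capability flags and context sizes are taken from the specifications table, and the input prices are the Baidu Qianfan rows of the pricing table. Variants with no listed price or context are recorded as `None`.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Variant:
    name: str
    context_k: Optional[int]       # context window in thousands of tokens, None if unlisted
    capabilities: frozenset        # capability flags marked "Yes" in the specs table
    input_price: Optional[float]   # USD per 1M input tokens (Baidu Qianfan), None if unlisted


ALL_CAPS = frozenset(
    {"vision", "multimodal", "reasoning", "function calling", "tool use"}
)
NO_CAPS = frozenset()

# Data transcribed from the specifications and pricing tables on this page.
VARIANTS = [
    Variant("ERNIE X1.1", 64, ALL_CAPS, 0.15),
    Variant("ERNIE X1 Turbo", 32, ALL_CAPS, 0.15),
    Variant("ERNIE 4.5", 8, NO_CAPS, 0.59),
    Variant("ERNIE X1", None, NO_CAPS, 0.3),
    Variant("ERNIE Speed Pro", 128, NO_CAPS, 0.044),
    Variant("ERNIE Lite Pro", 128, NO_CAPS, 0.03),
    Variant("ERNIE-4.0-8K", None, NO_CAPS, None),
    Variant("ERNIE Bot", None, NO_CAPS, None),
]


def cheapest_match(min_context_k: int = 0,
                   required: Iterable[str] = ()) -> Optional[Variant]:
    """Return the cheapest priced variant meeting the constraints, or None."""
    needed = frozenset(required)
    candidates = [
        v for v in VARIANTS
        if v.input_price is not None
        and (v.context_k or 0) >= min_context_k   # unlisted context counts as 0
        and needed <= v.capabilities
    ]
    # Ties on price are broken by position in VARIANTS (newest first).
    return min(candidates, key=lambda v: v.input_price, default=None)


if __name__ == "__main__":
    print(cheapest_match())
    print(cheapest_match(min_context_k=64, required=["reasoning", "tool use"]))
```

With no constraints the sketch returns ERNIE Lite Pro, and with a 64k context and reasoning requirement it returns ERNIE X1.1, matching the two recommendations in the answer above.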
