ERNIE Speed Pro
ERNIE Speed Pro is worth evaluating for long context when its provider route and context window match the workload.
Use it for
- Teams evaluating long context
- Workloads that can use a 128k context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family: ERNIE
- Released: 2025-01-01
- Context: 128k
- Max output: 4,096
- Architecture: Decoder Only
- Specialization: general
- Openness: Proprietary
- License: Proprietary
- Commercial use: conditional
- Training: Pretrained
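As a rough illustration of how the context and max-output figures above constrain a request, the following sketch checks whether a prompt fits. It rests on two assumptions that this page does not confirm: that "128k" means 128,000 tokens, and that generated output counts against the same window. The function name `fits_context` is hypothetical.

```python
# Figures taken from the spec list above.
# Assumption: "128k" is read as 128,000 tokens (it could also mean 131,072).
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 4_096


def fits_context(prompt_tokens: int, requested_output: int = MAX_OUTPUT) -> bool:
    """Return True if the request stays within the listed limits.

    Assumption: prompt and output tokens share one context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output > MAX_OUTPUT:
        # More output than the listed maximum can never be satisfied.
        return False
    return prompt_tokens + requested_output <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(fits_context(120_000))  # prompt plus full output budget
    print(fits_context(125_000))  # prompt leaves too little room for output
```

Under these assumptions, a prompt needs to stay at or below 123,904 tokens if the full 4,096-token output budget is reserved.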
Cheapest of 1 route · Baidu Qianfan
About
ERNIE Speed Pro is Baidu's high-throughput lightweight model, available via the Qianfan API. It offers a 128K context window and efficient inference at very low cost ($0.044 per 1M input tokens). It is the Pro-tier successor to the legacy free ERNIE Speed model. The API model ID is ernie-speed-pro-128k. The exact release date is not publicly documented.
ERNIE Speed Pro is a proprietary model in the ERNIE family. The structured metadata tracks a 128k-token context window. This page tracks provider routes through Baidu Qianfan, with the cheapest tracked route listed at $0.044 input and $0.089 output per 1M tokens. No headline benchmark score is tracked for ERNIE Speed Pro yet.
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare API pricing across 1 provider for input and output tokens, plus batch and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Baidu Qianfan | $0.044 | $0.089 | Serverless |
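To make the per-1M-token prices in the table concrete, here is a minimal sketch that estimates the cost of a single request on the Baidu Qianfan route. The helper name `estimate_cost` is hypothetical, and the calculation assumes simple linear per-token billing with no batch or cached-read discounts.

```python
from decimal import Decimal

# USD prices per 1M tokens, copied from the Baidu Qianfan row above.
INPUT_PRICE_PER_M = Decimal("0.044")
OUTPUT_PRICE_PER_M = Decimal("0.089")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption: billing is linear in tokens, with no minimum charge
    and no batch or cached-read pricing applied.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_PRICE_PER_M
             + Decimal(output_tokens) * OUTPUT_PRICE_PER_M)
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # A long-context request: 100,000 input tokens, 4,096 output tokens.
    print(estimate_cost(100_000, 4_096))  # 0.004764544
```

Decimal is used instead of float so that small per-request amounts add up without rounding drift when summed over many requests.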
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Long context
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.