GPT-4o-mini
gpt-4o-mini
Last refreshed 2026-05-11. Next refresh: weekly.
GPT-4o-mini is worth evaluating for RAG, long-context, and vision workloads when its provider route and 128K context window match the workload.
Decision context: Vision task fit, 3 tracked provider routes, and research from 2026-05-10.
Use it for
- Teams evaluating RAG, long context, and vision
- Workloads that can use a 128K context window
- Buyers comparing 3 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
Cheapest output
$0.600
OpenAI API per 1M tokens
Provider routes
3
Tracked API hosts
Quality / dollar
Grade A
Ranked by benchmark score divided by cheapest output price
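The ranking rule above can be sketched as a single division. This is a minimal illustration, not the site's actual scoring code: the function name `quality_per_dollar` is hypothetical, the choice of the MMMU score (59.4) as the "benchmark score" is an assumption, and the thresholds that turn the ratio into a letter grade are not published here, so they are left out.

```python
def quality_per_dollar(benchmark_score: float, cheapest_output_price: float) -> float:
    """Benchmark score divided by the cheapest output price (USD per 1M tokens).

    Hypothetical helper: the page only states the ratio, not how it maps to a grade.
    """
    if cheapest_output_price <= 0:
        raise ValueError("price must be positive")
    return benchmark_score / cheapest_output_price


# Assumed inputs: MMMU score 59.4 and the $0.600 cheapest output price listed on this page.
ratio = quality_per_dollar(59.4, 0.600)
print(f"score points per output dollar (per 1M tokens): {ratio:.1f}")
```

Because the result is a ratio, it is only meaningful when compared against other models scored on the same benchmark and priced in the same unit.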
Freshness
2026-05-10
Researched 8d ago
Top use-case fit
RAG
Included by capability and metadata signals in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Vision
Q/$ A. 1 relevant benchmark in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Cache | Route |
|---|---|---|---|---|
| OpenAI API | $0.150 | $0.600 | read $0.075 | Serverless |
| OpenRouter | $0.150 | $0.600 | - | Serverless |
| Azure OpenAI | - | - | - | Serverless (partial) |
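To compare routes on a concrete workload, the per-1M-token prices in the ladder can be turned into a per-request estimate. The sketch below is illustrative only: `PRICES` and `estimate_cost` are hypothetical names, the billing model (cached input tokens charged at the cache-read price instead of the input price) is an assumption, and routes with no listed cache price are assumed to charge the full input price. Azure OpenAI is omitted because no prices are listed for it.

```python
# Prices in USD per 1M tokens, copied from the provider price ladder above.
PRICES = {
    "OpenAI API": {"input": 0.150, "output": 0.600, "cache_read": 0.075},
    "OpenRouter": {"input": 0.150, "output": 0.600, "cache_read": None},
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (assumed billing model, see lead-in)."""
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")
    price = PRICES[provider]
    # Assumption: with no listed cache price, cached tokens cost the full input price.
    cache_price = price["cache_read"] if price["cache_read"] is not None else price["input"]
    uncached = input_tokens - cached_input_tokens
    total = (uncached * price["input"]
             + cached_input_tokens * cache_price
             + output_tokens * price["output"])
    return total / 1_000_000


# Example: a 100K-token prompt, half of it served from cache, with a 2K-token answer.
for name in PRICES:
    print(name, round(estimate_cost(name, 100_000, 2_000, cached_input_tokens=50_000), 6))
```

Under these assumptions the two priced routes differ only when a meaningful share of the prompt is cached.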
Benchmark peer bars for Vision
Migration checks
No linked migration route is available for this model yet.
About
GPT-4o-mini is an OpenAI model that is also available via OpenRouter. It has a 128K-token context window.
Pricing: input tokens cost $0.15 per 1M and output tokens cost $0.60 per 1M.
Capabilities
Benchmark Scores (2)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Chatbot Arena | 1235.0 | — | https://lmarena.ai |
| Massive Multi-discipline Multimodal Understanding | 59.4 | — | https://mmmu-benchmark.github.io/ |