GPT-4o-mini
GPT-4o-mini is worth evaluating for rag, long context, and vision when its provider route and context window match the workload.
Use it for
- Teams evaluating rag, long context, and vision
- Workloads that can use a 128k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Family
- GPT-4o
- Released
- 2024-07-18
- Context
- 128k
- Max output
- 16,384
- Architecture
- Decoder Only
- Knowledge cutoff
- 2023-10
- Specialization
- general
- Openness
- Proprietary
- License
- ProprietaryCommercial use: conditional
Cheapest of 4 routes · OpenAI API · cache read $0.075
About
OpenAI: GPT-4o-mini available via OpenRouter. Pricing: $0.15/1M input, $0.6/1M output.
GPT-4o-mini is a proprietary model in the GPT-4o family. The structured metadata tracks a 128k-token context window and structured outputs. This page tracks provider routes through OpenAI API, Azure OpenAI, OpenRouter, and 1 more, with the cheapest tracked route listed at $0.15 input and $0.6 output per 1M tokens. Headline tracked benchmarks include Chatbot Arena 1235.0, Massive Multi-discipline Multimodal Understanding 59.4, and MMMU Pro 55.3.
Top use-case fit
RAG
Included by capability and metadata signals in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Vision
Q/$ A1 relevant benchmark in the decision map.
Provider price ladder
Compare all 4Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Cache | Route |
|---|---|---|---|---|
| OpenAI API | $0.150 | $0.600 | read $0.075 | Serverless |
| OpenRouter | $0.150 | $0.600 | - | Serverless |
| Vercel AI Gateway | $0.150 | $0.600 | read $0.075 | Serverless |
| Azure OpenAI | - | - | - | ServerlessPartial |
Available via routers & gateways(16)
LiteLLM
GatewayOpen-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
OpenRouter
HybridUnified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.
Portkey
GatewayProduction AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.
AIRouter
RouterCommercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.
Azure AI Foundry Model Router
RouterMicrosoft Azure AI Foundry's native model router that uses a trained ML model to route each prompt in real time to the optimal Azure-hosted model, with Balanced/Cost/Quality mode selection and automatic failover.
Helicone
GatewayObservability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.
Capabilities
Benchmark peer barsfor Vision
Benchmark scores(3)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Chatbot Arena | 1235.0 | — | https://lmarena.ai |
| Massive Multi-discipline Multimodal Understanding | 59.4 | — | https://mmmu-benchmark.github.io/ |
| MMMU Pro | 55.3 | standard 4-option (original paper harness) | https://arxiv.org/html/2409.02813v3 |
Migration checks
No linked migration route is available for this model yet.
Compare GPT-4o-mini with other models
Comparison and alternatives
Browse all comparisons →Frequently asked questions
What is the context window of GPT-4o-mini?
GPT-4o-mini has a context window of 128k tokens.
What is the max output of GPT-4o-mini?
GPT-4o-mini can generate up to 16,384 output tokens.
How much does GPT-4o-mini cost?
GPT-4o-mini is available at $0.15/1M input tokens through OpenAI API.
When was GPT-4o-mini released?
GPT-4o-mini was released on 2024-07-18.
Which providers offer GPT-4o-mini?
GPT-4o-mini is available from 4 providers: OpenAI API, Azure OpenAI, OpenRouter, Vercel AI Gateway.
What benchmarks has GPT-4o-mini been tested on?
GPT-4o-mini has been evaluated on 3 benchmarks, including Chatbot Arena, Massive Multi-discipline Multimodal Understanding, MMMU Pro.
Cheapest of 4 routes · OpenAI API · cache read $0.075