GPT-4o Models by OpenAI
OpenAIHighlight
11 models2024–2025Up to 128K ctxFrom $0.15/1M input
About
GPT-4o is OpenAI's most advanced model to date. This multimodal model handles both text and image inputs while generating text outputs. Matching the intelligence of GPT-4 Turbo, it is remarkably more efficient, delivering text at twice the speed and at half the cost. Additionally, GPT-4o exhibits the highest vision performance and excels in non-English languages compared to previous OpenAI models.
Specifications(11 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Fn Calling | Tool Use | Structured Outputs | Code Exec |
|---|---|---|---|---|---|---|---|---|---|
| GPT-4o-mini Search Preview | 2025-02 | 128K | — | No | No | No | No | Yes | No |
| GPT-4o Search Preview | 2025-02 | 128K | — | No | No | No | No | Yes | No |
| GPT-4o (11-20) | 2024-11 | 128K | 1.76T (8x222B MoE)* | Yes | No | No | No | No | Yes |
| GPT-4o (2024-11-20) | 2024-11 | 128K | — | No | No | No | No | Yes | No |
| GPT-4o Audio | 2024-10 | 128K | — | No | No | No | No | No | No |
| GPT-4o-mini | 2024-07 | 128K | — | No | No | No | No | Yes | No |
| ChatGPT-4o | 2024-05 | 128K | — | Yes | No | No | No | No | Yes |
| GPT-4o | 2024-05 | 128K | — | Yes | Yes | Yes | Yes | Yes | Yes |
Available From(5 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| GPT-4o-mini | OpenAI API | $0.15 | $0.6 | Serverless |
| GPT-4o-mini | OpenRouter | $0.15 | $0.6 | Serverless |
| GPT-4o-mini Search Preview | OpenRouter | $0.15 | $0.6 | Serverless |
| GPT-4o | Replicate API | $2.5 | $10 | Serverless |
| GPT-4o Audio | OpenRouter | $2.5 | $10 | Serverless |
| GPT-4o Search Preview | OpenRouter | $2.5 | $10 | Serverless |
| GPT-4o (2024-11-20) | OpenRouter | $2.5 | $10 | Serverless |
| GPT-4o | OpenRouter | $2.5 | $10 | Serverless |
| GPT-4o | OpenAI API | $2.5 | $10 | Serverless |
Comparisons
- GPT-4o (08-06) vs Claude Sonnet 4.6
- GPT-4o (08-06) vs Claude Opus 4.6
- GPT-4o (08-06) vs Claude 3.5 Sonnet
- GPT-4o (08-06) vs Claude 3.7 Sonnet
- GPT-4o (08-06) vs Gemini 3.1 Pro
- GPT-4o (08-06) vs Gemini 2.5 Pro
- GPT-4o Mini (07-18) vs Gemini 2.5 Flash
- GPT-4o (11-20) vs Gemini 3.1 Pro
Frequently Asked Questions
- What is GPT-4o used for?
- GPT-4o is used for vision and multimodal work, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
- How does GPT-4o compare to GPT Realtime 2?
- GPT-4o by OpenAI is strongest where you need vision and multimodal work, while GPT Realtime 2 by OpenAI is the closest related family to check for translation. GPT-4o has 11 listed variants and reaches up to 128K context, while GPT Realtime 2 reaches up to 131K context, so compare the specs and pricing tables before choosing a production model.
- Which GPT-4o model should I use?
- For the lowest listed input price, start with GPT-4o Mini (07-18) through OpenAI API at $0.15/1M input tokens. For the most capable/latest local choice, evaluate GPT-4o with 128K context and tool use, function calling, structured outputs, and multimodal inputs.



