LLM ReferenceLLM Reference

GPT-4o Models by OpenAI

OpenAIHighlight
11 models2024–2025Up to 128K ctxFrom $0.15/1M input

About

GPT-4o is OpenAI's most advanced model to date. This multimodal model handles both text and image inputs while generating text outputs. Matching the intelligence of GPT-4 Turbo, it is remarkably more efficient, delivering text at twice the speed and at half the cost. Additionally, GPT-4o exhibits the highest vision performance and excels in non-English languages compared to previous OpenAI models.

Specifications(11 models)

GPT-4o model specifications comparison
ModelReleasedContextParametersVisionMultimodalFn CallingTool UseStructured OutputsCode Exec
GPT-4o-mini Search Preview2025-02128KNoNoNoNoYesNo
GPT-4o Search Preview2025-02128KNoNoNoNoYesNo
GPT-4o (11-20)2024-11128K1.76T (8x222B MoE)*YesNoNoNoNoYes
GPT-4o (2024-11-20)2024-11128KNoNoNoNoYesNo
GPT-4o Audio2024-10128KNoNoNoNoNoNo
GPT-4o-mini2024-07128KNoNoNoNoYesNo
ChatGPT-4o2024-05128KYesNoNoNoNoYes
GPT-4o2024-05128KYesYesYesYesYesYes

Available From(5 providers)

Pricing

GPT-4o model pricing by provider
ModelProviderInput / 1MOutput / 1MType
GPT-4o-miniOpenAI API$0.15$0.6Serverless
GPT-4o-miniOpenRouter$0.15$0.6Serverless
GPT-4o-mini Search PreviewOpenRouter$0.15$0.6Serverless
GPT-4oReplicate API$2.5$10Serverless
GPT-4o AudioOpenRouter$2.5$10Serverless
GPT-4o Search PreviewOpenRouter$2.5$10Serverless
GPT-4o (2024-11-20)OpenRouter$2.5$10Serverless
GPT-4oOpenRouter$2.5$10Serverless
GPT-4oOpenAI API$2.5$10Serverless

Comparisons

All comparisons →

Frequently Asked Questions

What is GPT-4o used for?
GPT-4o is used for vision and multimodal work, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does GPT-4o compare to GPT Realtime 2?
GPT-4o by OpenAI is strongest where you need vision and multimodal work, while GPT Realtime 2 by OpenAI is the closest related family to check for translation. GPT-4o has 11 listed variants and reaches up to 128K context, while GPT Realtime 2 reaches up to 131K context, so compare the specs and pricing tables before choosing a production model.
Which GPT-4o model should I use?
For the lowest listed input price, start with GPT-4o Mini (07-18) through OpenAI API at $0.15/1M input tokens. For the most capable/latest local choice, evaluate GPT-4o with 128K context and tool use, function calling, structured outputs, and multimodal inputs.

Models(11)