LLM Reference

o3 Models by OpenAI

5 models2025Up to 200k ctxFrom $1/1M input

About

o3 is a family of 5 AI models by OpenAI, released in 2025.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

3 in view2 retired

Use when the workload needs 200k context, reasoning, and tool use.

2025-10200k contextreasoningtool use
o3-proCurrent

Use when the workload needs 200k context, reasoning, and tool use.

2025-06200k contextreasoningtool use
o3Current

Use when the workload needs 200k context, reasoning, and tool use.

2025-04200k contextreasoningtool use

Release Timeline

4 release groups
2025-10
1 current
o3 Deep Research
200k contextreasoningtool use
Current
2025-06
1 current
o3-pro
200k contextreasoningtool use
Current
2025-04
1 current · 1 retired
o3
200k contextreasoningtool use
Current
o4-mini
200k contextreasoningtool use
Replaced
2025-01
1 retired
o3 Mini
200k contextreasoningtool use
Replaced

Replaced By

Keep for legacy integrations; evaluate GPT-5 Mini before new work.

Replaced

Keep for legacy integrations; evaluate o3 before new work.

Specifications(5 models)

o3 model specifications comparison
ModelReleasedContextVisionMultimodalReasoningFn CallingTool UseStructured OutputsCode Exec
o3 Deep Research2025-10200kYesYesYesYesYesYesNo
o3-pro2025-06200kYesYesYesYesYesYesYes
o32025-04200kYesYesYesYesYesYesYes

Available From(5 providers)

Pricing

o3 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
o3OpenAI API$2$8Serverless
o3OpenRouter$2$8Serverless
o3Vercel AI Gateway$2$8Serverless
o3 Deep ResearchVercel AI Gateway$10$40Serverless
o3-proOpenRouter$20$80Serverless
o3-proOpenAI API$20$80Serverless
o3-proVercel AI Gateway$20$80Serverless

Comparisons

All comparisons →

Frequently Asked Questions

What is o3 used for?
o3 is used for vision and multimodal work, reasoning, and agent workflows and tool use. The family description and listed model capabilities point to those workloads as the best fit.
How does o3 compare to GPT Realtime 2?
o3 by OpenAI is strongest where you need vision and multimodal work, while GPT Realtime 2 by OpenAI is the closest related family to check for translation. o3 has 5 listed variants and reaches up to 200k context, while GPT Realtime 2 reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
Which o3 model should I use?
For the lowest listed input price, start with o4-mini through OpenAI API at $1.1/1M input tokens. For the most capable/latest local choice, evaluate o3-pro with 200k context and reasoning, tool use, function calling, structured outputs, and multimodal inputs.

Models(5)