o1 is used for reasoning, structured outputs, and code execution. The family description and listed model capabilities point to those workloads as the best fit.

How does o1 compare to GPT Realtime 2?

o1 by OpenAI is strongest where you need reasoning, while GPT Realtime 2 by OpenAI is the closest related family to check for realtime voice. o1 has 4 listed variants and reaches up to 200k context, while GPT Realtime 2 reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.

Which o1 model should I use?

For the lowest listed input price, start with o1-mini (09-12) through Replicate API at $1.1/1M input tokens. For the most capable/latest local choice, evaluate o1-preview (09-12) with 128k context and reasoning.

o1 Models by OpenAI

OpenAIProprietaryHighlight

4 models2024Up to 200k ctxFrom $1.1/1M input

Details

ResearcherOpenAI

LicenseProprietary

Commercial useCommercial use: conditional

Models4

Released2024

Max context200k

Capabilities

Reasoning3 of 4 models

Structured Outputs1 of 4 models

Code Execution3 of 4 models

Links

Website

About

OpenAI's o1 family of large language models (LLMs) represents a significant advancement in AI's reasoning capabilities 4712. Unlike previous models that primarily focused on predicting the next word in a sequence, o1 models are trained to "think" before responding, employing a chain-of-thought process to solve complex problems 4712. This involves breaking down complex tasks into smaller, manageable steps, exploring different approaches, and correcting errors along the way 4712. The o1 models demonstrate improved performance on various benchmarks, including those involving mathematics, coding, and science, rivaling or exceeding the capabilities of human experts in certain areas 4712. However, these models are still under development and may lack some features found in earlier models, such as web browsing and image processing 15.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

3 in view1 retired

o1-proCurrent

Use when the workload needs 200k context and structured outputs.

2024-12200k contextstructured outputs

o1-preview (09-12)Current

Use when the workload needs 128k context, reasoning, and code execution.

2024-09128k contextreasoningcode execution

o1-mini (09-12)Current

Use when the workload needs 128k context, reasoning, and code execution.

2024-09128k contextreasoningcode execution

Current o1 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
o1-pro	Use when the workload needs 200k context and structured outputs.	2024-12	200k contextstructured outputs	Current
o1-preview (09-12)	Use when the workload needs 128k context, reasoning, and code execution.	2024-09	128k contextreasoningcode execution	Current
o1-mini (09-12)	Use when the workload needs 128k context, reasoning, and code execution.	2024-09	128k contextreasoningcode execution	Current

Release Timeline

2 release groups

2024-12

1 current · 1 retired

o1 (12-17)

128k contextreasoningcode execution

Replaced

o1-pro

200k contextstructured outputs

Current

2024-09

2 current

o1-mini (09-12)

128k contextreasoningcode execution

Current

o1-preview (09-12)

128k contextreasoningcode execution

Current

Replaced By

o1 (12-17)o3

Replaced

Keep for legacy integrations; evaluate o3 before new work.

Specifications(4 models)

o1 model specifications comparison
Model	Released	Context	Reasoning	Structured Outputs	Code Exec
o1-pro	2024-12	200k	No	Yes	No
o1-preview (09-12)	2024-09	128k	Yes	No	Yes
o1-mini (09-12)	2024-09	128k	Yes	No	Yes

Available From(3 providers)

OpenAI API

OpenRouter

Replicate API

Pricing

o1 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
o1-mini (09-12)	Replicate API	$1.1	$4.4	Serverless
o1-pro	OpenRouter	$150	$600	Serverless

Popular comparisons in this family

Comparisons

All comparisons →

Frequently Asked Questions

What is o1 used for?: o1 is used for reasoning, structured outputs, and code execution. The family description and listed model capabilities point to those workloads as the best fit.
How does o1 compare to GPT Realtime 2?: o1 by OpenAI is strongest where you need reasoning, while GPT Realtime 2 by OpenAI is the closest related family to check for realtime voice. o1 has 4 listed variants and reaches up to 200k context, while GPT Realtime 2 reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
Which o1 model should I use?: For the lowest listed input price, start with o1-mini (09-12) through Replicate API at $1.1/1M input tokens. For the most capable/latest local choice, evaluate o1-preview (09-12) with 128k context and reasoning.

Models(4)

o1-pro

2024-12200k1 provider

o1-preview (09-12)

2024-09128k

Reasoning

o1-mini (09-12)

2024-09128k1 provider

Reasoning

o1 Models by OpenAI

Details

Capabilities

Links

About

Current Variants

Release Timeline

Replaced By

Specifications(4 models)

Available From(3 providers)

Pricing

Popular comparisons in this family

Comparisons

Frequently Asked Questions

Related Model Families

Models(4)