LLM ReferenceLLM Reference

o4-mini Models by OpenAI

1 model2025Up to 200K ctxFrom $2/1M input

About

o4-mini is a family of 1 AI model by OpenAI, released in 2025.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view

Use when the workload needs 200K context and structured outputs.

2025-04200K contextstructured outputs

Release Timeline

1 release group
2025-04
1 current
o4 Mini Deep Research
200K contextstructured outputs
Current

Specifications(1 models)

o4-mini model specifications comparison
ModelReleasedContextStructured Outputs
o4 Mini Deep Research2025-04200KYes

Available From(1 provider)

Pricing

o4-mini model pricing by provider
ModelProviderInput / 1MOutput / 1MType
o4 Mini Deep ResearchOpenRouter$2$8Serverless

Frequently Asked Questions

What is o4-mini used for?
o4-mini is used for structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does o4-mini compare to GPT Realtime 2?
o4-mini by OpenAI is strongest where you need structured outputs, while GPT Realtime 2 by OpenAI is the closest related family to check for translation. o4-mini has 1 listed variant and reaches up to 200K context, while GPT Realtime 2 reaches up to 131K context, so compare the specs and pricing tables before choosing a production model.
Which o4-mini model should I use?
For the lowest listed input price, start with o4 Mini Deep Research through OpenRouter at $2/1M input tokens. For the most capable/latest local choice, evaluate o4 Mini Deep Research with 200K context and structured outputs.

Models(1)