Last refreshed 2026-05-19. Next refresh: weekly.
Why use o4-mini on Replicate API?
Replicate API offers o4-mini with pay-as-you-go pricing at $1.00 per 1M input tokens and $4.00 per 1M output tokens. Replicate is a cloud platform for running machine learning models through a hosted API.
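Compare o4-mini across 4 providers to find the best fit for your use case.
Setup recipe

Install the Python client and set your API token:

    pip install replicate
    export REPLICATE_API_TOKEN=...

Then call the model from Python:

    import replicate

    # "openai/o4-mini" is the assumed "owner/model-name" slug; confirm it on the model's Replicate page.
    output = replicate.run(
        "openai/o4-mini",
        input={"prompt": "Hello"}
    )

o4-mini request example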
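
    import replicate

    # The client reads REPLICATE_API_TOKEN from the environment.
    # Model reference format: "owner/model-name" (latest version)
    # or "owner/model-name:version-hash" (pinned version).
    # "openai/o4-mini" is the assumed slug; confirm it on the model's Replicate page.
    output = replicate.run(
        "openai/o4-mini",
        input={"prompt": "Hello"}
    )

    # Output is a list or generator of strings depending on the model.
    print("".join(output))

Gotchas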
- Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
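- The examples read the token from the REPLICATE_API_TOKEN environment variable. If you store it under a different name, map that name back to REPLICATE_API_TOKEN in your application config before running them.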
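
The path layout described in the first gotcha can be sketched with the standard library alone. This is a minimal illustration, not Replicate's client code: `parse_model_ref` and `build_prediction_request` are hypothetical helper names, the `https://api.replicate.com` base URL and `Bearer` authorization header are assumptions to check against the current API reference, and the handling of a pinned version (sending it in the body to `/v1/predictions`) is likewise an assumption. The request is only built here, never sent.

```python
import json
import os
import urllib.request
from typing import Optional, Tuple

API_BASE = "https://api.replicate.com"  # assumption: public API base URL


def parse_model_ref(ref: str) -> Tuple[str, str, Optional[str]]:
    """Split "owner/model-name" or "owner/model-name:version" into parts."""
    path, _, version = ref.partition(":")
    owner, sep, name = path.partition("/")
    if not sep or not owner or not name or "/" in name:
        raise ValueError(f'expected "owner/model-name[:version]", got {ref!r}')
    return owner, name, version or None


def build_prediction_request(ref: str, model_input: dict, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request that creates a prediction."""
    owner, name, version = parse_model_ref(ref)
    body = {"input": model_input}
    if version is None:
        # Latest version: owner and model name go into the URL path.
        url = f"{API_BASE}/v1/models/{owner}/{name}/predictions"
    else:
        # Assumption: a pinned version is sent in the body instead of the path.
        url = f"{API_BASE}/v1/predictions"
        body["version"] = version
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    token = os.environ.get("REPLICATE_API_TOKEN", "")
    req = build_prediction_request(
        "meta/meta-llama-3-8b-instruct", {"prompt": "Hello"}, token
    )
    print(req.get_method(), req.full_url)
```

Passing the request to `urllib.request.urlopen` would send it; keeping construction separate makes the URL and body easy to inspect before spending tokens.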
Compare o4-mini Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| OpenAI API | $1.10 | $4.40 |
| OpenRouter | $1.10 | $4.40 |
| Replicate API | $1.00 | $4.00 |
| Vercel AI Gateway | $1.10 | $4.40 |
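
To see what the price differences in the table mean for a concrete workload, the arithmetic can be sketched as follows. The prices are copied from the table above; the function names and the example workload (2M input tokens, 500k output tokens) are hypothetical and only there for illustration.

```python
# Per-1M-token prices in USD as (input, output), copied from the table above.
PRICES = {
    "OpenAI API": (1.10, 4.40),
    "OpenRouter": (1.10, 4.40),
    "Replicate API": (1.00, 4.00),
    "Vercel AI Gateway": (1.10, 4.40),
}


def workload_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a workload on one provider."""
    in_price, out_price = PRICES[provider]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def cheapest_provider(input_tokens: int, output_tokens: int) -> str:
    """Provider with the lowest cost for the given token counts."""
    return min(PRICES, key=lambda p: workload_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workload: 2M input tokens and 500k output tokens.
    for provider in PRICES:
        print(f"{provider}: ${workload_cost(provider, 2_000_000, 500_000):.2f}")
    print("Cheapest:", cheapest_provider(2_000_000, 500_000))
```

Token prices are only one input to a provider decision; rate limits, latency, and feature support are not captured by this calculation.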
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $1.00 |
| Output tokens | $4.00 |
Capabilities
About o4-mini
o4-mini is a fast, cost-efficient reasoning model with vision support, aimed at math, coding, and visual understanding. It was released on April 16, 2025. It was retired from ChatGPT on February 13, 2026 but remains available via the API.
FAQ
What does o4-mini cost on Replicate API?
On Replicate API, o4-mini costs $1.00 per 1M input tokens and $4.00 per 1M output tokens.
What is the context window for o4-mini on Replicate API?
o4-mini supports a 200k token context window on Replicate API.
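
A rough pre-flight check against that window might look like the sketch below. It assumes about four characters per token, which is a common rule of thumb for English text and not a real tokenizer, and it assumes the output budget counts against the same 200k window. The function names and the default output reservation are hypothetical.

```python
CONTEXT_WINDOW = 200_000  # tokens, from the answer above
CHARS_PER_TOKEN = 4       # assumption: rough heuristic, not a real tokenizer


def estimate_tokens(text: str) -> int:
    """Very rough token estimate using ceiling division on character count."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_context(prompt: str, reserved_output_tokens: int = 4_000) -> bool:
    """True if the estimated prompt plus the reserved output budget fits."""
    return estimate_tokens(prompt) + reserved_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    prompt = "Summarize the following report. " * 100
    print(estimate_tokens(prompt), fits_context(prompt))
```

For anything close to the limit, count tokens with the model's actual tokenizer instead of this estimate.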
How does Replicate API compare to other o4-mini providers?
o4-mini is available from 4 providers. Replicate API has the lowest listed pricing at $1.00 per 1M input tokens and $4.00 per 1M output tokens; the other three providers list $1.10 and $4.40.
Who created o4-mini?
o4-mini was created by OpenAI as part of its o-series of reasoning models.
Is o4-mini open source?
o4-mini is not open source; it is a proprietary model.