GPT-4o Mini TTS
GPT-4o Mini TTS is worth evaluating for vision when its provider route and context window match the workload.
Use it for
- Teams evaluating vision
- Workloads that can use a 2k context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Strict JSON or tool-calling flows
Model details
- Family: OpenAI Text-to-Speech
- Released: 2025-03-20
- Context: 2k
- Architecture: Decoder Only
- Knowledge cutoff: 2024-09
- Specialization: audio
- Openness: Proprietary
- License: Proprietary; commercial use with conditions
- Training: finetuned
Cheapest of 1 route · OpenAI API
About
GPT-4o Mini TTS is OpenAI's instructable text-to-speech model, built on GPT-4o mini and released March 20, 2025. It accepts natural-language instructions that control tone, style, pacing, and emotion: the model follows conversational prompts to adjust delivery rather than relying on static voice presets. It is OpenAI's recommended cost-efficient TTS option for production. Input (text tokens plus instructions) is priced at $0.60 per 1M tokens, and output (audio tokens) at $12.00 per 1M tokens. The model accepts up to 2,000 input tokens. API ID: gpt-4o-mini-tts.
GPT-4o Mini TTS is a proprietary model in the OpenAI Text-to-Speech family. The structured metadata tracks a 2k-token context window, multimodal input, and an audio specialization. This page tracks one provider route, through OpenAI API. No headline benchmark score is tracked for GPT-4o Mini TTS yet.
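As a rough illustration of the instructable interface described above, the following Python sketch builds a request body that pairs the text to speak with a natural-language delivery instruction, and guards against the 2,000-token input limit. Several things here are assumptions rather than facts from this page: the field names (`input`, `voice`, `instructions`) follow OpenAI's speech endpoint as commonly documented, the `alloy` voice is only a placeholder, the four-characters-per-token estimate is a crude heuristic rather than a real tokenizer, and counting the instructions against the limit is a conservative guess. The sketch only constructs the body; it does not send anything.

```python
import json

MODEL_ID = "gpt-4o-mini-tts"   # API ID stated on this page
MAX_INPUT_TOKENS = 2_000       # input limit stated on this page
CHARS_PER_TOKEN = 4            # assumption: rough heuristic, not a real tokenizer


def rough_token_count(text: str) -> int:
    """Crude token estimate: ceiling of len(text) / 4. A real tokenizer will differ."""
    return -(-len(text) // CHARS_PER_TOKEN)


def build_speech_request(text: str, instructions: str, voice: str = "alloy") -> dict:
    """Build a JSON-serializable body for a text-to-speech request.

    Field names are assumptions; check the current API reference before use.
    """
    # Assumption: count text and instructions together, as a conservative guard.
    estimated = rough_token_count(text) + rough_token_count(instructions)
    if estimated > MAX_INPUT_TOKENS:
        raise ValueError(
            f"estimated {estimated} tokens exceeds the {MAX_INPUT_TOKENS}-token limit"
        )
    return {
        "model": MODEL_ID,
        "input": text,
        "voice": voice,
        "instructions": instructions,
    }


body = build_speech_request(
    "Your order has shipped.",
    "Speak warmly and at a relaxed pace.",
)
print(json.dumps(body, indent=2))
```

Keeping the delivery guidance in a separate `instructions` string, rather than a fixed preset, means the same text can be re-rendered in a different tone by changing only that one field.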
Top use-case fit
Vision
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare API pricing for 1 provider across input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| OpenAI API | $0.600 | - | Serverless, Partial |
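The table lists only the input rate, while the About section also gives $12.00 per 1M audio output tokens. A minimal Python sketch of how the two rates combine into a per-request estimate follows. The function name and the example token counts are hypothetical; in particular, this page does not say how many audio tokens a given length of speech produces, so the 50,000 figure is purely illustrative.

```python
INPUT_USD_PER_M = 0.60    # text + instruction tokens, per 1M (from this page)
OUTPUT_USD_PER_M = 12.00  # audio output tokens, per 1M (from this page)


def estimate_cost_usd(input_tokens: int, audio_output_tokens: int) -> float:
    """Estimate the cost of one request from its input and audio-output token counts."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_M
    output_cost = audio_output_tokens / 1_000_000 * OUTPUT_USD_PER_M
    return input_cost + output_cost


# Hypothetical request: a full 2,000-token input and 50,000 audio output tokens.
cost = estimate_cost_usd(2_000, 50_000)
print(f"${cost:.4f}")  # prints "$0.6012"
```

With these rates the audio output dominates: the 2,000 input tokens contribute $0.0012, while the assumed output tokens contribute $0.60.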
Capabilities
Benchmark peer bars for Vision
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
API versions
- gpt-4o-mini-tts
- gpt-4o-mini-tts-2025-03-20
Rankings & picks (2)
Cheapest of 1 route · OpenAI API