gpt-audio-1.5
Proprietary, Multimodal
About
Best voice model for audio in / audio out via the Chat Completions API (non-realtime). Replaces deprecated gpt-4o-audio-preview (shut down May 7, 2026).
gpt-audio-1.5 has a 128K-token context window.
Input tokens cost $2.50 per 1M; output tokens cost $10.00 per 1M.
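As a rough illustration of the listed rates, the sketch below estimates the cost of a single request. The function name `estimate_cost_usd` and the constants are hypothetical, and the calculation assumes the listed per-1M rates apply uniformly to every input and output token in a request, which this page does not confirm (audio and text tokens may be billed differently).

```python
# Rates copied from the listing above (USD per 1M tokens).
INPUT_USD_PER_MILLION = 2.50
OUTPUT_USD_PER_MILLION = 10.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD.

    Assumption: one flat rate for all input tokens and one flat rate
    for all output tokens, as listed on this page.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens comes to about $0.045: $0.025 for input plus $0.020 for output.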
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution, Prompt Caching, Batch API, Audio, Fine-tuning
Providers (1)
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type |
|---|---|---|---|
| OpenAI API | $2.50 | $10.00 | Serverless |