GPT Realtime Translate
gpt-realtime-translate
Proprietary, Multimodal
About
GPT Realtime Translate is OpenAI's live speech-to-speech translation model, released May 7, 2026. It translates spoken input from 70+ languages into 13 output languages in real time without requiring speakers to pause or complete full sentences. The model is exposed through the /v1/realtime/translations endpoint and is priced per minute at $0.034 rather than per token.
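Because the model is billed by time rather than by token, a session's cost can be estimated from audio duration alone. The following is a minimal sketch of that arithmetic, using the $0.034 per-minute price quoted above. The function name `estimate_translation_cost` is hypothetical, and the linear per-second proration is an assumption: the billing granularity (for example, rounding up to whole minutes) is not stated here.

```python
from decimal import Decimal

# Per-minute price quoted in the description above (USD).
PRICE_PER_MINUTE_USD = Decimal("0.034")


def estimate_translation_cost(audio_seconds: float) -> Decimal:
    """Estimate the USD cost of translating `audio_seconds` of audio.

    Assumption: cost scales linearly with duration and is not rounded up
    to whole minutes. Actual billing granularity may differ.
    """
    if audio_seconds < 0:
        raise ValueError("audio_seconds must be non-negative")
    minutes = Decimal(str(audio_seconds)) / Decimal(60)
    # Round to a hundredth of a cent for display purposes only.
    return (minutes * PRICE_PER_MINUTE_USD).quantize(Decimal("0.0001"))


if __name__ == "__main__":
    examples = [
        ("30-second clip", 30),
        ("10-minute call", 600),
        ("1-hour meeting", 3600),
    ]
    for label, seconds in examples:
        print(f"{label}: ${estimate_translation_cost(seconds)}")
```

Under these assumptions, an hour of continuous translation comes to about $2.04, which may be easier to budget for than token-based pricing because it does not depend on how much is said per minute.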
Capabilities
Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution, Prompt Caching, Batch API, Audio, Fine-tuning
Providers (1)
| Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type |
|---|---|---|---|
| OpenAI API | — | — | Serverless |
API Versions
gpt-realtime-translate
Specifications
Family: GPT Realtime 2
Released: 2026-05-07
Architecture: Decoder Only
Specialization: translation
License: Proprietary
Training: pretrained
Created by
OpenAI
Cutting-edge research and development.
San Francisco, California, United States
Founded 2015