LLM ReferenceLLM Reference

GPT Realtime Translate

gpt-realtime-translate

ProprietaryMultimodal

About

GPT Realtime Translate is OpenAI's live speech-to-speech translation model, released May 7, 2026. It translates spoken input from 70+ languages into 13 output languages in real time without requiring speakers to pause or complete full sentences. The model is exposed through the /v1/realtime/translations endpoint and is priced per minute at $0.034 rather than per token.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode ExecutionPrompt CachingBatch APIAudioFine-tuning

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
OpenAI APIServerless

API Versions

gpt-realtime-translate

Rankings

Specifications

Released2026-05-07
ArchitectureDecoder Only
Specializationtranslation
LicenseProprietary
Trainingpretrained

Created by

Cutting-edge research and development.

San Francisco, California, United States
Founded 2015
Website

Providers(1)