LLM Reference

GPT Realtime Translate

Released
2026-05-07
Last refreshed
2026-05-16
Status
Researched 41d ago
ProprietaryCommercial use: conditionalMultimodalVision

GPT Realtime Translate is worth evaluating for vision when its provider route and context window match the workload.

Use it for

  • Teams evaluating vision
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Strict JSON or tool-calling flows
Specifications
Released
2026-05-07
Architecture
Decoder Only
Knowledge cutoff
2024-09
Specialization
translation
Openness
Proprietary
License
ProprietaryCommercial use: conditional
Training
Pretrained
Created by

Cutting-edge research and development.

San Francisco, California, United States
Founded 2015
Website
Pricing
Output / 1M
-
Input / 1M
-

Cheapest of 1 route · OpenAI API

About

GPT Realtime Translate is OpenAI's live speech-to-speech translation model, released May 7, 2026. It translates spoken input from 70+ languages into 13 output languages in real time without requiring speakers to pause or complete full sentences. The model is exposed through the /v1/realtime/translations endpoint and is priced per minute at $0.034 rather than per token.

GPT Realtime Translate is a proprietary model in the GPT Realtime 2 family. The structured metadata tracks multimodal input and audio. This page tracks provider routes through OpenAI API. No headline benchmark score is tracked for GPT Realtime Translate yet.

Top use-case fit

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
OpenAI API--
ServerlessPartial

Available via routers & gateways(15)

Capabilities

MultimodalAudio

Benchmark peer barsfor Vision

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

API versions

gpt-realtime-translate