LLM Reference
OpenAI API

GPT Realtime 2 on OpenAI API

GPT Realtime 2 · OpenAI

Serverless

Last refreshed 2026-05-16. Next refresh: weekly.

Why use GPT Realtime 2 on OpenAI API?

OpenAI API offers GPT Realtime 2 with pay-as-you-go pricing at $32.00/1M input tokens. OpenAI is a leading AI research and deployment company dedicated to developing safe and beneficial artificial general intelligence (AGI) for humanity.

Input / 1M
$32.00
Output / 1M
$64.00
Cache
read $0.40
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install openai
Auth
export OPENAI_API_KEY=...
Call
import os
from openai import OpenAI
client = OpenAI()  # reads OPENAI_API_KEY from env
response = client.chat.completions.create(
Model ID
gpt-realtime-2

Request example

import os
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from env
response = client.chat.completions.create(
    model="gpt-realtime-2",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

  • Use the model slug directly, e.g. "gpt-4o" or "gpt-4.1-mini".
  • The examples expect OPENAI_API_KEY; rename it only if your application config maps the new variable.

Pricing

TypePrice (per 1M)
Input tokens$32.00
Output tokens$64.00

Capabilities

MultimodalReasoningFunction CallingTool UsePrompt CachingAudio

About GPT Realtime 2

GPT Realtime 2 is OpenAI's second-generation real-time voice model, released May 7, 2026. It is a GPT-5-class speech-to-speech model for voice agents with five reasoning intensity levels, parallel tool calls, spoken preambles, and recovery behavior on failed tasks. The model supports audio and text interaction through the Realtime API with a 128K token context window. Audio token pricing is $32 per 1M input tokens, $0.40 per 1M cached input tokens, and $64 per 1M output tokens.

FAQ

What does GPT Realtime 2 cost on OpenAI API?

On OpenAI API, GPT Realtime 2 costs $32 per 1M input tokens and $64 per 1M output tokens.

What is the context window for GPT Realtime 2 on OpenAI API?

GPT Realtime 2 supports a 131k token context window on OpenAI API.

What API model ID do I use for GPT Realtime 2 on OpenAI API?

Use the model ID gpt-realtime-2 when calling OpenAI API's API.

Who created GPT Realtime 2?

GPT Realtime 2 was created by OpenAI as part of the GPT Realtime 2 model family.

Is GPT Realtime 2 open source?

GPT Realtime 2 is not open source; the seed data lists it as proprietary.

Get Started

Model Specs

Released2026-05-07
Context131k
ArchitectureDecoder Only
Knowledge cutoff2024-09

Related Models on OpenAI API