LLM Reference

gpt-realtime-mini

Released: 2025-10-06
Last refreshed: 2026-05-19
Status: Researched 38d ago
Proprietary · Commercial use: conditional · Multimodal · Vision

gpt-realtime-mini is worth evaluating for vision when its provider route and context window match the workload.

Use it for

  • Teams evaluating vision
  • Workloads that can use a 32k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Strict JSON or tool-calling flows

Specifications

Released: 2025-10-06
Context: 32k tokens
Max output: 4,096 tokens
Architecture: Decoder-only
Knowledge cutoff: 2023-10
Specialization: realtime-voice
Openness: Proprietary
License: Proprietary (commercial use: conditional)
Training: Pretrained
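
The context and max-output figures above imply a simple token budget check before sending a request. The sketch below is illustrative only: it assumes "32k" means 32,000 tokens (the exact limit is not stated here) and that output tokens count against the same window. The function name `fits_budget` is hypothetical, not part of any SDK.

```python
# Limits taken from the Specifications list; the exact value of "32k" is an assumption.
CONTEXT_WINDOW = 32_000   # assumed reading of "32k"
MAX_OUTPUT = 4_096        # listed max output tokens


def fits_budget(prompt_tokens: int, requested_output: int = MAX_OUTPUT) -> bool:
    """Return True if the prompt plus requested output fits the assumed window."""
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output > MAX_OUTPUT:
        # A request cannot ask for more output than the listed cap.
        return False
    # Assumption: prompt and output share one context window.
    return prompt_tokens + requested_output <= CONTEXT_WINDOW


if __name__ == "__main__":
    for prompt in (20_000, 30_000):
        print(prompt, fits_budget(prompt))
```

Under these assumptions, a prompt that leaves less than 4,096 tokens of headroom needs either a smaller output request or a shorter prompt.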
Created by

OpenAI

Cutting-edge research and development.

San Francisco, California, United States
Founded 2015
Pricing

Output / 1M: $2.40
Input / 1M: $0.600

Cheapest of 1 route · OpenAI API · cache read $0.060
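
To turn the per-million rates above into a per-request estimate, a minimal sketch follows. It assumes cached input tokens are billed at the cache-read rate instead of the input rate and that all three rates are in USD per 1M tokens. The function name `estimate_cost` is hypothetical, and actual billing rules (for example for audio tokens) may differ.

```python
from decimal import Decimal

# Rates from the Pricing section, in USD per 1M tokens.
INPUT_PER_M = Decimal("0.600")
OUTPUT_PER_M = Decimal("2.40")
CACHE_READ_PER_M = Decimal("0.060")
ONE_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate request cost in USD.

    Assumption: cached_tokens is the subset of input_tokens served from
    cache and billed at the cache-read rate rather than the input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PER_M
        + cached_tokens * CACHE_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / ONE_MILLION


if __name__ == "__main__":
    cost = estimate_cost(1_000_000, 200_000, cached_tokens=500_000)
    print(f"Estimated cost: ${cost:.2f}")
```

With 1M input tokens (half of them cached) and 200k output tokens, the estimate is $0.30 + $0.03 + $0.48 = $0.81 under the stated assumptions.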

About

gpt-realtime-mini is OpenAI's GPT Realtime model with multimodal text and image input. It offers a 32K-token context window.

gpt-realtime-mini is a proprietary model in the GPT Realtime family. The structured metadata tracks a 32k-token context window, multimodal input, and audio capability. This page tracks one provider route, the OpenAI API, listed at $0.60 input and $2.40 output per 1M tokens. No headline benchmark score is tracked for gpt-realtime-mini yet.

Top use-case fit

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 provider for input and output tokens, batch, and cached reads when available.

Provider     Input / 1M   Output / 1M   Cache         Route
OpenAI API   $0.600       $2.40         read $0.060   Serverless

Available via routers & gateways (15)

Capabilities

Multimodal · Audio

Benchmark peer bars for Vision

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.