GPT-4o Transcribe

Name: GPT-4o Transcribe
Author: OpenAI

Released

2025-03-20

Last refreshed

2026-06-07

Status

Researched 45d ago

ProprietaryCommercial use: conditionalMultimodalVisionAudio

GPT-4o Transcribe is worth evaluating for vision when its provider route and context window match the workload.

Use it for

Teams evaluating vision
Workloads that can use a 16k context window
Buyers comparing 1 tracked provider route

Do not use it for

Strict JSON or tool-calling flows

Specifications

Family: OpenAI Transcribe
Released: 2025-03-20
Context: 16k
Max output: 2,000
Architecture: Decoder Only
Knowledge cutoff: 2024-09
Specialization: speech-recognition
Openness: Proprietary
License: ProprietaryCommercial use: conditional
Weights: Not released
Code: Unknown
Training: Fine-tuned

Created by

OpenAI

Cutting-edge research and development.

San Francisco, California, United States

Founded 2015

Website

Pricing

Output / 1M

$10.00

Input / 1M

Cheapest of 1 route · OpenAI API

Providers(1)

OpenAI API

View 1 provider route

Links

Website

About

GPT-4o Transcribe is OpenAI's flagship speech-to-text model based on GPT-4o, released March 20, 2025. Delivers substantially better word error rates than Whisper — especially for accented speech, background noise, and variable speaking rates. Supports batch, streaming (Realtime API), and Assistants endpoints. Input: $2.50/1M audio tokens. Output: $10.00/1M text tokens. Practical: ~$0.006/min. API ID: gpt-4o-transcribe.

GPT-4o Transcribe is a proprietary model in the OpenAI Transcribe family. The structured metadata tracks a 16k-token context window, multimodal input, and audio. This page tracks provider routes through OpenAI API. No headline benchmark score is tracked for GPT-4o Transcribe yet.

Top use-case fit

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
OpenAI API	-	$10.00	ServerlessPartial

Available via routers & gateways(15)

LiteLLM

Gateway

Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.

Free OSSOpenAI API

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughOpenAI API

Portkey

Gateway

Production AI gateway routing to 1,600+ LLMs with failover, load balancing, semantic caching, and guardrails; Apache 2.0 core is fully self-hostable with the complete feature set.

SubscriptionOpenAI API

AIRouter

Router

Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.

Passthrough + feeOpenAI API

Helicone

Gateway

Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.

SubscriptionOpenAI API

Kong AI Gateway

Gateway

Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.

SubscriptionOpenAI API