llmreference

GPT Audio Mini

gpt-audio-mini

Researched 1d ago

Last refreshed 2026-05-19. Next refresh: weekly.

ProprietaryMultimodalLong contextVision

GPT Audio Mini is worth evaluating for long context and vision when its provider route and context window match the workload.

Decision context: Long context task fit, 2 tracked provider routes, and research from 2026-05-19.

Use it for

  • Teams evaluating long context and vision
  • Workloads that can use a 128K context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • Strict JSON or tool-calling flows

Cheapest output

$2.40

OpenAI API per 1M tokens

Provider routes

2

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-05-19

Researched 1d ago

fresh

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
OpenAI API$0.600$2.40
Serverless
OpenRouter$0.600$2.40
Serverless

Benchmark peer barsfor Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

GPT Audio Mini is OpenAI's GPT Audio model with multimodal text and image input. It offers a 125K-token context window.

GPT Audio Mini has a 128K-token context window.

GPT Audio Mini input tokens at $0.6/1M, output at $2.4/1M.

Capabilities

MultimodalAudio

Rankings

Specifications

FamilyGPT Audio
Released2024-10-01
Context128K
Max output16,384
ArchitectureDecoder Only
Knowledge cutoff2023-10
Specializationgeneral
LicenseProprietary

Created by

Cutting-edge research and development.

San Francisco, California, United States
Founded 2015
Website