GPT-5
GPT-5 is worth evaluating for coding, RAG, and agents when its provider route and context window match the workload.
Use it for
- Teams evaluating coding, RAG, and agents
- Workloads that can use a 400k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- Workloads where another current model has stronger sourced task evidence

- Family: GPT-5
- Released: 2025-08-07
- Context: 400k
- Max output: 128,000
- Architecture: Decoder-only
- Knowledge cutoff: 2024-09
- Specialization: General
- Openness: Proprietary
- License: Proprietary (commercial use with conditions)
- Training: Pretrained
Cheapest of 4 routes · OpenAI API · cache read $0.125
About
OpenAI's previous-generation reasoning model with configurable reasoning effort, released in August 2025. It supports minimal, low, medium, and high reasoning levels and has been succeeded by GPT-5.1 and later models.
GPT-5 is a proprietary model. The structured metadata tracks a 400k-token context window, multimodal input, reasoning, function calling, tool use, structured outputs, and code execution. This page tracks provider routes through Replicate API, OpenRouter, OpenAI API, and Vercel AI Gateway, with the cheapest tracked route listed at $1.25 input and $10 output per 1M tokens. Headline tracked benchmarks include SWE-bench Verified 74.9, MMMU Pro 78.4, and Aider Polyglot 88.0.
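As a rough illustration of how the context and max-output figures listed above constrain a request, here is a minimal Python sketch. It assumes the 400k window is shared between prompt and generated tokens, which this page does not state; the function name `fits_gpt5_limits` is hypothetical, and token counts would come from whatever tokenizer your provider route uses.

```python
# Limits as listed on this page.
CONTEXT_WINDOW = 400_000  # total tokens ("Context: 400k")
MAX_OUTPUT = 128_000      # "Max output: 128,000"


def fits_gpt5_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the listed limits.

    Assumption (not confirmed by this page): prompt and output tokens
    share the same 400k context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if requested_output_tokens > MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(fits_gpt5_limits(250_000, 100_000))  # within both limits
    print(fits_gpt5_limits(300_000, 128_000))  # exceeds the shared window
```

Under that assumption, a request that reserves the full 128,000 output tokens leaves 272,000 tokens for the prompt.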
Top use-case fit: coding, agents, and build tasks
Coding
2 relevant benchmarks in the decision map.
RAG
Included by capability and metadata signals in the decision map.
Agents
1 relevant benchmark in the decision map.
Provider price ladder
Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Batch in / out | Cache | Route |
|---|---|---|---|---|---|
| OpenAI API | $1.25 | $10.00 | $0.625 / $5.00 | read $0.125 | Serverless |
| OpenRouter | $1.25 | $10.00 | - | - | Serverless |
| Replicate API | $1.25 | $10.00 | - | - | Serverless |
| Vercel AI Gateway | $1.25 | $10.00 | - | read $0.125 | Serverless |
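To make the table concrete, the sketch below estimates the cost of a single request on each tracked route using the listed per-1M prices. It rests on two assumptions not stated on this page: cached prompt tokens are billed at the cache-read rate instead of the input rate, and routes with no listed cache price bill cached tokens at the full input rate. The `Route` class and `estimate_cost` function are hypothetical names, and batch pricing is left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Route:
    name: str
    input_per_m: float                         # USD per 1M input tokens
    output_per_m: float                        # USD per 1M output tokens
    cache_read_per_m: Optional[float] = None   # USD per 1M cached tokens, if listed


# Prices copied from the table above.
ROUTES = [
    Route("OpenAI API", 1.25, 10.00, 0.125),
    Route("OpenRouter", 1.25, 10.00),
    Route("Replicate API", 1.25, 10.00),
    Route("Vercel AI Gateway", 1.25, 10.00, 0.125),
]


def estimate_cost(route: Route, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request on a route.

    Assumption: cached tokens are a subset of input tokens and are billed
    at the cache-read rate when one is listed, else at the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    cached_rate = (route.cache_read_per_m
                   if route.cache_read_per_m is not None
                   else route.input_per_m)
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * route.input_per_m
             + cached_tokens * cached_rate
             + output_tokens * route.output_per_m)
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: 1M prompt tokens (800k of them cached), 100k output tokens.
    for route in ROUTES:
        cost = estimate_cost(route, 1_000_000, 100_000, cached_tokens=800_000)
        print(f"{route.name}: ${cost:.2f}")
```

Because input and output prices are identical across the four routes, the estimates differ only for workloads with cached prompt tokens, where the two routes listing a cache-read price come out cheaper under these assumptions.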
Available via routers & gateways (15)
AIRouter (Router)
Commercial LLM router that analyzes incoming requests and routes to the optimal model for cost/quality/latency via a drop-in OpenAI-compatible API, with a privacy-preserving embedding mode that avoids sending prompt content.
Helicone (Gateway)
Observability-first AI gateway with routing, caching, rate limiting, and request tracing; Apache 2.0 open-source core with a managed hosted tier for logging and analytics.
Kong AI Gateway (Gateway)
Multi-LLM AI gateway built on Kong Gateway 3.x, adding semantic routing, load balancing, guardrails, and MCP traffic analytics as plugins over Kong's existing API management platform.
LiteLLM (Gateway)
Open-source Python SDK and proxy server that unifies 100+ LLM APIs behind a single OpenAI-compatible interface, with load balancing, cost tracking, and configurable failover.
Martian (Router)
AI-powered LLM router that analyzes each prompt in real time to select the optimal model, targeting 20–97% cost reduction while maintaining quality; San Francisco startup reportedly nearing $1.3B valuation.
Neutrino AI (Router)
Commercial LLM router that dynamically routes each query to the best-suited model with load balancing and fallback handling, charging 3% of underlying AI spend.
Capabilities
Benchmark scores (5)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| SWE-bench Verified | 74.9 | — | https://openai.com/index/introducing-gpt-5/ |
| MMMU Pro | 78.4 | thinking mode (highest across thinking levels) | https://openai.com/index/introducing-gpt-5/ |
| Aider Polyglot | 88.0 | Aider Polyglot | https://aider.chat/docs/leaderboards/ |
| Google-Proof Q&A | 88.4 | — | https://openai.com/blog/gpt-5 |
| AIME 2025 | 94.6 | — | https://openai.com/blog/gpt-5 |
Migration checks
No linked migration route is available for this model yet.