GPT-5.5
- MMLU
- 92.4%
- Output (from)
- $30.00 / 1M
Last refreshed 2026-06-03. Next refresh: weekly.
Production-grade LLMs available on enterprise provider surfaces with function calling and structured outputs, ranked by MMLU and context window.
Verdict
DeepSeek V3 is the runner-up, 4 points back on MMLU.
Enterprise picks use the §6.4 enterpriseReady truth table: enterprise provider coverage (AWS Bedrock, Azure OpenAI/Foundry, GCP Vertex, Anthropic API, OpenAI API, Google AI Studio), `commercialAvailability = ga`, plus function calling and structured outputs.
| # | Model | Input $/1M | Output $/1M | |
|---|---|---|---|---|
| 1 | GPT-5.5 ReasoningVisionTools MMLU: 92.4% | $5.00 | $30.00 | |
| 2 | DeepSeek V3 Tools MMLU: 88.5% | $0.10 | $0.28 | |
| 3 | Mistral Large 2 VisionTools MMLU: 84% | $0.48 | $1.50 | |
| 4 | GPT-5.5 Pro ReasoningVisionTools MMLU: — | $30.00 | $180.00 | |
| 5 | GPT-5.4 ReasoningTools MMLU: — | $2.50 | $15.00 | |
| 6 | GPT-5.4 Pro ReasoningVisionTools MMLU: — | $30.00 | $180.00 | |
| 7 | Gemini 3.1 Flash-Lite VisionTools MMLU: — | $0.25 | $1.50 | |
| 8 | GPT-4.1 VisionTools MMLU: — | $2.00 | $8.00 | |
| 9 | GPT-4.1 Mini VisionTools MMLU: — | $0.40 | $1.60 | |
| 10 | Grok 4.3 ReasoningVisionTools MMLU: — | $1.25 | $2.50 | |
| 11 | DeepSeek V4 Flash ReasoningTools MMLU: — | $0.10 | $0.20 | |
| 12 | Claude Opus 4.7 ReasoningVisionTools MMLU: — | $5.00 | $25.00 | |
| 13 | Claude Sonnet 4.6 ReasoningVisionTools MMLU: — | $3.00 | $15.00 | |
| 14 | Claude Opus 4.6 ReasoningVisionTools MMLU: — | $5.00 | $25.00 | |
| 15 | Gemini 2.5 Flash Lite VisionTools MMLU: — | $0.10 | $0.40 | |
| 16 | Gemini 2.5 Flash VisionTools MMLU: — | $0.30 | $2.50 | |
| 17 | Gemini 2.5 Pro ReasoningVisionTools MMLU: — | $1.25 | $10.00 | |
| 18 | Gemini 2.0 Flash Live API VisionTools MMLU: — | $0.50 | — | |
| 19 | GPT-5.5 Instant VisionTools MMLU: — | $5.00 | $30.00 | |
| 20 | GPT-5.4 Mini ReasoningTools MMLU: — | $0.75 | $4.50 |
Next seats in this ranking. Lines below are from each model's stored description in LLMReference seed data—spot-check the model page before relying on a capability claim.
GPT-5.5 Pro is OpenAI's premium variant of GPT-5.5, released April 23, 2026. Targets large quality gains for business, legal, education, and data science use cases. Scores 39.6% on FrontierMath Tier 4 (postdoctoral-level math problems), compared to 22.9% for Claude Opus 4.7. Priced at 6× the standard GPT-5.5 API rate. Available to ChatGPT subscribers and via API.
—
MMLU
GPT-5.4 is OpenAI's flagship frontier reasoning model, released March 5, 2026. It incorporates advances from GPT-5.3-Codex for coding and agentic workflows, and adds 'Thinking' mode with editable reasoning plans. Key capabilities include computer use (navigating interfaces via Playwright), image understanding and generation integration, full-stack web app generation, tool calling, and deep research. Knowledge cutoff is August 31, 2025. Model ID: gpt-5.4.
—
MMLU
Premium extended-reasoning GPT-5.4 variant producing smarter and more precise responses. Replacement for o3-deep-research and o4-mini-deep-research. No prompt caching discount.
—
MMLU
Side-by-side comparison of the top picks by price, benchmark, and API access.