LLM Reference

GPT-4 Turbo

gpt-4-turbo

DeprecatedResearched 16d ago

Last refreshed 2026-05-22. Next refresh: weekly.

ProprietaryMultimodalCodingRAGAgentsLong contextVisionClassificationJSON / Tool usehighlight

GPT-4 Turbo is a legacy integration reference; evaluate GPT-4.1 before starting new work.

Decision context: Classification task fit, 6 tracked provider routes, and research from 2026-05-10.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 128K context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • New production launches

Cheapest output

$15.00

Replicate API per 1M tokens

Provider routes

6

Tracked API hosts

Quality / dollar

Unknown

No output-token price in the ladder

Freshness

2026-05-10

Researched 16d ago

fresh

This API model is marked deprecated. Use GPT-4.1 as the replacement candidate before sending new production traffic.

Top use-case fit

Coding

Included by capability and metadata signals in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 6
ProviderInput / 1MOutput / 1MBatch in / outRoute
Replicate API$5.00$15.00-
Serverless
OpenAI API$10.00$30.00$5.00 / $15.00
Serverless
OpenRouter$10.00$30.00-
Serverless
Vercel AI Gateway$10.00$30.00-
Serverless

Benchmark peer barsfor Classification

Migration checks

No linked migration route is available for this model yet.

About

OpenAI's high-performance variant of GPT-4 with 128K context window and improved reasoning. Widely used for production applications.

GPT-4 Turbo has a 128K-token context window.

GPT-4 Turbo input tokens at $5/1M, output at $15/1M.

Capabilities

VisionMultimodalFunction CallingTool UseStructured OutputsCode Execution

Benchmark Scores(3)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
Massive Multitask Language Understanding86.55-shotOpenAI
Chatbot Arena1242.0https://lmarena.ai
MMLU PRO69.4https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

API Versions

gpt-4-turbo-2024-04-09gpt-4-turbo

Rankings

Show all 1 popular comparisonssorted by 7-day search impressions

Specifications

FamilyGPT-4
Released2024-04-09
Parameters1.76T (8x222B MoE)*
Context128K
ArchitectureMixture of Experts
Knowledge cutoff2023-12
Specializationgeneral
LicenseProprietary
Trainingfinetuned

Created by

Cutting-edge research and development.

San Francisco, California, United States
Founded 2015
Website