LLM Reference

GPT-4 Models by OpenAI

OpenAIProprietaryHighlight
5 models2023–2024Up to 128K ctxFrom $5/1M input

About

GPT-4 is a large multimodal model that accepts text or image inputs and outputs text. It can solve complex problems with greater accuracy than any of our previous models, thanks to its extensive general knowledge and advanced reasoning capabilities.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view4 retired

Use when the workload needs 128K context, code execution, and multimodal inputs.

2023-11128K contextcode executionmultimodal inputs

Release Timeline

3 release groups
2024-04
1 retired
GPT-4 Turbo
128K contexttool usefunction calling
Replaced
2023-11
1 current · 1 retired
GPT-4 Turbo Preview
128K contextstructured outputscode execution
Replaced
GPT-4 Vision Preview
128K contextcode executionmultimodal inputs
Current
2023-03
2 retired
GPT-4
8K contextfunction callingstructured outputs
Replaced
GPT-4 32k
32K contextcode executionmultimodal inputs
Archived

Replaced By

Keep for legacy integrations; evaluate GPT-4.1 before new work.

Keep for legacy integrations; evaluate GPT-4.1 before new work.

Replaced

Keep for legacy integrations; evaluate GPT-4.1 before new work.

Specifications(5 models)

GPT-4 model specifications comparison
ModelReleasedContextParametersVisionMultimodalFn CallingTool UseStructured OutputsCode Exec
GPT-4 Vision Preview2023-11128K1.76T (8x222B MoE)*YesYesNoNoNoYes

Available From(6 providers)

Pricing

GPT-4 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
GPT-4 Vision PreviewAzure OpenAI$10$40Serverless

Frequently Asked Questions

What is GPT-4 used for?
GPT-4 is used for vision and multimodal work, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does GPT-4 compare to GPT Realtime 2?
GPT-4 by OpenAI is strongest where you need vision and multimodal work, while GPT Realtime 2 by OpenAI is the closest related family to check for translation. GPT-4 has 5 listed variants and reaches up to 128K context, while GPT Realtime 2 reaches up to 131K context, so compare the specs and pricing tables before choosing a production model.
Which GPT-4 model should I use?
For the lowest listed input price, start with GPT-4 Turbo through Replicate API at $5/1M input tokens. For the most capable/latest local choice, evaluate GPT-4 Vision Preview with 128K context and multimodal inputs.

Models(5)