GPT-4 Models by OpenAI
5 models · 2023–2024 · Up to 128K ctx · From $5/1M input
About
GPT-4 is a large multimodal model that accepts text or image inputs and outputs text. It can solve complex problems with greater accuracy than any of our previous models, thanks to its extensive general knowledge and advanced reasoning capabilities.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
1 in view · 4 retired
GPT-4 Vision Preview · Current
Use when the workload needs 128K context, code execution, and multimodal inputs.
2023-11 · 128K context · code execution · multimodal inputs
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| GPT-4 Vision Preview | Use when the workload needs 128K context, code execution, and multimodal inputs. | 2023-11 | 128K context, code execution, multimodal inputs | Current |
Release Timeline
3 release groups
2024-04
1 retired
GPT-4 Turbo
Replaced · 128K context · tool use · function calling
2023-11
1 current · 1 retired
GPT-4 Turbo Preview
Replaced · 128K context · structured outputs · code execution
GPT-4 Vision Preview
Current · 128K context · code execution · multimodal inputs
Replaced By
Replaced
Keep for legacy integrations; evaluate GPT-4.1 before new work.
Replaced
Keep for legacy integrations; evaluate GPT-4.1 before new work.
Specifications (5 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Fn Calling | Tool Use | Structured Outputs | Code Exec |
|---|---|---|---|---|---|---|---|---|---|
| GPT-4 Vision Preview | 2023-11 | 128K | 1.76T (8x222B MoE)* | Yes | Yes | No | No | No | Yes |
Available From (6 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| GPT-4 Vision Preview | Azure OpenAI | $10 | $40 | Serverless |
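Per-million-token rates translate into a per-request cost by scaling each rate by the token count. The sketch below uses the rates from the row above; the function name `estimate_cost` and the example token counts are illustrative assumptions, not part of any provider's API, and actual billing may differ (for example, image inputs are typically metered separately).

```python
# Rates copied from the pricing row above (USD per 1M tokens).
INPUT_PER_M = 10.0   # GPT-4 Vision Preview, Azure OpenAI, input
OUTPUT_PER_M = 40.0  # GPT-4 Vision Preview, Azure OpenAI, output


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed rates.

    Hypothetical helper for illustration only.
    """
    return (input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M) / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost(100_000, 2_000):.2f}")
```

At these rates, output tokens cost four times as much as input tokens, so long replies can dominate the bill even when the prompt is large.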
Frequently Asked Questions
- What is GPT-4 used for?
  - GPT-4 is used for vision and multimodal work, agent workflows and tool use, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
- How does GPT-4 compare to GPT Realtime 2?
  - GPT-4 by OpenAI is strongest where you need vision and multimodal work, while GPT Realtime 2 by OpenAI is the closest related family to check for translation. GPT-4 has 5 listed variants and reaches up to 128K context, while GPT Realtime 2 reaches up to 131K context, so compare the specs and pricing tables before choosing a production model.
- Which GPT-4 model should I use?
  - For the lowest listed input price, start with GPT-4 Turbo through Replicate API at $5/1M input tokens. For the only variant still listed as current, evaluate GPT-4 Vision Preview with 128K context and multimodal inputs.






