GPT-4 Vision Preview
gpt-4-vision-preview
Last refreshed 2026-05-19. Next refresh: weekly.
GPT-4 Vision Preview has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Decision context: Vision task fit, 1 tracked provider route, and research from 2026-05-19.
Use it for
- Teams evaluating coding, agents, and long context
- Workloads that can use a 128K context window
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Strict JSON or tool-calling flows
Cheapest output
-
No tracked output price
Provider routes
1
Tracked API hosts
Quality / dollar
Unknown
No output-token price in the ladder
Freshness
2026-05-19
Researched 7d ago
Top use-case fit
Coding
Included by capability and metadata signals in the decision map.
Agents
Included by capability and metadata signals in the decision map.
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Benchmark peer barsfor Vision
Migration checks
No linked migration route is available for this model yet.
About
GPT-4 Vision Preview is OpenAI's GPT-4 model with multimodal text and image input. It is deprecated (originally released 2023-11-06); use it only for reproducing earlier results or evaluating drift over time.
GPT-4 Vision Preview has a 128K-token context window.
GPT-4 Vision Preview input tokens at $10/1M, output at $40/1M.
Capabilities
Benchmark Scores(2)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| Massive Multi-discipline Multimodal Understanding | 56.0 | — | https://mmmu-benchmark.github.io/ |
| GAOKAO-MM | 48.1 | zero-shot | https://github.com/OpenMOSS/GAOKAO-MM |
API Versions
gpt-4-1106-vision-previewgpt-4-vision-previewCompare
All comparisons →Specifications
Created by
Cutting-edge research and development.