DeepSeek OCR 2
deepseek-ocr-2
Last refreshed 2026-05-22. Next refresh: weekly.
DeepSeek OCR 2 is worth evaluating for vision when its provider route and context window match the workload.
Decision context: Vision task fit, 1 tracked provider route, and research from 2026-05-22.
Use it for
- Teams evaluating vision
- Workloads that can use a 8K context window
- Buyers comparing 1 tracked provider route
Do not use it for
- Strict JSON or tool-calling flows
Cheapest output
$0.030
Novita AI per 1M tokens
Provider routes
1
Tracked API hosts
Quality / dollar
Unknown
No task benchmark coverage yet
Freshness
2026-05-22
Researched today
Top use-case fit
Vision
Included by capability and metadata signals in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Novita AI | $0.030 | $0.030 | Serverless |
Benchmark peer barsfor Vision
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
About
DeepSeek OCR 2 is the second-generation OCR vision-language model from DeepSeek, introducing Visual Causal Flow as a key innovation for improved document understanding and recognition. Released with arXiv paper 2601.20552 (January 2026). 1.5M+ downloads per month on HuggingFace.
DeepSeek OCR 2 has a 8K-token context window.
DeepSeek OCR 2 input tokens at $0.03/1M, output at $0.03/1M.
Capabilities
Specifications
Created by
Advancing artificial general intelligence (AGI).