Last refreshed 2026-05-22. Next refresh: weekly.
Why use PaddleOCR VL on Novita AI?
Novita AI offers PaddleOCR VL with pay-as-you-go pricing at $0.02/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Setup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: paddleocr-vlpaddleocr-vlRequest example
Gotchas
No curated gotchas have been sourced for this exact provider/model route yet.
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.02 |
| Output tokens | $0.02 |
Capabilities
About PaddleOCR VL
PaddleOCR VL is a 0.9B ultra-compact vision-language model from Baidu's PaddlePaddle team for multilingual document parsing. Combines a NaViT-style dynamic resolution visual encoder with ERNIE-4.5-0.3B. Supports 109 languages for recognizing text, tables, formulas, and charts. Achieved 92.56 on OmniDocBench V1.5, surpassing larger models including DeepSeek-OCR. Released October 16, 2025.
FAQ
What does PaddleOCR VL cost on Novita AI?
On Novita AI, PaddleOCR VL costs $0.02 per 1M input tokens and $0.02 per 1M output tokens.
What is the context window for PaddleOCR VL on Novita AI?
PaddleOCR VL supports a 16,384 token context window on Novita AI.
What API model ID do I use for PaddleOCR VL on Novita AI?
Use the model ID paddleocr-vl when calling Novita AI's API.
Who created PaddleOCR VL?
PaddleOCR VL was created by Baidu AI as part of the PaddleOCR VL model family.
Is PaddleOCR VL open source?
PaddleOCR VL is open source according to the seed data.