Quick Start
- 1
- 2Use the Novita AI SDK or REST API to call
paddleocr-vl. - 3
Code Examples
Code examples for this provider have not been sourced yet.
About Novita AI
Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.
Pricing on Novita AI
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.02 |
| Output tokens | $0.02 |
Capabilities
VisionMultimodal
About PaddleOCR VL
PaddleOCR VL is a 0.9B ultra-compact vision-language model from Baidu's PaddlePaddle team for multilingual document parsing. Combines a NaViT-style dynamic resolution visual encoder with ERNIE-4.5-0.3B. Supports 109 languages for recognizing text, tables, formulas, and charts. Achieved 92.56 on OmniDocBench V1.5, surpassing larger models including DeepSeek-OCR. Released October 16, 2025.
Model Specs
Released2025-10-16
Parameters0.9B
Context16K
ArchitectureEncoder-Decoder