Using PaddleOCR VL on Novita AI

Implementation guide · PaddleOCR VL · Baidu AI

ServerlessOpen Source

Quick Start

1
Create an account at Novita AI and generate an API key.
2
Use the Novita AI SDK or REST API to call paddleocr-vl.
3
You'll be billed $0.02/1M input, $0.02/1M output tokens. See full pricing.

API Portal Pricing Model Card

Code Examples

Code examples for this provider have not been sourced yet.

About Novita AI

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

View all models on Novita AI →

Pricing on Novita AI

Type	Price (per 1M)
Input tokens	$0.02
Output tokens	$0.02

Capabilities

VisionMultimodal

About PaddleOCR VL

PaddleOCR VL is a 0.9B ultra-compact vision-language model from Baidu's PaddlePaddle team for multilingual document parsing. Combines a NaViT-style dynamic resolution visual encoder with ERNIE-4.5-0.3B. Supports 109 languages for recognizing text, tables, formulas, and charts. Achieved 92.56 on OmniDocBench V1.5, surpassing larger models including DeepSeek-OCR. Released October 16, 2025.

Full model details →

Model Specs

Released2025-10-16

Parameters0.9B

Context16k

ArchitectureEncoder-Decoder

Provider

Novita AI