LLM Reference

PaddleOCR VL on Novita AI

PaddleOCR VL · Baidu AI

ServerlessOpen Source

Last refreshed 2026-05-22. Next refresh: weekly.

Why use PaddleOCR VL on Novita AI?

Novita AI offers PaddleOCR VL with pay-as-you-go pricing at $0.02/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Input / 1M
$0.020
Output / 1M
$0.020
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: paddleocr-vl
Model ID
paddleocr-vl

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Pricing

TypePrice (per 1M)
Input tokens$0.02
Output tokens$0.02

Capabilities

VisionMultimodal

About PaddleOCR VL

PaddleOCR VL is a 0.9B ultra-compact vision-language model from Baidu's PaddlePaddle team for multilingual document parsing. Combines a NaViT-style dynamic resolution visual encoder with ERNIE-4.5-0.3B. Supports 109 languages for recognizing text, tables, formulas, and charts. Achieved 92.56 on OmniDocBench V1.5, surpassing larger models including DeepSeek-OCR. Released October 16, 2025.

FAQ

What does PaddleOCR VL cost on Novita AI?

On Novita AI, PaddleOCR VL costs $0.02 per 1M input tokens and $0.02 per 1M output tokens.

What is the context window for PaddleOCR VL on Novita AI?

PaddleOCR VL supports a 16,384 token context window on Novita AI.

What API model ID do I use for PaddleOCR VL on Novita AI?

Use the model ID paddleocr-vl when calling Novita AI's API.

Who created PaddleOCR VL?

PaddleOCR VL was created by Baidu AI as part of the PaddleOCR VL model family.

Is PaddleOCR VL open source?

PaddleOCR VL is open source according to the seed data.

Get Started

Model Specs

Released2025-10-16
Parameters0.9B
Context16K
ArchitectureEncoder-Decoder