PaddleOCR VL

Name: PaddleOCR VL
Author: Baidu AI

Released: 2025-10-16
Last refreshed: 2026-06-29
Status: Researched 45d ago

Open source · Commercial use: permitted · Multimodal · Vision

PaddleOCR VL is worth evaluating for vision workloads, especially document parsing, when its single tracked provider route (Novita AI) and 16k context window match the workload.

Use it for

Teams evaluating vision
Workloads that can use a 16k context window
Buyers comparing 1 tracked provider route

Do not use it for

Strict JSON or tool-calling flows

Specifications

Family: PaddleOCR VL
Released: 2025-10-16
Context: 16k
Parameters: 0.9B
Architecture: Encoder-Decoder
Specialization: vision
Openness: Open source
License: Apache 2.0 (OSI-approved; commercial use permitted)
Weights: Available
Code: Unknown
Training: Pretrained

Created by

Baidu AI

Innovative text-to-video and app builder

Beijing, China

Founded 2010

Website

Pricing

Input / 1M: $0.020
Output / 1M: $0.020

Cheapest of 1 route · Novita AI

Providers (1)

Novita AI


Links

Website HuggingFace

About

PaddleOCR VL is an ultra-compact 0.9B-parameter vision-language model from Baidu's PaddlePaddle team for multilingual document parsing. It combines a NaViT-style dynamic-resolution visual encoder with the ERNIE-4.5-0.3B language model. It supports 109 languages and recognizes text, tables, formulas, and charts. It scored 92.56 on OmniDocBench V1.5, surpassing larger models including DeepSeek-OCR. It was released on October 16, 2025.

PaddleOCR VL is an open-source model. The structured metadata lists a 16k-token context window and multimodal input. This page tracks one provider route, Novita AI, priced at $0.02 per 1M input tokens and $0.02 per 1M output tokens. No headline benchmark score is tracked on this page yet, apart from the OmniDocBench V1.5 result cited in the description above.

Top use-case fit

Vision

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing for input and output tokens across 1 provider, plus batch and cached-read pricing when available.

Provider	Input / 1M	Output / 1M	Route
Novita AI	$0.020	$0.020	Serverless
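
To make the per-1M-token prices in the table above concrete, here is a minimal sketch of a cost estimate. The prices come from the table; the function name `estimate_cost_usd` and the page and token counts are hypothetical, chosen only for illustration, and real usage depends on how the provider counts image and text tokens.

```python
from decimal import Decimal

# Prices from the table above (USD per 1M tokens, Novita AI serverless route).
INPUT_PRICE_PER_M = Decimal("0.020")
OUTPUT_PRICE_PER_M = Decimal("0.020")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a workload from its total token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M


# Hypothetical workload: 10,000 pages at roughly 1,500 input and
# 800 output tokens per page (assumed figures, not measurements).
pages = 10_000
total = estimate_cost_usd(pages * 1_500, pages * 800)
print(f"Estimated cost: ${total:.4f}")  # prints: Estimated cost: $0.4600
```

`Decimal` is used instead of floats so that small per-token prices do not pick up rounding noise when multiplied across large token counts.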

Capabilities

Vision · Multimodal

Benchmark peer bars for Vision

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of PaddleOCR VL?

PaddleOCR VL has a context window of 16k tokens.
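
As a rough illustration of what a 16k window means for request planning, the sketch below checks whether a request fits. It assumes that "16k" means 16,000 tokens (the exact limit may be 16,384) and that prompt tokens, including any tokens the image is encoded into, share the window with the generated output. Both are assumptions, and the helper names are hypothetical.

```python
# Assumption: "16k" is read as 16,000 tokens; the exact limit may differ.
CONTEXT_WINDOW_TOKENS = 16_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if prompt plus requested output fits in the window.

    Assumes input and output tokens share a single window.
    """
    return prompt_tokens + max_output_tokens <= window


def remaining_output_budget(prompt_tokens: int,
                            window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Return how many output tokens remain after the prompt (never negative)."""
    return max(window - prompt_tokens, 0)


print(fits_context(12_000, 4_000))      # exactly fills the window
print(remaining_output_budget(12_000))  # tokens left for the parsed output
```

Dense pages with large tables or many formulas can produce long outputs, so it is worth checking the remaining output budget per page before batching documents.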

How much does PaddleOCR VL cost?

PaddleOCR VL costs $0.02 per 1M input tokens and $0.02 per 1M output tokens through Novita AI, the only tracked provider.

When was PaddleOCR VL released?

PaddleOCR VL was released on 2025-10-16.

Which providers offer PaddleOCR VL?

PaddleOCR VL is available from 1 provider: Novita AI.
