DePlot on NVIDIA NIM

DePlot · Google DeepMind

Provisioned

Pricing

Type            Price (per 1M)
Input tokens    Free
Output tokens   Free

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Tool Use, JSON Mode, Code Execution

About DePlot

DePlot is a multimodal model for visual language reasoning over charts and plots. It works in two steps: a modality conversion module first translates the plot image into a linearized table, and a large language model then reasons over that text to answer questions about the chart. Because the language model needs only a one-shot prompt rather than task-specific fine-tuning, the approach is described as "one-shot"; it is reported to achieve a 24% improvement over the prior fine-tuned state of the art on human-written queries from the ChartQA benchmark. Applications include data analysis, automated report generation, and educational tools, covering diverse chart types and open-ended questions.
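The second step of the pipeline (handing the linearized table to a language model) can be sketched in Python. This is a minimal illustration and not part of any official client. It assumes the table arrives as a single string with `|` between cells and `<0x0A>` between rows; the function names `parse_linearized_table` and `build_prompt` and the prompt wording are hypothetical.

```python
from typing import List

# Assumed separators for the linearized table text (not confirmed by this page).
ROW_SEP = "<0x0A>"
CELL_SEP = "|"


def parse_linearized_table(text: str) -> List[List[str]]:
    """Split a linearized table string into rows of trimmed cell strings."""
    rows = []
    for raw_row in text.split(ROW_SEP):
        cells = [cell.strip() for cell in raw_row.split(CELL_SEP)]
        if any(cells):  # skip rows that contain only empty cells
            rows.append(cells)
    return rows


def build_prompt(rows: List[List[str]], question: str) -> str:
    """Render the table as plain text and append the question for an LLM."""
    table_text = "\n".join(" | ".join(row) for row in rows)
    return (
        "Read the table below and answer the question.\n\n"
        f"{table_text}\n\n"
        f"Question: {question}\nAnswer:"
    )


if __name__ == "__main__":
    # Hypothetical table text, standing in for the output of step one.
    sample = "Year | Revenue <0x0A> 2021 | 10 <0x0A> 2022 | 14"
    table = parse_linearized_table(sample)
    print(build_prompt(table, "In which year was revenue higher?"))
```

The resulting prompt string would then be sent to whichever language model handles the reasoning step. In a one-shot setup, a single worked example (table, question, answer) would be placed before the prompt.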

Model Specs

Released: 2023-10-26
Parameters: 282M
Architecture: Encoder-Decoder (image encoder, text decoder)