LLM ReferenceLLM Reference
3 models2024Up to 512 ctx

About

PaliGemma is a family of open-source vision-language models (VLMs) developed by Google, emphasizing lightweight design and efficiency compared to other large language models. Built using open components, including the SigLIP vision model and the Gemma language model, PaliGemma models seamlessly process both images and text to deliver text outputs. This capability makes them well-suited for tasks such as image captioning, visual question answering, and object detection. Available in resolutions ranging from 224x224 to 896x896, these models are offered in various forms including pre-trained, mix, and fine-tuned versions to meet diverse research and practical needs. While useful for direct inference, they excel when fine-tuned for specific applications 13578.

Specifications(3 models)

PaliGemma model specifications comparison
ModelReleasedContextParametersVisionMultimodal
PaliGemma 3B 8962024-055123BYesYes
PaliGemma 3B 4482024-055123BNoNo
PaliGemma 3B 2242024-051283BNoNo

Available From(1 provider)

Frequently Asked Questions

What is PaliGemma?
PaliGemma is a family of open-source vision-language models (VLMs) developed by Google, emphasizing lightweight design and efficiency compared to other large language models. Built using open components, including the SigLIP vision model and the Gemma language model, PaliGemma models seamlessly process both images and text to deliver text outputs. This capability makes them well-suited for tasks such as image captioning, visual question answering, and object detection. Available in resolutions ranging from 224x224 to 896x896, these models are offered in various forms including pre-trained, mix, and fine-tuned versions to meet diverse research and practical needs. While useful for direct inference, they excel when fine-tuned for specific applications 13578.
How many models are in the PaliGemma family?
The PaliGemma family contains 3 models.
What is the latest PaliGemma model?
The latest model is PaliGemma 3B 896, released in 2024-05.

Models(3)