LLM Reference

Vertex AI Multimodal Embeddings Models by Google DeepMind

Google DeepMindProprietary
1 model2024

Details

ResearcherGoogle DeepMind
LicenseProprietary
Commercial useCommercial use: conditional
Models1
Released2024

Capabilities

VisionAll models
MultimodalAll models

Links

Website

About

Google Cloud Vertex AI multimodal embedding models for text, image, and video retrieval.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

1 in view

Use when the workload needs embedding and multimodal inputs.

2024-08embeddingmultimodal inputs

Release Timeline

1 release group
2024-08
1 current
Vertex AI Multimodal Embeddings
embeddingmultimodal inputs
Current

Specifications(1 models)

Vertex AI Multimodal Embeddings model specifications comparison
ModelReleasedVisionMultimodal
Vertex AI Multimodal Embeddings2024-08YesYes

Available From(1 provider)

Frequently Asked Questions

What is Vertex AI Multimodal Embeddings used for?
Vertex AI Multimodal Embeddings is used for embedding and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Vertex AI Multimodal Embeddings compare to T5Gemma?
Vertex AI Multimodal Embeddings by Google DeepMind is strongest where you need embedding, while T5Gemma by Google DeepMind is the closest related family to check for agent workflows and tool use. Vertex AI Multimodal Embeddings has 1 listed variant, so compare the specs and pricing tables before choosing a production model.
Which Vertex AI Multimodal Embeddings model should I use?
If price is the main constraint, use the pricing table first because Vertex AI Multimodal Embeddings does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Vertex AI Multimodal Embeddings with multimodal inputs.