Vertex AI Multimodal Embeddings Models by Google DeepMind
Google DeepMindProprietary
1 model2024
Details
ResearcherGoogle DeepMind
LicenseProprietary
Commercial useCommercial use: conditional
Models1
Released2024
Capabilities
VisionAll models
MultimodalAll models
Links
WebsiteAbout
Google Cloud Vertex AI multimodal embedding models for text, image, and video retrieval.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
1 in view
Use when the workload needs embedding and multimodal inputs.
2024-08embeddingmultimodal inputs
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Vertex AI Multimodal Embeddings | Use when the workload needs embedding and multimodal inputs. | 2024-08 | embeddingmultimodal inputs | Current |
Release Timeline
1 release group2024-08
1 current
Vertex AI Multimodal Embeddings
Currentembeddingmultimodal inputs
Specifications(1 models)
| Model | Released | Vision | Multimodal |
|---|---|---|---|
| Vertex AI Multimodal Embeddings | 2024-08 | Yes | Yes |
Available From(1 provider)
Frequently Asked Questions
- What is Vertex AI Multimodal Embeddings used for?
- Vertex AI Multimodal Embeddings is used for embedding and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
- How does Vertex AI Multimodal Embeddings compare to T5Gemma?
- Vertex AI Multimodal Embeddings by Google DeepMind is strongest where you need embedding, while T5Gemma by Google DeepMind is the closest related family to check for agent workflows and tool use. Vertex AI Multimodal Embeddings has 1 listed variant, so compare the specs and pricing tables before choosing a production model.
- Which Vertex AI Multimodal Embeddings model should I use?
- If price is the main constraint, use the pricing table first because Vertex AI Multimodal Embeddings does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Vertex AI Multimodal Embeddings with multimodal inputs.



