LLM ReferenceLLM Reference

Gemini Embedding Models by Google DeepMind

Google DeepMindProprietary
5 models2023–2026Up to 2K ctxFrom $0.1/1M input

About

Google's Gemini Embedding models for generating text and multimodal embeddings. Used for semantic search, retrieval, classification, and clustering tasks.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

5 in view

Use when the workload needs embedding and multimodal inputs.

2026-04embeddingmultimodal inputs

Use when the workload needs embedding, multimodal inputs, and audio.

2026-04embeddingmultimodal inputsaudio

Use when the workload needs embedding and multimodal inputs.

2024-08embeddingmultimodal inputs

Use when the workload needs embedding.

2023-12embedding

Use when the workload needs 2K context.

2023-022K context

Release Timeline

4 release groups
2026-04
2 current
Gemini Embedding 2
embeddingmultimodal inputsaudio
Current
Gemini Embedding 2 Preview
embeddingmultimodal inputs
Current
2024-08
1 current
Multimodal Embeddings
embeddingmultimodal inputs
Current
2023-12
1 current
Current
2023-02
1 current

Specifications(5 models)

Gemini Embedding model specifications comparison
ModelReleasedContextVisionMultimodal
Gemini Embedding 2 Preview2026-04YesYes
Gemini Embedding 22026-04YesYes
Multimodal Embeddings2024-08YesYes
Gemini Embedding2023-12NoNo
text-embedding-004 on Google Vertex AI2023-022KNoNo

Available From(2 providers)

Pricing

Gemini Embedding model pricing by provider
ModelProviderInput / 1MOutput / 1MType
text-embedding-004 on Google Vertex AIGCP Vertex AI$0.1Serverless
Gemini EmbeddingGoogle AI Studio$0.15Serverless
Gemini EmbeddingGCP Vertex AI$0.15Serverless
Gemini Embedding 2 PreviewGoogle AI Studio$0.2Serverless
Gemini Embedding 2Google AI Studio$0.2Serverless

Frequently Asked Questions

What is Gemini Embedding used for?
Gemini Embedding is used for embedding and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Gemini Embedding compare to Gemma 4?
Gemini Embedding by Google DeepMind is strongest where you need embedding, while Gemma 4 by Google DeepMind is the closest related family to check for vision and multimodal work. Gemini Embedding has 5 listed variants and reaches up to 2K context, while Gemma 4 reaches up to 256K context, so compare the specs and pricing tables before choosing a production model.
Which Gemini Embedding model should I use?
For the lowest listed input price, start with text-embedding-004 on Google Vertex AI through GCP Vertex AI at $0.1/1M input tokens. For the most capable/latest local choice, evaluate Gemini Embedding 2 Preview with multimodal inputs.

Models(5)