What is Imagen used for?

Imagen is used for image and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.

How does Imagen compare to T5Gemma?

Imagen by Google DeepMind is strongest where you need image, while T5Gemma by Google DeepMind is the closest related family to check for agent workflows and tool use. Imagen has 9 listed variants, so compare the specs and pricing tables before choosing a production model.

Which Imagen model should I use?

If price is the main constraint, use the pricing table first because Imagen does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Imagen Product Recontext with multimodal inputs.

Imagen Models by Google DeepMind

Google DeepMindProprietary

9 models2024

Details

ResearcherGoogle DeepMind

LicenseProprietary

Commercial useCommercial use: conditional

Models9

Released2024

Capabilities

Vision4 of 9 models

Multimodal4 of 9 models

About

Google's Imagen family of image generation models, capable of producing high-quality, photorealistic images from text prompts. Includes Imagen 3.0 and 4.0 variants for different speed/quality trade-offs.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

9 in view

Imagen 4 UltraCurrent

Use when the workload needs image.

2024-11image

Imagen 4Current

Use when the workload needs image.

2024-11image

Imagen 4 FastCurrent

Use when the workload needs image.

2024-11image

Imagen Product RecontextCurrent

Use when the workload needs image and multimodal inputs.

2024-10imagemultimodal inputs

Virtual Try-OnCurrent

Use when the workload needs image and multimodal inputs.

2024-09imagemultimodal inputs

Imagen 3Current

Use when the workload needs image.

2024-08image

Imagen 3 FastCurrent

Use when the workload needs image.

2024-08image

Imagen 3 for Editing and CustomizationCurrent

Use when the workload needs image and multimodal inputs.

2024-08imagemultimodal inputs

Imagen 3Current

Use when the workload needs image and multimodal inputs.

2024-08imagemultimodal inputs

Current Imagen variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Imagen 4 Ultra	Use when the workload needs image.	2024-11	image	Current
Imagen 4	Use when the workload needs image.	2024-11	image	Current
Imagen 4 Fast	Use when the workload needs image.	2024-11	image	Current
Imagen Product Recontext	Use when the workload needs image and multimodal inputs.	2024-10	imagemultimodal inputs	Current
Virtual Try-On	Use when the workload needs image and multimodal inputs.	2024-09	imagemultimodal inputs	Current
Imagen 3	Use when the workload needs image.	2024-08	image	Current
Imagen 3 Fast	Use when the workload needs image.	2024-08	image	Current
Imagen 3 for Editing and Customization	Use when the workload needs image and multimodal inputs.	2024-08	imagemultimodal inputs	Current
Imagen 3	Use when the workload needs image and multimodal inputs.	2024-08	imagemultimodal inputs	Current

Release Timeline

4 release groups

2024-11

3 current

Imagen 4

image

Current

Imagen 4 Fast

image

Current

Imagen 4 Ultra

image

Current

2024-10

1 current

Imagen Product Recontext

imagemultimodal inputs

Current

2024-09

1 current

Virtual Try-On

imagemultimodal inputs

Current

2024-08

4 current

Imagen 3

image

Current

Imagen 3

imagemultimodal inputs

Current

Imagen 3 Fast

image

Current

Imagen 3 for Editing and Customization

imagemultimodal inputs

Current

Specifications(9 models)

Imagen model specifications comparison
Model	Released	Vision	Multimodal
Imagen 4 Ultra	2024-11	No	No
Imagen 4	2024-11	No	No
Imagen 4 Fast	2024-11	No	No
Imagen Product Recontext	2024-10	Yes	Yes
Virtual Try-On	2024-09	Yes	Yes
Imagen 3	2024-08	No	No
Imagen 3 Fast	2024-08	No	No
Imagen 3 for Editing and Customization	2024-08	Yes	Yes
Imagen 3	2024-08	Yes	Yes

Available From(3 providers)

GCP Vertex AI

Google AI Studio

Vercel AI Gateway

Frequently Asked Questions

What is Imagen used for?: Imagen is used for image and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Imagen compare to T5Gemma?: Imagen by Google DeepMind is strongest where you need image, while T5Gemma by Google DeepMind is the closest related family to check for agent workflows and tool use. Imagen has 9 listed variants, so compare the specs and pricing tables before choosing a production model.
Which Imagen model should I use?: If price is the main constraint, use the pricing table first because Imagen does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Imagen Product Recontext with multimodal inputs.

Models(9)

Imagen 4 Ultra

2024-113 providers

Imagen 4

2024-113 providers

Imagen 4 Fast

2024-113 providers

Imagen Product Recontext

Virtual Try-On

Imagen 3

Imagen 3 Fast

Imagen 3 for Editing and Customization

Imagen 3

Imagen Models by Google DeepMind

Details

Capabilities

About

Current Variants

Release Timeline

Specifications(9 models)

Available From(3 providers)

Frequently Asked Questions

Related Model Families

Models(9)