What is Lyria used for?

Lyria is used for audio and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.

How does Lyria compare to T5Gemma?

Lyria by Google DeepMind is strongest where you need audio, while T5Gemma by Google DeepMind is the closest related family to check for agent workflows and tool use. Lyria has 4 listed variants, so compare the specs and pricing tables before choosing a production model.

Which Lyria model should I use?

If price is the main constraint, use the pricing table first because Lyria does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Lyria 3 Pro with multimodal inputs.

Lyria Models by Google DeepMind

Google DeepMindProprietary

4 models2024–2025

Details

ResearcherGoogle DeepMind

LicenseProprietary

Commercial useCommercial use: conditional

Models4

Released2024–2025

Capabilities

Vision2 of 4 models

Multimodal2 of 4 models

About

Google DeepMind's Lyria family of music and audio generation models. Creates high-quality music tracks, audio clips, and real-time audio from text prompts.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

4 in view

Lyria 3 ProCurrent

Use when the workload needs audio and multimodal inputs.

2025-01audiomultimodal inputs

Lyria 3 ClipCurrent

Use when the workload needs audio and multimodal inputs.

2025-01audiomultimodal inputs

Lyria RealTimeCurrent

Use when the workload needs audio.

2024-11audio

Lyria 2Current

Use when the workload needs audio.

2024-06audio

Current Lyria variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
Lyria 3 Pro	Use when the workload needs audio and multimodal inputs.	2025-01	audiomultimodal inputs	Current
Lyria 3 Clip	Use when the workload needs audio and multimodal inputs.	2025-01	audiomultimodal inputs	Current
Lyria RealTime	Use when the workload needs audio.	2024-11	audio	Current
Lyria 2	Use when the workload needs audio.	2024-06	audio	Current

Release Timeline

3 release groups

2025-01

2 current

Lyria 3 Clip

audiomultimodal inputs

Current

Lyria 3 Pro

audiomultimodal inputs

Current

2024-11

1 current

Lyria RealTime

audio

Current

2024-06

1 current

Lyria 2

audio

Current

Specifications(4 models)

Lyria model specifications comparison
Model	Released	Vision	Multimodal
Lyria 3 Pro	2025-01	Yes	Yes
Lyria 3 Clip	2025-01	Yes	Yes
Lyria RealTime	2024-11	No	No
Lyria 2	2024-06	No	No

Available From(3 providers)

GCP Vertex AI

Google AI Studio

OpenRouter

Frequently Asked Questions

What is Lyria used for?: Lyria is used for audio and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Lyria compare to T5Gemma?: Lyria by Google DeepMind is strongest where you need audio, while T5Gemma by Google DeepMind is the closest related family to check for agent workflows and tool use. Lyria has 4 listed variants, so compare the specs and pricing tables before choosing a production model.
Which Lyria model should I use?: If price is the main constraint, use the pricing table first because Lyria does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate Lyria 3 Pro with multimodal inputs.

Models(4)

Lyria 3 Pro

Lyria 3 Clip

Lyria RealTime

Lyria 2

Lyria Models by Google DeepMind

Details

Capabilities

About

Current Variants

Release Timeline

Specifications(4 models)

Available From(3 providers)

Frequently Asked Questions

Related Model Families

Models(4)