LLM Reference
GCP Vertex AI

Gemini 2.0 Flash Live API on GCP Vertex AI

Gemini 2.0 · Google DeepMind

Serverless

Last refreshed 2026-05-11. Next refresh: weekly.

Why use Gemini 2.0 Flash Live API on GCP Vertex AI?

GCP Vertex AI offers Gemini 2.0 Flash Live API with pay-as-you-go pricing at $0.50/1M input tokens. Vertex AI is Google Cloud's managed AI platform, offering access to Gemini models and hundreds of partner models alongside tools for fine-tuning, grounding, vector search, and end-to-end MLOps pipelines.

Input / 1M: $0.50
Output / 1M: -
Cache: Not sourced
Batch: Not sourced

Setup recipe

Python + curl
Install:
pip install google-cloud-aiplatform
Auth:
export GOOGLE_CLOUD_PROJECT=...
gcloud auth application-default login  # sets up Application Default Credentials for local development
Call:
import os
import vertexai
from vertexai.generative_models import GenerativeModel
vertexai.init(project=os.environ["GOOGLE_CLOUD_PROJECT"], location="us-central1")
Model ID:
gemini-2.0-flash-live-001

Request example

import os
import vertexai
from vertexai.generative_models import GenerativeModel

# Reads GOOGLE_CLOUD_PROJECT from env; authenticates via Application Default Credentials
vertexai.init(project=os.environ["GOOGLE_CLOUD_PROJECT"], location="us-central1")
model = GenerativeModel("gemini-2.0-flash-live-001")
response = model.generate_content("Hello")
print(response.text)
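
Live models are typically reached through a bidirectional streaming session rather than a single generate_content call, so the request above may be rejected for this model ID. The following is a minimal sketch of a session-based call, not a confirmed recipe for this page. It assumes the separate Google Gen AI SDK (pip install google-genai), reuses the model ID listed on this page (Vertex AI may publish the Live model under a different ID), and requests text-only responses. The helper names live_settings and say_hello are hypothetical.

```python
import os

# Assumption: model ID copied from this page; verify the ID available in your project.
MODEL_ID = "gemini-2.0-flash-live-001"


def live_settings() -> dict:
    """Collect connection settings from the environment (pure, no network)."""
    return {
        "project": os.environ.get("GOOGLE_CLOUD_PROJECT", ""),
        "location": "us-central1",
        "model": MODEL_ID,
        "response_modalities": ["TEXT"],  # assumption: text-only replies
    }


async def say_hello(prompt: str = "Hello") -> str:
    """Open one Live session, send a single text turn, and return the reply text."""
    # Imported lazily so this module loads even when the SDK is not installed.
    from google import genai
    from google.genai import types

    settings = live_settings()
    client = genai.Client(
        vertexai=True,
        project=settings["project"],
        location=settings["location"],
    )
    config = types.LiveConnectConfig(
        response_modalities=settings["response_modalities"]
    )

    chunks = []
    async with client.aio.live.connect(model=settings["model"], config=config) as session:
        # Send one complete user turn.
        await session.send_client_content(
            turns=types.Content(role="user", parts=[types.Part(text=prompt)]),
            turn_complete=True,
        )
        # Collect streamed text until the model finishes its turn.
        async for message in session.receive():
            if message.text:
                chunks.append(message.text)
    return "".join(chunks)
```

To try it, run print(asyncio.run(say_hello())) after importing asyncio, with GOOGLE_CLOUD_PROJECT set and Application Default Credentials configured.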

Gotchas

  • For Google-published models use the model name directly, e.g. "gemini-2.0-flash-001". For third-party publishers (Anthropic, Meta, etc.) use the full publisher path, e.g. "publishers/anthropic/models/claude-3-5-sonnet-v2@20241022".
  • The examples read the project ID from GOOGLE_CLOUD_PROJECT. If your application uses a different variable name, update the os.environ lookup to match.
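
The two gotchas above can be sketched as small helpers. This is an illustration only: the function names model_reference and project_id are hypothetical, and the path format for third-party publishers is taken directly from the example string in the first gotcha.

```python
import os


def model_reference(model: str, publisher: str = "google") -> str:
    """Return the model identifier in the form described in the gotchas.

    Google-published models use the bare name; other publishers use the
    full "publishers/<publisher>/models/<model>" path.
    """
    if publisher == "google":
        return model
    return f"publishers/{publisher}/models/{model}"


def project_id(env_var: str = "GOOGLE_CLOUD_PROJECT") -> str:
    """Read the project ID from the environment, failing with a clear message."""
    value = os.environ.get(env_var)
    if not value:
        raise RuntimeError(f"Set {env_var} to your Google Cloud project ID")
    return value
```

Passing a different env_var keeps the lookup in one place, so a renamed variable only needs to change at the call site.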

Pricing

Type            Price (per 1M)
Input tokens    $0.50
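
To make the rate concrete, here is a minimal sketch that estimates input cost from a token count. It uses only the $0.50 per 1M input tokens figure listed on this page; output pricing is not listed here, so it is left out. The function name estimate_input_cost_usd is hypothetical.

```python
# Input price from the pricing table on this page (USD per 1M input tokens).
INPUT_PRICE_PER_MILLION_USD = 0.50


def estimate_input_cost_usd(input_tokens: int) -> float:
    """Estimate the input-token cost in USD at the listed pay-as-you-go rate."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return input_tokens / 1_000_000 * INPUT_PRICE_PER_MILLION_USD
```

For example, a 250,000-token input works out to 0.25 x $0.50 = $0.125.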

Capabilities

Vision, Multimodal, Function Calling, Tool Use, Structured Outputs

About Gemini 2.0 Flash Live API

Gemini 2.0 Flash Live API is the live variant of Gemini 2.0 Flash, intended for real-time multimodal streaming.

FAQ

What is the context window for Gemini 2.0 Flash Live API on GCP Vertex AI?

Gemini 2.0 Flash Live API supports a 1,000,000 token context window on GCP Vertex AI.

Who created Gemini 2.0 Flash Live API?

Gemini 2.0 Flash Live API was created by Google DeepMind as part of the Gemini 2.0 model family.

Is Gemini 2.0 Flash Live API open source?

Gemini 2.0 Flash Live API is not open source; it is listed as a proprietary model.

Model Specs

Released: 2025-03-01
Context: 1M
Architecture: Decoder Only

Provider

GCP Vertex AI

Google Cloud Platform (GCP)
