LLM Reference
GCP Vertex AI

Gemma 7B on Google Vertex AI on GCP Vertex AI

Gemma · Google DeepMind

ServerlessOpen Weights

Last refreshed 2026-06-15. Next refresh: weekly.

Why use Gemma 7B on Google Vertex AI on GCP Vertex AI?

GCP Vertex AI offers Gemma 7B on Google Vertex AI with pay-as-you-go pricing at $0.13/1M input tokens. Vertex AI is Google Cloud's managed AI platform, offering access to Gemini models and hundreds of partner models alongside tools for fine-tuning, grounding, vector search, and end-to-end MLOps pipelines.

Input / 1M
$0.125
Output / 1M
$0.375
Cache
Not sourced
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install google-cloud-aiplatform
Auth
export GOOGLE_CLOUD_PROJECT=...
Call
import os
import vertexai
from vertexai.generative_models import GenerativeModel
vertexai.init(project=os.environ["GOOGLE_CLOUD_PROJECT"], location="us-central1")
Model ID
vertex-gemma-7b

Request example

import os
import vertexai
from vertexai.generative_models import GenerativeModel

# Reads GOOGLE_CLOUD_PROJECT from env; authenticates via Application Default Credentials
vertexai.init(project=os.environ["GOOGLE_CLOUD_PROJECT"], location="us-central1")
model = GenerativeModel("vertex-gemma-7b")
response = model.generate_content("Hello")
print(response.text)

Gotchas

  • For Google-published models use the model name directly, e.g. "gemini-2.0-flash-001". For third-party publishers (Anthropic, Meta, etc.) use the full publisher path, e.g. "publishers/anthropic/models/claude-3-5-sonnet-v2@20241022".
  • The examples expect GOOGLE_CLOUD_PROJECT; rename it only if your application config maps the new variable.

Pricing

TypePrice (per 1M)
Input tokens$0.13
Output tokens$0.38

Capabilities

Structured Outputs

About Gemma 7B on Google Vertex AI

Gemma 7B on Google Vertex AI is Google DeepMind's Gemma model. It offers an 8K-token context window with weights openly available for self-hosting.

FAQ

What does Gemma 7B on Google Vertex AI cost on GCP Vertex AI?

On GCP Vertex AI, Gemma 7B on Google Vertex AI costs $0.125 per 1M input tokens and $0.375 per 1M output tokens.

What is the context window for Gemma 7B on Google Vertex AI on GCP Vertex AI?

Gemma 7B on Google Vertex AI supports a 8k token context window on GCP Vertex AI.

Who created Gemma 7B on Google Vertex AI?

Gemma 7B on Google Vertex AI was created by Google DeepMind as part of the Gemma model family.

Is Gemma 7B on Google Vertex AI open source?

Gemma 7B on Google Vertex AI has open weights under Gemma according to the seed data, but that does not necessarily mean an OSI-approved open-source license.

Get Started

Model Specs

Released2024-02-21
Parameters7B
Context8k
ArchitectureDecoder Only

Related Models on GCP Vertex AI