LLM Reference
GCP Vertex AI

GCP Vertex AI Models — Pricing & Benchmarks

127 models available · Google Cloud Platform (GCP)

GCP Vertex AI hosts 127 AI models in this catalog. The lowest listed input price is Gemma 3 at Free, with 13 free-tier models. LLM Reference lets you compare these models across all 80 providers without switching tabs.

ModelInput (per 1M)Output (per 1M)Context
Gemma 3FreeFree
Gemma 3nFreeFree32k
Gemma 4 E2BFreeFree128k
Gemma 4 E2B ITFreeFree128k
Gemma 4 E4BFreeFree128k
Gemma 4 E4B ITFreeFree128k
MedGemmaFreeFree
MedSigLIPFreeFree
PaliGemmaFreeFree
ShieldGemma 2FreeFree
T5GemmaFreeFree
TxGemmaFreeFree
Vertex AI Multimodal EmbeddingsFreeFree
Gemini 1.5 Flash on Google Vertex AI$0.035$0.1051m
Gemini 1.5 Flash 8B$0.037$0.151m
Gemma 2B Instruct$0.04$0.122k
Gemma 3 12B$0.04$0.1333k
Gemma 3 4B IT$0.04$0.08128k
Gemma 2 9B$0.06$0.188k
Gemini 1.5 Flash on Google Vertex AI (Extended Context)$0.07$0.211m
gpt-oss-20b$0.07$0.25131k
Gemini 1.5 Flash$0.075$0.31m
Gemini 2.0 Flash-Lite$0.075$0.31.05m
Gemma 3 27B$0.08$0.16131k
Llama 2 7B Chat$0.08$0.244k
Mistral 7B v0.1$0.08$0.248k
gpt-oss-120b$0.09$0.36131k
Gemini 2.5 Flash Lite$0.1$0.41m
Gemini 2.5 Flash Lite Preview 09-2025$0.1$0.41m
Gemma 7B$0.1$0.38k
Gemma 7B Instruct$0.1$0.38k
GLM-4 9B$0.1$0.1131k
text-embedding-004 on Google Vertex AI$0.12k
Llama 3 8B Instruct$0.12$0.368k
CodeGemma on Google Vertex AI$0.125$0.3758k
Gemini 1.0 Pro on Google Vertex AI$0.125$0.37533k
Gemini 1.5 Pro on Google Vertex AI$0.125$0.3751m
Gemma 7B on Google Vertex AI$0.125$0.3758k
PaLM 2 (chat-bison) on Google Vertex AI$0.125$0.3758k
PaLM 2 (text-bison) on Google Vertex AI$0.125$0.3758k
Gemini 2.0 Flash$0.15$0.62m
Gemini 2.0 Flash Image Generation$0.15$30.001.05m
Gemini Embedding$0.15Free
Gemma 4 26B A4B IT$0.15$0.6256k
Gemma 4 31B IT$0.15$0.6256k
Mistral 7B Instruct$0.15$0.2
Llama 2 13B Chat$0.16$0.484k
Llama 4 Scout 17B-16E Instruct$0.2$0.6510m
Qwen3-Coder-480B-A35B-Instruct$0.22$1.80262k
Claude 3 Haiku$0.25$1.25200k
Gemini 1.5 Pro on Google Vertex AI (Extended Context)$0.25$0.751m
Gemini 3.1 Flash Lite Preview$0.25$1.501m
Gemini 2.5 Flash$0.3$2.501m
Gemma 2 27B$0.3$0.98k
Nano Banana (Gemini 2.5 Flash Image)$0.3$30.0033k
Mistral Large$0.32$0.9632k
Llama 4 Maverick 17B Instruct FP8$0.35$1.151m
Mixtral 8x7B$0.4$1.2032k
Chat Bison$0.5$0.58k
Gemini 1.0 Pro$0.5$1.5032k
Gemini 1.0 Pro Vision$0.5$1.5012k
Gemini 2.0 Flash Live API$0.51m
Gemini 2.5 Flash Live API$0.5128k
Gemini 3 Flash$0.5$3.001m
Gemini 3 Flash Preview$0.5$3.001m
Kimi K2$0.5$2.00262k
Nano Banana 2 (Gemini 3.1 Flash Image Preview)$0.5$60.0066k
Text Bison$0.5$0.58k
GLM-4.7$0.6$2.20128k
Kimi K2 Thinking$0.6$2.50256k
DeepSeek V3$0.75$3.0064k
Claude 3.5 Haiku$0.8$4.00200k
Claude Haiku 4.5$0.8$4.00200k
Llama 2 70B Chat$0.8$2.404k
Gemini 1.0 Ultra$1.00$3.001m
GLM-5$1.00$3.20200k
Llama 3 70B Instruct$1.20$3.608k
Gemini 1.5 Pro$1.25$5.002m
Gemini 2.5 Pro$1.25$10.001m
Gemini 2.5 Pro Computer Use Preview$1.25$10.001.05m
Gemini 3 Pro$1.25$5.001m
DeepSeek R1$1.35$5.40128k
DeepSeek R1 0528$1.35$5.40130k
Gemini 3.5 Flash$1.50$9.001.05m
Gemini 3 Pro Preview$2.00$12.001m
Gemini 3.1 Pro Preview$2.00$12.001m
Claude 3 Sonnet$3.00$15.00200k
Claude 3.5 Sonnet$3.00$15.00200k
Claude 3.7 Sonnet$3.00$15.00200k
Claude Sonnet 4.5$3.00$15.00200k
Claude Sonnet 4.6$3.00$15.001m
Claude Sonnet 5$3.00$15.001m
Nano Banana Pro (Gemini 3 Pro Image Preview)$3.00$15.0066k
Claude Opus 4.5$5.00$25.00200k
Claude Opus 4.7$5.00$25.001m
Llama 3.1 405B Instruct$5.00$16.00128k
Llama 3.1-405B$5.00$16.00128k
Claude Fable 5$10.00$50.001m
Claude 3 Opus$15.00$75.00200k
Claude Opus 4.6$15.00$75.001m
Claude Mythos 51m
Claude Opus 4.81m
Falcon 40B
Falcon 7B
Imagen 3
Imagen 3
Imagen 3 Fast
Imagen 3 for Editing and Customization
Imagen 4
Imagen 4 Fast
Imagen 4 Ultra
Imagen Product Recontext
Lyria 2
Lyria 3 Clip
Lyria 3 Pro
MedLM Large32k
MedLM Medium32k
Veo 2
Veo 3
Veo 3 Fast
Veo 3.1
Veo 3.1 Fast
Vicuna 13B2k
Vicuna 13B 16K16k
Vicuna 7B2k
Vicuna 7B 16K16k
Virtual Try-On

Where else to run this

Pricing Overview

Cheapest$0.04/1M
Most expensive$15.00/1M
13 free tier models

About GCP Vertex AI

Google Cloud Vertex AI is a comprehensive machine learning platform that provides end-to-end solutions for developing, deploying, and managing AI models. The platform offers a unified interface that integrates various tools and services, enabling users to efficiently handle the entire machine learning lifecycle. Key features include AutoML capabilities for building custom models with minimal coding, a managed notebook environment for prototyping, and robust MLOps tools for model monitoring and versioning. Vertex AI supports both pre-trained models and custom training, making it versatile for a wide range of applications such as natural language processing, image recognition, and predictive analytics. The platform's design focuses on increasing productivity and accelerating time-to-market for AI solutions. By consolidating multiple AI tools into a single ecosystem, Vertex AI reduces manual effort and enhances collaboration among data scientists and engineers. Its scalable architecture allows organizations to efficiently manage large datasets and complex models, while the pay-as-you-go pricing model makes it accessible for businesses of all sizes. Additionally, Vertex AI's integration with popular open-source frameworks like TensorFlow and PyTorch enables users to leverage existing models and tools, fostering innovation and facilitating the development of customized AI applications tailored to specific business needs.

Full provider profile →