GCP Vertex AI Models — Pricing & Benchmarks

122 models available · Google Cloud Platform (GCP)

GCP Vertex AI hosts 122 AI models in this catalog. Sixteen of them are listed as free for both input and output, starting with Gemma 3. LLM Reference lets you compare these models across all 63 providers without switching tabs.

| Model | Input (per 1M tokens) | Output (per 1M tokens) | Context |
| --- | --- | --- | --- |
| Gemma 3 | Free | Free | |
| Gemma 3 12B (free) | Free | Free | 33K |
| Gemma 3 27B (free) | Free | Free | 131K |
| Gemma 3 4B IT | Free | Free | |
| Gemma 3n | Free | Free | |
| Gemma 4 E2B | Free | Free | 128K |
| Gemma 4 E2B IT | Free | Free | 128K |
| Gemma 4 E4B | Free | Free | 128K |
| Gemma 4 E4B IT | Free | Free | 128K |
| MedGemma | Free | Free | |
| MedSigLIP | Free | Free | |
| Multimodal Embeddings | Free | Free | |
| PaliGemma | Free | Free | |
| ShieldGemma 2 | Free | Free | |
| T5Gemma | Free | Free | |
| TxGemma | Free | Free | |
| Gemini 1.5 Flash on Google Vertex AI | $0.035 | $0.105 | 1M |
| Gemini 1.5 Flash 8B | $0.0375 | $0.15 | |
| Gemma 2B Instruct | $0.04 | $0.12 | 2K |
| Gemma 2 9B | $0.06 | $0.18 | 8K |
| Gemini 1.5 Flash on Google Vertex AI (Extended Context) | $0.07 | $0.21 | 1M |
| gpt-oss-20b | $0.07 | $0.25 | 131K |
| Gemini 1.5 Flash | $0.075 | $0.30 | 1M |
| Gemini 2.0 Flash-Lite | $0.075 | $0.30 | 1.048576M |
| Llama 2 7B Chat | $0.08 | $0.24 | 4K |
| Mistral 7B v0.1 | $0.08 | $0.24 | 8K |
| gpt-oss-120b | $0.09 | $0.36 | 131K |
| Gemini 2.0 Flash | $0.10 | $0.40 | 2M |
| Gemini 2.5 Flash Lite | $0.1 | $0.4 | 1M |
| Gemini 2.5 Flash Lite Preview 09-2025 | $0.1 | $0.4 | 1M |
| Gemma 7B | $0.10 | $0.30 | 8K |
| Gemma 7B Instruct | $0.10 | $0.30 | 8K |
| GLM-4 9B | $0.10 | $0.10 | 131K |
| text-embedding-004 on Google Vertex AI | $0.1 | | 2K |
| Llama 3 8B Instruct | $0.12 | $0.36 | 8K |
| CodeGemma on Google Vertex AI | $0.125 | $0.375 | 8K |
| Gemini 1.0 Pro on Google Vertex AI | $0.125 | $0.375 | 33K |
| Gemini 1.5 Pro on Google Vertex AI | $0.125 | $0.375 | 1M |
| Gemma 7B on Google Vertex AI | $0.125 | $0.375 | 8K |
| PaLM 2 (chat-bison) on Google Vertex AI | $0.125 | $0.375 | 8K |
| PaLM 2 (text-bison) on Google Vertex AI | $0.125 | $0.375 | 8K |
| Gemini 2.0 Flash Image Generation | $0.15 | $30 | 1.048576M |
| Gemini Embedding | $0.15 | Free | |
| Gemma 4 26B A4B IT | $0.15 | $0.60 | 256K |
| Gemma 4 31B IT | $0.15 | $0.60 | 256K |
| Mistral 7B Instruct | $0.15 | $0.20 | |
| Llama 2 13B Chat | $0.16 | $0.48 | 4K |
| Llama 4 Scout 17B-16E Instruct | $0.20 | $0.65 | 328K |
| Qwen3-Coder-480B-A35B-Instruct | $0.22 | $1.8 | 256K |
| Claude 3 Haiku | $0.25 | $1.25 | 200K |
| Gemini 1.5 Pro on Google Vertex AI (Extended Context) | $0.25 | $0.75 | 1M |
| Gemini 3.1 Flash Lite Preview | $0.25 | $1.5 | 1M |
| Gemini 2.5 Flash | $0.30 | $2.50 | 1M |
| Gemma 2 27B | $0.3 | $0.9 | 8K |
| Nano Banana (Gemini 2.5 Flash Image) | $0.3 | $30 | 33K |
| Mistral Large | $0.32 | $0.96 | 32K |
| Llama 4 Maverick 17B Instruct FP8 | $0.35 | $1.15 | 1M |
| Mixtral 8x7B | $0.40 | $1.20 | 32K |
| Chat Bison | $0.50 | $0.50 | |
| Gemini 1.0 Pro | $0.50 | $1.50 | 32K |
| Gemini 1.0 Pro Vision | $0.5 | $1.5 | 12K |
| Gemini 2.0 Flash Live API | $0.5 | | 1M |
| Gemini 2.5 Flash Live API | $0.5 | | 128K |
| Gemini 3 Flash | $0.50 | $3.00 | 1M |
| Gemini 3 Flash Preview | $0.5 | $3 | 1M |
| Kimi K2 | $0.50 | $2.00 | 262K |
| Nano Banana 2 (Gemini 3.1 Flash Image Preview) | $0.5 | $60 | 66K |
| Text Bison | $0.50 | $0.50 | |
| GLM-4.7 | $0.6 | $2.2 | |
| Kimi K2 Thinking | $0.6 | $2.5 | 256K |
| DeepSeek V3 | $0.75 | $3.00 | 64K |
| Claude 3.5 Haiku | $0.8 | $4 | 200K |
| Claude Haiku 4.5 | $0.8 | $4 | 200K |
| Llama 2 70B Chat | $0.80 | $2.40 | 4K |
| Gemini 1.0 Ultra | $1.00 | $3.00 | 1M |
| GLM-5 | $1 | $3.2 | 200K |
| Llama 3 70B Instruct | $1.20 | $3.60 | 8K |
| Gemini 1.5 Pro | $1.25 | $5.00 | 2M |
| Gemini 2.5 Pro | $1.25 | $10.00 | 1M |
| Gemini 2.5 Pro Computer Use Preview | $1.25 | $10 | 1.048576M |
| Gemini 3 Pro | $1.25 | $5.00 | 1M |
| DeepSeek R1 | $1.35 | $5.40 | 128K |
| DeepSeek R1 0528 | $1.35 | $5.4 | 160K |
| Gemini 3 Pro Preview | $2.00 | $12.00 | 1M |
| Gemini 3.1 Pro Preview | $2.00 | $12.00 | 1M |
| Claude 3 Sonnet | $3 | $15 | 200K |
| Claude 3.5 Sonnet | $3 | $15 | 200K |
| Claude 3.7 Sonnet | $3 | $15 | 200K |
| Claude Sonnet 4.5 | $3 | $15 | 200K |
| Claude Sonnet 4.6 | $3 | $15 | 1M |
| Nano Banana Pro (Gemini 3 Pro Image Preview) | $3.00 | $15.00 | 66K |
| Claude Opus 4.5 | $5 | $25 | 200K |
| Claude Opus 4.7 | $5 | $25 | 1M |
| Llama 3.1 405B Instruct | $5.00 | $16.00 | 128K |
| Llama 3.1-405B | $5 | $16 | 128K |
| Claude 3 Opus | $15 | $75 | 200K |
| Claude Opus 4.6 | $15 | $75 | 1M |
| Falcon 40B | | | |
| Falcon 7B | | | |
| Imagen 3 | | | |
| Imagen 3 | | | |
| Imagen 3 Fast | | | |
| Imagen 3 for Editing and Customization | | | |
| Imagen 4 | | | |
| Imagen 4 Fast | | | |
| Imagen 4 Ultra | | | |
| Imagen Product Recontext | | | |
| Lyria 2 | | | |
| Lyria 3 Clip | | | |
| Lyria 3 Pro | | | |
| MedLM Large | | | |
| MedLM Medium | | | |
| Veo 2 | | | |
| Veo 3 | | | |
| Veo 3 Fast | | | |
| Veo 3.1 | | | |
| Veo 3.1 Fast | | | |
| Vicuna 13B | | | 2K |
| Vicuna 13B 16K | | | 16K |
| Vicuna 7B | | | 2K |
| Vicuna 7B 16K | | | 16K |
| Virtual Try-On | | | |
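
To make the per-1M prices in the listing above easier to use, here is a minimal sketch of how a request cost could be estimated from them. It assumes the prices are US dollars per one million tokens, treats "Free" as zero, and ignores anything the listing does not show (tiered rates, caching discounts, per-image or per-second billing). The names `PRICES` and `estimate_cost` are hypothetical and are not part of any Vertex AI SDK.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)

# (input, output) price in USD per 1M tokens, copied from a few rows of the
# listing. "Free" is represented as 0. This dictionary is illustrative only.
PRICES = {
    "Gemma 3": (Decimal("0"), Decimal("0")),
    "Claude 3 Haiku": (Decimal("0.25"), Decimal("1.25")),
    "Gemini 2.5 Flash": (Decimal("0.30"), Decimal("2.50")),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from per-1M-token list prices."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 200,000 input tokens and 50,000 output tokens on Gemini 2.5 Flash.
    print(estimate_cost("Gemini 2.5 Flash", 200_000, 50_000))
```

Decimal is used instead of float so that small per-token amounts add up without binary rounding noise. Output prices in the listing are often several times the input price, so the ratio of output to input tokens in a workload matters as much as the headline input rate.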

Pricing Overview

Cheapest: $0.04/1M
Most expensive: $15.00/1M
Free-tier models: 16

About GCP Vertex AI

Google Cloud Vertex AI is a managed machine learning platform for developing, deploying, and managing AI models. It brings the tools and services for the full machine learning lifecycle into one interface. Key features include AutoML for building custom models with minimal coding, a managed notebook environment for prototyping, and MLOps tools for model monitoring and versioning. Vertex AI supports both pre-trained models and custom training, which covers applications such as natural language processing, image recognition, and predictive analytics.

Consolidating these tools into a single ecosystem reduces manual effort and makes collaboration between data scientists and engineers easier. The platform scales to large datasets and complex models, and billing is pay-as-you-go. Vertex AI also integrates with open-source frameworks such as TensorFlow and PyTorch, so teams can reuse existing models and tooling when building applications for their own needs.
