LLM Reference
GCP Vertex AI

Models on GCP Vertex AI

102 models available · Google Cloud Platform (GCP)

ModelInput (per 1M)Output (per 1M)Context
Gemini 1.5 Flash on Google Vertex AI$0.035$0.1051M
Gemma 2 9B$0.06$0.188K
Gemini 1.5 Flash on Google Vertex AI (Extended Context)$0.07$0.211M
gpt-oss-20b$0.07$0.25131072
Gemini 2.0 Flash-Lite$0.075$0.31.048576M
gpt-oss-120b$0.09$0.36131072
Gemini 2.5 Flash Lite$0.1$0.41M
Gemini 2.5 Flash Lite Preview 09-2025$0.1$0.41M
text-embedding-004 on Google Vertex AI$0.12048
CodeGemma on Google Vertex AI$0.125$0.3758192
Gemini 1.0 Pro on Google Vertex AI$0.125$0.37532768
Gemini 1.5 Pro on Google Vertex AI$0.125$0.3751M
Gemma 7B on Google Vertex AI$0.125$0.3758192
PaLM 2 (chat-bison) on Google Vertex AI$0.125$0.3758192
PaLM 2 (text-bison) on Google Vertex AI$0.125$0.3758192
Gemini 2.0 Flash$0.15$0.62M
Gemini 2.0 Flash Image Generation$0.15$301.048576M
Gemini Embedding$0.15Free
Gemini 3 Pro Preview$0.2$121M
Gemini 3.1 Pro Preview$0.2$121M
Qwen3 Coder 480B A35B Instruct$0.22$1.8256K
Claude 3 Haiku$0.25$1.25200K
Gemini 1.5 Pro on Google Vertex AI (Extended Context)$0.25$0.751M
Gemini 3.1 Flash Lite Preview$0.25$1.51M
Llama 4 Scout 17B-16E Instruct$0.25$0.7328K
Gemini 2.5 Flash$0.3$2.51M
Gemma 2 27B$0.3$0.98K
Nano Banana (Gemini 2.5 Flash Image)$0.3$3033K
Llama 4 Maverick 17B Instruct FP8$0.35$1.151M
Gemini 1.0 Pro$0.5$1.532K
Gemini 1.0 Pro Vision$0.5$1.512K
Gemini 1.5 Flash$0.5$1.51M
Gemini 2.0 Flash Live API$0.51M
Gemini 2.5 Flash Live API$0.5128K
Gemini 3 Flash Preview$0.5$31M
Nano Banana 2 (Gemini 3.1 Flash Image Preview)$0.5$6066K
GLM-4.7$0.6$2.2
Kimi K2 Thinking$0.6$2.5256K
Claude Haiku 4.5$1$5200k
GLM-5$1$3.2
Gemini 2.5 Pro$1.25$101M
Gemini 2.5 Pro Computer Use Preview$1.25$101.048576M
DeepSeek R1 0528$1.35$5.4160K
Claude 3 Sonnet$3$15200K
Claude 3.5 Sonnet$3$15200K
Claude 3.7 Sonnet$3$15200K
Claude Sonnet 4.5$3$15200K
Claude Sonnet 4.6$3$151M
Claude Opus 4.5$5$25200K
Claude Opus 4.6$5$251M
Gemini 1.5 Pro$5$152M
Llama 3.1-405B$5$16128k
Claude 3 Opus$15$75200K
Chat Bison
Claude 3.5 Haiku
Falcon 40B
Falcon 7B
Gemma 2B Instruct2K
Gemma 3
Gemma 3n
Gemma 7B8K
Gemma 7B Instruct8K
Imagen 3
Imagen 3
Imagen 3 Fast
Imagen 3 for Editing and Customization
Imagen 4
Imagen 4 Fast
Imagen 4 Ultra
Imagen Product Recontext
Llama 2 13B Chat4K
Llama 2 70B Chat4K
Llama 2 7B Chat4K
Llama 3 70B Instruct8K
Llama 3 8B Instruct8K
Lyria 2
Lyria 3 Clip
Lyria 3 Pro
MedGemma
MedLM Large
MedLM Medium
MedSigLIP
Mistral 7B v0.18K
Mistral Large32k
Mixtral 8x7B32K
Multimodal EmbeddingsFree
Nano Banana Pro (Gemini 3 Pro Image Preview)$12066K
PaliGemma
ShieldGemma 2
T5Gemma
Text Bison
TxGemma
Veo 2
Veo 3
Veo 3 Fast
Veo 3.1
Veo 3.1 Fast
Vicuna 13B2K
Vicuna 13B 16K16K
Vicuna 7B2K
Vicuna 7B 16K16K
Virtual Try-On

Pricing Overview

Cheapest$0.04/1M
Most expensive$15.00/1M

About GCP Vertex AI

Google Cloud Vertex AI is a comprehensive machine learning platform that provides end-to-end solutions for developing, deploying, and managing AI models. The platform offers a unified interface that integrates various tools and services, enabling users to efficiently handle the entire machine learning lifecycle. Key features include AutoML capabilities for building custom models with minimal coding, a managed notebook environment for prototyping, and robust MLOps tools for model monitoring and versioning. Vertex AI supports both pre-trained models and custom training, making it versatile for a wide range of applications such as natural language processing, image recognition, and predictive analytics. The platform's design focuses on increasing productivity and accelerating time-to-market for AI solutions. By consolidating multiple AI tools into a single ecosystem, Vertex AI reduces manual effort and enhances collaboration among data scientists and engineers. Its scalable architecture allows organizations to efficiently manage large datasets and complex models, while the pay-as-you-go pricing model makes it accessible for businesses of all sizes. Additionally, Vertex AI's integration with popular open-source frameworks like TensorFlow and PyTorch enables users to leverage existing models and tools, fostering innovation and facilitating the development of customized AI applications tailored to specific business needs.

Full provider profile →