LLM ReferenceLLM Reference
GCP Vertex AI

Gemini 3 Flash on GCP Vertex AI

Gemini 3 · Google DeepMind

Serverless

Compare Gemini 3 Flash Across Providers

ProviderInput (per 1M)Output (per 1M)
Replicate API$0.50$3.00
GCP Vertex AI$0.10$0.40

Pricing

TypePrice (per 1M)
Input tokens$0.10
Output tokens$0.40

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

About Gemini 3 Flash

Speed-optimized Gemini 3 model from Google DeepMind with frontier intelligence. Combines high performance with lower cost and latency. 1M token context window.

Get Started

Model Specs

Released2025-12-17
Context1M
ArchitectureDecoder Only
Knowledge cutoff2025-01

Provider

GCP Vertex AI
GCP Vertex AI

Google Cloud Platform (GCP)

All models on GCP Vertex AI