LLM ReferenceLLM Reference
GCP Vertex AI

Gemini 1.5 Flash 8B on GCP Vertex AI

Gemini 1.5 · Google DeepMind

Serverless

Pricing

TypePrice (per 1M)
Input tokens$0.04
Output tokens$0.15

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

About Gemini 1.5 Flash 8B

Lightweight 8B variant of Gemini 1.5 Flash optimized for speed and cost-efficiency. Supports 1M token context with fast inference for real-time applications.

Get Started