Gemini 1.5 Flash on GCP Vertex AI

Gemini 1.5 · Google DeepMind

Serverless

Pricing

Type            Price (per 1M)
Input tokens    $0.50
Output tokens   $1.50
Image input     $0.13
Video input     $0.47
Audio input     $0.04
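
As a rough illustration of how the token prices above translate into a per-request cost, here is a minimal sketch. It uses only the input and output token rows, since the table does not say what unit the image, video, and audio prices are counted in. The function name `estimate_cost` and the workload figures are hypothetical, chosen for the example.

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {"input": 0.50, "output": 1.50}


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one text-only request."""
    total = (
        input_tokens * PRICE_PER_MILLION["input"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


# Hypothetical workload: 200,000 input tokens and 4,000 output tokens.
print(f"${estimate_cost(200_000, 4_000):.4f}")  # prints $0.1060
```

Output tokens cost three times as much as input tokens here, so long-document summarization (large input, short output) stays comparatively cheap.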

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Tool Use, JSON Mode, Code Execution

About Gemini 1.5 Flash

Gemini 1.5 Flash is a large language model from Google, built for speed and efficiency in high-volume scenarios [1][4][5]. As a lightweight model, it is optimized for fast, low-cost processing, which suits real-time applications and high-frequency tasks [5][6][7]. It is multimodal: it processes and reasons across text, images, audio, video, and PDFs [1][4][5]. Although smaller than Gemini 1.5 Pro, it performs well at summarization, chat applications, and data extraction from long documents, and it relies on knowledge distillation to transfer essential knowledge from larger models [5]. Its context window extends to 1 million tokens, which lets it handle large volumes of information [4][5][6].

Get Started