LLM ReferenceLLM Reference

Gemini 1.5 Flash

About

Gemini 1.5 Flash is a large language AI model by Google, crafted for speed and efficiency in high-volume scenarios 145. As a lightweight model, it's optimized for fast processing and cost-effectiveness, making it ideal for real-time applications and high-frequency tasks 567. With its multimodal capabilities, Gemini 1.5 Flash effectively processes and reasons across multiple data types, including text, images, audio, video, and PDFs 145. Despite its smaller size compared to Gemini 1.5 Pro, it excels in tasks like summarization, chat applications, and data extraction from lengthy documents, employing "knowledge distillation" to transfer essential knowledge from larger models 5. Additionally, it features an extensive context window of up to 1 million tokens, allowing it to manage large information volumes effectively 456.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(2)

Compare all →
ProviderInput (per 1M)Output (per 1M)Type
GCP Vertex AI$0.5$1.5Serverless
Google AI StudioServerless

Benchmark Scores(1)

BenchmarkScoreVersionSource
MMLU PRO59.1https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro

Rankings

Specifications

Released2024-05-14
Context1M
ArchitectureDecoder Only
Specializationgeneral
Trainingfinetuning

Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014
Website