LLM Reference

Gemini 2.5 Flash

ProprietaryMultimodal

About

Best price-performance Gemini 2.5 model for low-latency, high-volume tasks that require reasoning. First hybrid reasoning model supporting 1M token context with thinking budgets. $0.30 input / $2.50 output per 1M tokens.

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Specifications

Released2025-06-17
Context1M
ArchitectureDecoder Only
Knowledge cutoff2025-01
Specializationgeneral
LicenseProprietary