LLM Reference
Replicate API

Gemini 3 Flash on Replicate API

Gemini 3 · Google DeepMind

Serverless

Pricing

TypePrice (per 1M)
Input tokens$0.50
Output tokens$3.00

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Gemini 3 Flash

Speed-optimized Gemini 3 model from Google DeepMind with frontier intelligence. Combines high performance with lower cost and latency. 1M token context window.

Get Started

Model Specs

Released2025-12-17
Context1M
ArchitectureDecoder Only
Knowledge cutoff2025-01

Related Models on Replicate API