Compare Gemini 3 Flash Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| Replicate API | $0.50 | $3.00 |
| GCP Vertex AI | $0.10 | $0.40 |
Pricing
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.10 |
| Output tokens | $0.40 |
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution
About Gemini 3 Flash
Speed-optimized Gemini 3 model from Google DeepMind with frontier intelligence. Combines high performance with lower cost and latency. 1M token context window.
Model Specs
Released2025-12-17
Context1M
ArchitectureDecoder Only
Knowledge cutoff2025-01