Gemini 3 Flash
gemini-3-flash
Last refreshed 2026-05-17. Next refresh: weekly.
Gemini 3 Flash is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.
Decision context: Agents task fit, 3 tracked provider routes, and research from 2026-05-17.
Use it for
- Teams evaluating coding, rag, and agents
- Workloads that can use a 1M context window
- Buyers comparing 3 tracked provider routes
Do not use it for
- Workloads where another current model has stronger sourced task evidence
Cheapest output
$3.00
GCP Vertex AI per 1M tokens
Provider routes
3
Tracked API hosts
Quality / dollar
Grade C
Ranked by benchmark score divided by cheapest output price
Freshness
2026-05-17
Researched 1d ago
Top use-case fit
Coding
Included by capability and metadata signals in the decision map.
RAG
Included by capability and metadata signals in the decision map.
Agents
Q/$ C1 relevant benchmark in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| GCP Vertex AI | $0.500 | $3.00 | Serverless |
| Google AI Studio | $0.500 | $3.00 | Serverless |
| Replicate API | $0.500 | $3.00 | Serverless |
Benchmark peer barsfor Agents
Migration checks
No linked migration route is available for this model yet.
About
Gemini 3 Flash is Google's speed-optimized Gemini 3 model, available in public preview via the Gemini API and Vertex AI. It supports text, image, audio, and video inputs with a 1M token context window and is priced at $0.50 per 1M input tokens and $3.00 per 1M output tokens.
Gemini 3 Flash has a 1M-token context window.
Gemini 3 Flash input tokens at $0.5/1M, output at $3/1M.
Capabilities
Benchmark Scores(3)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| MMLU PRO | 88.6 | — | https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro |
| τ-bench | 71.5 | τ-bench | https://taubench.com/ |
| Chatbot Arena | 1467.0 | — | https://arena.ai/leaderboard |