Gemini 3.1 Flash Lite Preview
gemini-3.1-flash-lite-preview
Last refreshed 2026-05-14. Next refresh: weekly.
Gemini 3.1 Flash Lite Preview is a legacy integration reference; evaluate Gemini 3.1 Flash-Lite before starting new work.
Decision context: Agents task fit, 3 tracked provider routes, and research from 2026-05-14.
Use it for
- Teams maintaining an existing integration
- Workloads that can use a 1M context window
- Buyers comparing 3 tracked provider routes
Do not use it for
- New production launches
| Metric | Value | Note |
|---|---|---|
| Cheapest output | $1.50 | GCP Vertex AI per 1M tokens |
| Provider routes | 3 | Tracked API hosts |
| Quality / dollar | Unknown | No output-token price in the ladder |
| Freshness | 2026-05-14 | Researched 4d ago |
This API model is marked deprecated. Use Gemini 3.1 Flash-Lite as the replacement candidate before sending new production traffic.
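For teams maintaining an existing integration, the swap to the replacement candidate can be sketched as a small routing helper. This is a minimal illustration, not an official migration path: the helper names `resolve_model_id` and `days_until_shutdown` are hypothetical, the model IDs and the 2026-05-25 shutdown date are taken from this page's About section, and the sketch assumes the replacement accepts the same request shape, which should be verified before switching traffic.

```python
from datetime import date

# Values taken from this page; verify against the provider's own notices.
DEPRECATED_ID = "gemini-3.1-flash-lite-preview"
REPLACEMENT_ID = "gemini-3.1-flash-lite"
SHUTDOWN = date(2026, 5, 25)


def resolve_model_id(requested: str) -> str:
    """Map the deprecated preview ID to its replacement; pass others through."""
    return REPLACEMENT_ID if requested == DEPRECATED_ID else requested


def days_until_shutdown(today: date) -> int:
    """Days remaining before the listed shutdown date (negative once past)."""
    return (SHUTDOWN - today).days


if __name__ == "__main__":
    print(resolve_model_id(DEPRECATED_ID))
    print(days_until_shutdown(date.today()))
```

Keeping the mapping in one place means callers that still pass the preview ID are redirected without code changes elsewhere.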
Top use-case fit
- Coding: Included by capability and metadata signals in the decision map.
- RAG: Included by capability and metadata signals in the decision map.
- Agents: 1 relevant benchmark in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| GCP Vertex AI | $0.250 | $1.50 | Serverless |
| Google AI Studio | $0.250 | $1.50 | Serverless |
| OpenRouter | $0.250 | $1.50 | Serverless |
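The per-1M-token rates in the ladder above can be turned into a rough per-request estimate. The sketch below assumes flat pricing at the listed rates ($0.25 input, $1.50 output per 1M tokens) with no tiering, caching discounts, or provider fees, none of which this page documents; the function name `estimate_cost_usd` is hypothetical.

```python
# Rates copied from the provider price ladder (USD per 1M tokens).
INPUT_USD_PER_M = 0.25
OUTPUT_USD_PER_M = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD at the listed flat rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 200k input tokens and 10k output tokens.
    print(f"${estimate_cost_usd(200_000, 10_000):.4f}")
```

Because all three tracked routes list the same rates, the estimate is identical across providers under these assumptions.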
Benchmark peer bars for Agents
Migration checks
No linked migration route is available for this model yet.
About
This preview-stage Gemini 3.1 Flash-Lite model was deprecated on 2026-05-11, with shutdown on 2026-05-25; use the GA model gemini-3.1-flash-lite instead.
Gemini 3.1 Flash Lite Preview has a 1M-token context window.
Gemini 3.1 Flash Lite Preview input tokens cost $0.25 per 1M, and output tokens cost $1.50 per 1M.
Capabilities
Benchmark Scores (1)
| Benchmark | Score | Version | Source |
|---|---|---|---|
| MultiChallenge | 60.6 | MultiChallenge | https://labs.scale.com/leaderboard/multichallenge |