Gemini 3.1 Models by Google DeepMind
Details
Capabilities
Links
WebsiteAbout
Google DeepMind's Gemini 3.1 family refines Gemini 3 Pro with improved thinking, enhanced token efficiency, and optimizations for software engineering and agentic workflows.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
Use when the workload needs image generation, 131k context, and reasoning.
Use when the workload needs 1.05m context, tool use, and function calling.
Use when the workload needs 1m context, tool use, and function calling.
Use when the workload needs 1m context, tool use, and function calling.
Use when the workload needs 1m context, tool use, and function calling.
Use when the workload needs audio and 16k context.
Use when the workload needs 1m context, tool use, and function calling.
Use when the workload needs 1m context, structured outputs, and multimodal inputs.
Use when the workload needs 128k context, tool use, and function calling.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Nano Banana 2 (Gemini 3.1 Flash Image) | Use when the workload needs image generation, 131k context, and reasoning. | 2026-05 | image generation131k contextreasoning | Current |
| Gemini 3.1 Flash-Lite | Use when the workload needs 1.05m context, tool use, and function calling. | 2026-05 | 1.05m contexttool usefunction calling | Current |
| Gemini 3.1 Flash-Lite | Use when the workload needs 1m context, tool use, and function calling. | 2026-05 | 1m contexttool usefunction calling | Current |
| Gemini Deep Research Preview | Use when the workload needs 1m context, tool use, and function calling. | 2026-04 | 1m contexttool usefunction calling | Current |
| Gemini Deep Research Max Preview | Use when the workload needs 1m context, tool use, and function calling. | 2026-04 | 1m contexttool usefunction calling | Current |
| Gemini 3.1 Flash TTS Preview | Use when the workload needs audio and 16k context. | 2026-04 | audio16k context | Current |
| Gemini 3.1 Pro Preview | Use when the workload needs 1m context, tool use, and function calling. | 2026-02 | 1m contexttool usefunction calling | Current |
| Gemini 3.1 Pro Preview Custom Tools | Use when the workload needs 1m context, structured outputs, and multimodal inputs. | 2026-01 | 1m contextstructured outputsmultimodal inputs | Current |
| Gemini 3.1 Flash Live Preview | Use when the workload needs 128k context, tool use, and function calling. | 2026-01 | 128k contexttool usefunction calling | Current |
Release Timeline
5 release groupsReplaced By
Keep for legacy integrations; evaluate Gemini 3.1 Flash-Lite before new work.
Keep for legacy integrations; evaluate Nano Banana 2 (Gemini 3.1 Flash Image) before new work.
Specifications(11 models)
| Model | Released | Context | Vision | Multimodal | Reasoning | Fn Calling | Tool Use | Structured Outputs | Code Exec |
|---|---|---|---|---|---|---|---|---|---|
| Nano Banana 2 (Gemini 3.1 Flash Image) | 2026-05 | 131k | Yes | Yes | Yes | No | No | No | No |
| Gemini 3.1 Flash-Lite | 2026-05 | 1.05m | Yes | Yes | No | Yes | Yes | Yes | Yes |
| Gemini 3.1 Flash-Lite | 2026-05 | 1m | Yes | Yes | No | Yes | Yes | Yes | Yes |
| Gemini Deep Research Preview | 2026-04 | 1m | Yes | Yes | No | Yes | Yes | Yes | No |
| Gemini Deep Research Max Preview | 2026-04 | 1m | Yes | Yes | No | Yes | Yes | Yes | No |
| Gemini 3.1 Flash TTS Preview | 2026-04 | 16k | No | No | No | No | No | No | No |
| Gemini 3.1 Pro Preview | 2026-02 | 1m | Yes | Yes | No | Yes | Yes | Yes | Yes |
| Gemini 3.1 Pro Preview Custom Tools | 2026-01 | 1m | Yes | Yes | No | No | No | Yes | No |
| Gemini 3.1 Flash Live Preview | 2026-01 | 128k | Yes | Yes | No | Yes | Yes | Yes | No |
Available From(5 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Gemini 3.1 Flash-Lite | Google AI Studio | $0.25 | $1.5 | Serverless |
| Gemini 3.1 Flash-Lite | OpenRouter | $0.25 | $1.5 | Serverless |
| Gemini 3.1 Flash-Lite | Vercel AI Gateway | $0.25 | $1.5 | Serverless |
| Nano Banana 2 (Gemini 3.1 Flash Image) | Google AI Studio | $0.5 | $60 | Serverless |
| Gemini 3.1 Flash Live Preview | Google AI Studio | $0.75 | $4.5 | Serverless |
| Gemini 3.1 Flash TTS Preview | Google AI Studio | $1 | — | Serverless |
| Gemini 3.1 Pro Preview | Google AI Studio | $2 | $12 | Serverless |
| Gemini 3.1 Pro Preview | GCP Vertex AI | $2 | $12 | Serverless |
| Gemini 3.1 Pro Preview Custom Tools | OpenRouter | $2 | $12 | Serverless |
| Gemini 3.1 Pro Preview | OpenRouter | $2 | $12 | Serverless |
| Gemini Deep Research Preview | Google AI Studio | $2 | $12 | Serverless |
| Gemini Deep Research Max Preview | Google AI Studio | $2 | $12 | Serverless |
| Gemini 3.1 Pro Preview | Replicate API | $2 | $12 | Serverless |
| Gemini 3.1 Pro Preview | Vercel AI Gateway | $2 | $12 | Serverless |
Frequently Asked Questions
- What is Gemini 3.1 used for?
- Gemini 3.1 is used for image generation, audio, and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
- How does Gemini 3.1 compare to Gemma 4?
- Gemini 3.1 by Google DeepMind is strongest where you need image generation, while Gemma 4 by Google DeepMind is the closest related family to check for multimodal. Gemini 3.1 has 11 listed variants and reaches up to 1.05m context, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
- Which Gemini 3.1 model should I use?
- For the lowest listed input price, start with Gemini 3.1 Flash Lite Preview through Google AI Studio at $0.25/1M input tokens. For the most capable/latest local choice, evaluate Gemini 3.1 Flash-Lite with 1.05m context and tool use, function calling, structured outputs, and multimodal inputs.
Models(11)
Nano Banana 2 (Gemini 3.1 Flash Image)
Gemini 3.1 Flash-Lite
Gemini 3.1 Flash-Lite
Gemini Deep Research Preview
Gemini Deep Research Max Preview
Gemini 3.1 Flash TTS Preview
Gemini 3.1 Pro Preview
Gemini 3.1 Pro Preview Custom Tools
Gemini 3.1 Flash Live Preview



