LLM Reference

Gemini 3.1 Models by Google DeepMind

Google DeepMindProprietaryHighlight
11 models2026Up to 1.05m ctxFrom $0.25/1M input

Details

ResearcherGoogle DeepMind
LicenseProprietary
Commercial useCommercial use: conditional
Models11
Released2026
Max context1.05m

Capabilities

Vision9 of 11 models
Multimodal9 of 11 models
Reasoning1 of 11 models
Function Calling7 of 11 models
Tool Use7 of 11 models
Structured Outputs8 of 11 models
Code Execution4 of 11 models

Links

Website

About

Google DeepMind's Gemini 3.1 family refines Gemini 3 Pro with improved thinking, enhanced token efficiency, and optimizations for software engineering and agentic workflows.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

9 in view2 retired

Use when the workload needs image generation, 131k context, and reasoning.

2026-05image generation131k contextreasoning

Use when the workload needs 1.05m context, tool use, and function calling.

2026-051.05m contexttool usefunction calling

Use when the workload needs 1m context, tool use, and function calling.

2026-051m contexttool usefunction calling

Use when the workload needs 1m context, tool use, and function calling.

2026-041m contexttool usefunction calling

Use when the workload needs 1m context, tool use, and function calling.

2026-041m contexttool usefunction calling

Use when the workload needs audio and 16k context.

2026-04audio16k context

Use when the workload needs 1m context, tool use, and function calling.

2026-021m contexttool usefunction calling

Use when the workload needs 1m context, structured outputs, and multimodal inputs.

2026-011m contextstructured outputsmultimodal inputs

Use when the workload needs 128k context, tool use, and function calling.

2026-01128k contexttool usefunction calling

Release Timeline

5 release groups
2026-05
3 current
Gemini 3.1 Flash-Lite
1.05m contexttool usefunction calling
Current
Gemini 3.1 Flash-Lite
1m contexttool usefunction calling
Current
Nano Banana 2 (Gemini 3.1 Flash Image)
image generation131k contextreasoning
Current
2026-04
3 current
Current
Gemini Deep Research Max Preview
1m contexttool usefunction calling
Current
Gemini Deep Research Preview
1m contexttool usefunction calling
Current
2026-03
2 retired
Gemini 3.1 Flash Lite Preview
1m contexttool usefunction calling
Replaced
2026-02
1 current
Gemini 3.1 Pro Preview
1m contexttool usefunction calling
Current
2026-01
2 current
Gemini 3.1 Flash Live Preview
128k contexttool usefunction calling
Current
Gemini 3.1 Pro Preview Custom Tools
1m contextstructured outputsmultimodal inputs
Current

Replaced By

Keep for legacy integrations; evaluate Gemini 3.1 Flash-Lite before new work.

Keep for legacy integrations; evaluate Nano Banana 2 (Gemini 3.1 Flash Image) before new work.

Specifications(11 models)

Gemini 3.1 model specifications comparison
ModelReleasedContextVisionMultimodalReasoningFn CallingTool UseStructured OutputsCode Exec
Nano Banana 2 (Gemini 3.1 Flash Image)2026-05131kYesYesYesNoNoNoNo
Gemini 3.1 Flash-Lite2026-051.05mYesYesNoYesYesYesYes
Gemini 3.1 Flash-Lite2026-051mYesYesNoYesYesYesYes
Gemini Deep Research Preview2026-041mYesYesNoYesYesYesNo
Gemini Deep Research Max Preview2026-041mYesYesNoYesYesYesNo
Gemini 3.1 Flash TTS Preview2026-0416kNoNoNoNoNoNoNo
Gemini 3.1 Pro Preview2026-021mYesYesNoYesYesYesYes
Gemini 3.1 Pro Preview Custom Tools2026-011mYesYesNoNoNoYesNo
Gemini 3.1 Flash Live Preview2026-01128kYesYesNoYesYesYesNo

Frequently Asked Questions

What is Gemini 3.1 used for?
Gemini 3.1 is used for image generation, audio, and vision and multimodal work. The family description and listed model capabilities point to those workloads as the best fit.
How does Gemini 3.1 compare to Gemma 4?
Gemini 3.1 by Google DeepMind is strongest where you need image generation, while Gemma 4 by Google DeepMind is the closest related family to check for multimodal. Gemini 3.1 has 11 listed variants and reaches up to 1.05m context, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
Which Gemini 3.1 model should I use?
For the lowest listed input price, start with Gemini 3.1 Flash Lite Preview through Google AI Studio at $0.25/1M input tokens. For the most capable/latest local choice, evaluate Gemini 3.1 Flash-Lite with 1.05m context and tool use, function calling, structured outputs, and multimodal inputs.