LLM ReferenceLLM Reference

Gemini 3.1 Flash Live Preview

gemini-3.1-flash-live-preview

Researched 29d ago

Last refreshed 2026-05-11. Next refresh: weekly.

ProprietaryMultimodalRAGAgentsLong contextVisionJSON / Tool use

Gemini 3.1 Flash Live Preview is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Decision context: RAG task fit, 1 tracked provider route, and research from 2026-04-19.

Use it for

  • Teams evaluating rag, agents, and long context
  • Workloads that can use a 128K context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Workloads where another current model has stronger sourced task evidence

Cheapest output

$4.50

Google AI Studio per 1M tokens

Provider routes

1

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-04-19

Researched 29d ago

fresh

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
Google AI Studio$0.750$4.50
Serverless

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

Google Gemini 3.1 Flash live preview model optimized for real-time multimodal interactions.

Gemini 3.1 Flash Live Preview has a 128K-token context window.

Gemini 3.1 Flash Live Preview input tokens at $0.75/1M, output at $4.5/1M.

Capabilities

VisionMultimodalFunction CallingTool UseStructured Outputs

Rankings

Specifications

Released2026-01-01
Context128K
ArchitectureDecoder Only
Specializationgeneral
LicenseProprietary
Trainingpretrained

Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014
Website

Providers(1)