LLM ReferenceLLM Reference

Gemini 2.0 Flash-Lite

gemini-2.0-flash-lite

DeprecatedResearched 29d ago

Last refreshed 2026-05-11. Next refresh: weekly.

ProprietaryMultimodalRAGAgentsLong contextVisionJSON / Tool use

Gemini 2.0 Flash-Lite is a legacy integration reference; keep it only while you identify a current replacement.

Decision context: RAG task fit, 2 tracked provider routes, and research from 2026-04-19.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 1.048576M context window
  • Buyers comparing 2 tracked provider routes

Do not use it for

  • New production launches

Cheapest output

$0.300

GCP Vertex AI per 1M tokens

Provider routes

2

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-04-19

Researched 29d ago

fresh

This API model is marked deprecated. Verify the replacement path before sending new production traffic.

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
GCP Vertex AI$0.075$0.300
Serverless
Google AI Studio$0.075$0.300
Serverless

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

Deprecated, shut down June 1, 2026. AI Studio: $0.075/$0.30. Free tier available.

Gemini 2.0 Flash-Lite has a 1.0486M-token context window.

Gemini 2.0 Flash-Lite input tokens at $0.075/1M, output at $0.3/1M.

Capabilities

VisionMultimodalFunction CallingTool UseStructured Outputs

Rankings

Specifications

Released2025-02-12
Context1.048576M
ArchitectureDecoder Only
Specializationgeneral
LicenseProprietary
Trainingpretrained

Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014
Website