Using Gemini 3.1 Flash-Lite on OpenRouter

Implementation guide · Gemini 3.1 · Google DeepMind

Serverless

Quick Start

1
Create an account at OpenRouter and generate an API key.
2
Use the OpenRouter SDK or REST API to call google/gemini-3.1-flash-lite — see the documentation for request format.
3
You'll be billed $0.25/1M input, $1.50/1M output tokens. See full pricing.

API Portal Documentation Pricing Model Card

Code Examples

See OpenRouter documentation for integration details.

About OpenRouter

OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.

OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.

View all models on OpenRouter →

Pricing on OpenRouter

Type	Price (per 1M)
Input tokens	$0.25
Output tokens	$1.50

Capabilities

VisionMultimodalFunction CallingTool UseStructured OutputsCode Execution

About Gemini 3.1 Flash-Lite

Gemini 3.1 Flash-Lite is Google's generally available low-latency Gemini 3.1 model, launched May 7, 2026. It is optimized for high-volume, cost-sensitive workloads with text, image, and video inputs, a 1M token context window, and a 66K token maximum output. The GA model uses the stable API ID gemini-3.1-flash-lite and replaces gemini-3.1-flash-lite-preview, which is scheduled to shut down on May 25, 2026. Pricing is $0.25 per 1M input tokens and $1.50 per 1M output tokens.

Full model details →

Model Specs

Released2026-05-07

Context1.05m

ArchitectureDecoder Only

Knowledge cutoff2025-01

OpenRouter, Inc.

New York, NY, USA