Using Gemini 3.1 Flash-Lite on OpenRouter
Implementation guide · Gemini 3.1 · Google DeepMind
Quick Start
- 1
- 2Use the OpenRouter SDK or REST API to call
google/gemini-3.1-flash-lite— see the documentation for request format. - 3
Code Examples
About OpenRouter
OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.
OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.
Pricing on OpenRouter
| Type | Price (per 1M) |
|---|---|
| Input tokens | $0.25 |
| Output tokens | $1.50 |
Capabilities
About Gemini 3.1 Flash-Lite
Gemini 3.1 Flash-Lite is Google's generally available low-latency Gemini 3.1 model, launched May 7, 2026. It is optimized for high-volume, cost-sensitive workloads with text, image, and video inputs, a 1M token context window, and a 66K token maximum output. The GA model uses the stable API ID gemini-3.1-flash-lite and replaces gemini-3.1-flash-lite-preview, which is scheduled to shut down on May 25, 2026. Pricing is $0.25 per 1M input tokens and $1.50 per 1M output tokens.