LLM ReferenceLLM Reference
OpenRouter

Using Gemini 3.1 Flash-Lite on OpenRouter

Implementation guide · Gemini 3.1 · Google DeepMind

Serverless

Quick Start

  1. 1
    Create an account at OpenRouter and generate an API key.
  2. 2
    Use the OpenRouter SDK or REST API to call google/gemini-3.1-flash-lite — see the documentation for request format.
  3. 3
    You'll be billed $0.25/1M input, $1.50/1M output tokens. See full pricing.

Code Examples

See OpenRouter documentation for integration details.

About OpenRouter

OpenRouter provides a unified interface for Large Language Models with better pricing, improved uptime, and no subscription requirements. Route across providers for cost optimization and reliability.

OpenRouter is a multi-provider LLM aggregator offering unified API access to 300+ models from all major labs and emerging providers, with automatic failover for reliability.

Pricing on OpenRouter

TypePrice (per 1M)
Input tokens$0.25
Output tokens$1.50

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

About Gemini 3.1 Flash-Lite

Gemini 3.1 Flash-Lite is Google's generally available low-latency Gemini 3.1 model, launched May 7, 2026. It is optimized for high-volume, cost-sensitive workloads with text, image, and video inputs, a 1M token context window, and a 66K token maximum output. The GA model uses the stable API ID gemini-3.1-flash-lite and replaces gemini-3.1-flash-lite-preview, which is scheduled to shut down on May 25, 2026. Pricing is $0.25 per 1M input tokens and $1.50 per 1M output tokens.

Model Specs

Released2026-05-07
Context1M
ArchitectureDecoder Only
Knowledge cutoff2025-01

Provider

OpenRouter
OpenRouter

OpenRouter, Inc.

New York, NY, USA