LLM Reference
Replicate API

Gemini 3.1 Pro on Replicate API

Gemini 3.1 · Google DeepMind

Serverless

Pricing

TypePrice (per 1M)
Input tokens$2.00
Output tokens$12.00

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Gemini 3.1 Pro

Google DeepMind's Gemini 3.1 Pro preview model (released Feb 19, 2026), refining Gemini 3 Pro with better thinking, improved token efficiency, enhanced factual consistency, and optimizations for software engineering and agentic workflows. Supports text, image, video, audio, and PDF inputs; text output only. 1M token input context window (65,536 output max). Knowledge cutoff: January 2025. Includes batch API, caching, code execution, function calling, search grounding, structured outputs, and thinking capabilities.

Get Started

Model Specs

Released2026-02-19
Context1M
ArchitectureDecoder Only
Knowledge cutoff2025-01