RecurrentGemma 2B

Name: RecurrentGemma 2B
Author: Google DeepMind

Released

2024-04-09

Last refreshed

2026-05-19

Status

Researched 69d ago

DeprecatedOpen weightsCommercial use: conditional

RecurrentGemma 2B is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

Teams maintaining an existing integration
Workloads that can use a 4k context window
Buyers comparing 1 tracked provider route

Do not use it for

New production launches
Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: RecurrentGemma
Released: 2024-04-09
Context: 4k
Parameters: 2B
Architecture: Decoder Only
Specialization: general
Openness: Open weights
License: GemmaCommercial use: conditional
Weights: Unknown
Code: Unknown
Training: Fine-tuned

Created by

Google DeepMind

Pioneering artificial intelligence research.

London, United Kingdom

Founded 2014

Website

Pricing

Output / 1M

Input / 1M

Cheapest of 1 route · NVIDIA NIM

Providers(1)

NVIDIA NIM

View 1 provider route

About

RecurrentGemma 2B, developed by Google, leverages a novel Griffin architecture that integrates linear recurrences with local attention mechanisms to adeptly manage long sequences while minimizing memory usage. It excels in text generation tasks, such as question answering, summarization, and reasoning, by efficiently handling complex prompts and instructions. Available in both pre-trained and instruction-tuned versions, RecurrentGemma enhances usability in interactive applications like chatbots. Its open-source nature fosters transparency, enabling researchers to explore and innovate further. Performance-wise, it stands out with competitive results on benchmarks like HellaSwag and PIQA, marking a notable leap in natural language processing capabilities.

RecurrentGemma 2B is an open-weight model in the RecurrentGemma family. The structured metadata tracks a 4k-token context window. This page tracks provider routes through NVIDIA NIM. No headline benchmark score is tracked for RecurrentGemma 2B yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
NVIDIA NIM	-	-	ProvisionedPartial

Available via routers & gateways(1)

NVIDIA LLM Router Blueprint

Router

NVIDIA's open-source AI blueprint for LLM routing that selects the optimal model per prompt via intent classification or neural auto-routing; being deprecated 2026-06-20.

Free OSSNVIDIA NIM