LLM ReferenceLLM Reference
NVIDIA NIM

RecurrentGemma 2B on NVIDIA NIM

RecurrentGemma · Google DeepMind

ProvisionedOpen Source

Last refreshed 2026-05-01. Next refresh: weekly.

Why use RecurrentGemma 2B on NVIDIA NIM?

NVIDIA NIM offers RecurrentGemma 2B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: recurrentgemma-2b
Model ID
recurrentgemma-2b

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID recurrentgemma-2b.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr
GPU Config1xH100

Capabilities

No model capability flags are currently sourced.

About RecurrentGemma 2B

RecurrentGemma 2B, developed by Google, leverages a novel Griffin architecture that integrates linear recurrences with local attention mechanisms to adeptly manage long sequences while minimizing memory usage. It excels in text generation tasks, such as question answering, summarization, and reasoning, by efficiently handling complex prompts and instructions. Available in both pre-trained and instruction-tuned versions, RecurrentGemma enhances usability in interactive applications like chatbots. Its open-source nature fosters transparency, enabling researchers to explore and innovate further. Performance-wise, it stands out with competitive results on benchmarks like HellaSwag and PIQA, marking a notable leap in natural language processing capabilities.

FAQ

Who created RecurrentGemma 2B?

RecurrentGemma 2B was created by Google DeepMind as part of the RecurrentGemma model family.

Is RecurrentGemma 2B open source?

RecurrentGemma 2B is open source according to the seed data.

Get Started

Model Specs

Released2024-04-09
Parameters2B
ArchitectureDecoder Only