LLM Reference

RecurrentGemma Models by Google DeepMind

Google DeepMindGemmaOpen weights
2 models2024Up to 4k ctx

Details

ResearcherGoogle DeepMind
LicenseGemma
Commercial useCommercial use with conditions
Models2
Released2024
Max context4k

About

RecurrentGemma is a family of open-weight language models developed by Google DeepMind, known for their cutting-edge Griffin architecture. This hybrid design blends linear recurrences with local attention mechanisms, allowing the models to excel in a range of language tasks with reduced memory overhead and efficient inference, especially on lengthy sequences. Unlike traditional transformer models that require memory scaling linearly with sequence length, RecurrentGemma maintains a fixed-sized state, resulting in faster processing speeds. Both pre-trained and instruction-tuned variants are available, the latter being tailored for tasks like dialogue and instruction following. Accessible through platforms like Hugging Face and Kaggle, RecurrentGemma-2B achieves performance akin to Gemma-2B despite being trained on fewer tokens, demonstrating its efficiency and versatility 23910.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

1 in view1 retired

Use when the workload needs 4k context and 9B parameters.

2024-064k context9B parameters

Release Timeline

2 release groups
2024-06
1 current
RecurrentGemma 9B
4k context9B parameters
Current
2024-04
1 retired
RecurrentGemma 2B
4k context2B parameters
Archived

Specifications(2 models)

RecurrentGemma model specifications comparison
ModelReleasedContextParameters
RecurrentGemma 9B2024-064k9B

Available From(1 provider)

Frequently Asked Questions

What is RecurrentGemma used for?
RecurrentGemma is used for chatbot and role-playing use cases. The family description and listed model capabilities point to those workloads as the best fit.
How does RecurrentGemma compare to Gemma 4?
RecurrentGemma by Google DeepMind is strongest where you need chatbot and role-playing use cases, while Gemma 4 by Google DeepMind is the closest related family to check for multimodal. RecurrentGemma has 2 listed variants and reaches up to 4k context, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
Which RecurrentGemma model should I use?
If price is the main constraint, use the pricing table first because RecurrentGemma does not have complete provider pricing in the local data. For the most capable/latest local choice, evaluate RecurrentGemma 9B with 4k context.

Models(2)