LLM ReferenceLLM Reference

Gemma 2

8 models2024–2025Up to 8K ctxFrom $0.03/1M input

About

Gemma 2 is a series of cutting-edge, lightweight open large language models developed by Google. Leveraging the same foundational research as the Gemini models, Gemma 2 offers models with 2 billion, 9 billion, and 27 billion parameters. These decoder-only text-to-text models, primarily trained on English data, demonstrate strong capabilities in multilingual tasks. They come in both pre-trained and instruction-tuned versions, making them versatile for diverse text generation applications such as question answering, summarization, and reasoning. Smaller models are optimized for deployment on resource-limited devices, while the larger variants deliver competitive performance with efficiency innovations like alternating local and global attention, logit soft-capping, and grouped-query attention12. Additionally, Gemma 2 includes tools for facilitating responsible AI development3.

Specifications(8 models)

Gemma 2 model specifications comparison
ModelReleasedContextParametersStructured Outputs
Gemma 2 9B SahabatAI Instruct2025-018K9BNo
Gemma 2 2B2024-072BNo
Gemma 2 2B Instruct2024-072BNo
ShieldGemma 9B2024-078K9BNo
Gemma 2 27B Instruct2024-068K27BYes
Gemma 2 9B Instruct2024-068K9BYes
Gemma 2 27B2024-068K27BYes
Gemma 2 9B2024-068K9BYes

Available From(8 providers)

Pricing

Gemma 2 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Gemma 2 9B InstructOpenRouter$0.03$0.09Serverless
Gemma 2 9BGCP Vertex AI$0.06$0.18Serverless
Gemma 2 9BBitdeer AI$0.08$0.24Serverless
Gemma 2 27BBitdeer AI$0.08$0.24Serverless
Gemma 2 9B InstructChutes AI$0.1$0.3Serverless
Gemma 2 9B InstructReplicate API$0.1$0.1Serverless
Gemma 2 9B InstructFireworks AI$0.2$0.2Serverless
Gemma 2 9BFireworks AI$0.2$0.2Serverless
Gemma 2 27B InstructArcee AI$0.25$0.75Serverless
Gemma 2 27BGCP Vertex AI$0.3$0.9Serverless
Gemma 2 27B InstructReplicate API$0.4$0.4Serverless
Gemma 2 27B InstructOpenRouter$0.65$0.65Serverless
Gemma 2 27B InstructFireworks AI$0.9$0.9Serverless

Frequently Asked Questions

What is Gemma 2?
Gemma 2 is a series of cutting-edge, lightweight open large language models developed by Google. Leveraging the same foundational research as the Gemini models, Gemma 2 offers models with 2 billion, 9 billion, and 27 billion parameters. These decoder-only text-to-text models, primarily trained on English data, demonstrate strong capabilities in multilingual tasks. They come in both pre-trained and instruction-tuned versions, making them versatile for diverse text generation applications such as question answering, summarization, and reasoning. Smaller models are optimized for deployment on resource-limited devices, while the larger variants deliver competitive performance with efficiency innovations like alternating local and global attention, logit soft-capping, and grouped-query attention12. Additionally, Gemma 2 includes tools for facilitating responsible AI development3.
How many models are in the Gemma 2 family?
The Gemma 2 family contains 8 models.
What is the latest Gemma 2 model?
The latest model is Gemma 2 9B SahabatAI Instruct, released in 2025-01.
How much does Gemma 2 cost?
Gemma 2 models range from $0.03/1M to $0.9/1M input tokens depending on the model and provider.
Is Gemma 2 open source?
2 of 8 Gemma 2 models are open source.

Models(8)