LLM Reference

Gemma 2 Models by Google DeepMind

Google DeepMind · Gemma · Open weights · Highlight
8 models · 2024–2025 · Up to 8k ctx · From $0.06/1M input

Details

Researcher: Google DeepMind
License: Gemma
Commercial use: With conditions
Models: 8
Released: 2024–2025
Max context: 8k

Capabilities

Structured Outputs: 4 of 8 models

About

Gemma 2 is a series of cutting-edge, lightweight open large language models developed by Google. Leveraging the same foundational research as the Gemini models, Gemma 2 offers models with 2 billion, 9 billion, and 27 billion parameters. These decoder-only text-to-text models, primarily trained on English data, demonstrate strong capabilities in multilingual tasks. They come in both pre-trained and instruction-tuned versions, making them versatile for diverse text generation applications such as question answering, summarization, and reasoning. Smaller models are optimized for deployment on resource-limited devices, while the larger variants deliver competitive performance with efficiency innovations like alternating local and global attention, logit soft-capping, and grouped-query attention [1, 2]. Additionally, Gemma 2 includes tools for facilitating responsible AI development [3].
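
Of the efficiency techniques named above, logit soft-capping is the easiest to illustrate in isolation. The sketch below assumes the common formulation, cap * tanh(logit / cap), which squashes large values smoothly instead of clipping them. The function name `soft_cap` and the cap value of 30.0 are illustrative assumptions, not figures taken from this page.

```python
import math


def soft_cap(logit: float, cap: float) -> float:
    """Squash a logit smoothly so its magnitude never exceeds cap."""
    return cap * math.tanh(logit / cap)


# Illustrative cap value (an assumption for this sketch).
FINAL_LOGIT_CAP = 30.0

raw_logits = [-200.0, -5.0, 0.0, 5.0, 200.0]
capped = [soft_cap(x, FINAL_LOGIT_CAP) for x in raw_logits]

# Small logits pass through almost unchanged; large ones approach the cap.
for raw, out in zip(raw_logits, capped):
    print(f"{raw:8.1f} -> {out:8.3f}")
```

Because tanh is monotonic, the ordering of the logits is preserved; only their range is bounded.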

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Gemma 2 9B SahabatAI Instruct (Current)
Use when the workload needs 8k context and 9B parameters.
2025-01 · 8k context · 9B parameters

Gemma 2 2B (Current)
Use when the workload needs 8k context and 2B parameters.
2024-07 · 8k context · 2B parameters

Gemma 2 2B Instruct (Current)
Use when the workload needs 8k context and 2B parameters.
2024-07 · 8k context · 2B parameters

ShieldGemma 9B (Current)
Use when the workload needs safety, 8k context, and 9B parameters.
2024-07 · safety · 8k context · 9B parameters

Gemma 2 27B Instruct (Current)
Use when the workload needs 8k context, 27B parameters, and structured outputs.
2024-06 · 8k context · 27B parameters · structured outputs

Gemma 2 9B Instruct (Current)
Use when the workload needs 8k context, 9B parameters, and structured outputs.
2024-06 · 8k context · 9B parameters · structured outputs

Gemma 2 27B (Current)
Use when the workload needs 8k context, 27B parameters, and structured outputs.
2024-06 · 8k context · 27B parameters · structured outputs

Gemma 2 9B (Current)
Use when the workload needs 8k context, 9B parameters, and structured outputs.
2024-06 · 8k context · 9B parameters · structured outputs

Release Timeline

3 release groups

2025-01 (1 current)
Gemma 2 9B SahabatAI Instruct · 8k context · 9B parameters · Current

2024-07 (3 current)
Gemma 2 2B · 8k context · 2B parameters · Current
Gemma 2 2B Instruct · 8k context · 2B parameters · Current
ShieldGemma 9B · safety · 8k context · 9B parameters · Current

2024-06 (4 current)
Gemma 2 27B · 8k context · 27B parameters · structured outputs · Current
Gemma 2 27B Instruct · 8k context · 27B parameters · structured outputs · Current
Gemma 2 9B · 8k context · 9B parameters · structured outputs · Current
Gemma 2 9B Instruct · 8k context · 9B parameters · structured outputs · Current

Specifications(8 models)

Gemma 2 model specifications comparison
Model | Released | Context | Parameters | Structured Outputs
Gemma 2 9B SahabatAI Instruct | 2025-01 | 8k | 9B | No
Gemma 2 2B | 2024-07 | 8k | 2B | No
Gemma 2 2B Instruct | 2024-07 | 8k | 2B | No
ShieldGemma 9B | 2024-07 | 8k | 9B | No
Gemma 2 27B Instruct | 2024-06 | 8k | 27B | Yes
Gemma 2 9B Instruct | 2024-06 | 8k | 9B | Yes
Gemma 2 27B | 2024-06 | 8k | 27B | Yes
Gemma 2 9B | 2024-06 | 8k | 9B | Yes
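
The specification rows above can be filtered programmatically when choosing a variant. The following is a minimal sketch that encodes the table as data and selects variants by a parameter budget and an optional structured-outputs requirement. The `Variant` class and `pick_variants` function are hypothetical names introduced for illustration; only the row values come from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    name: str
    released: str           # YYYY-MM, as listed in the table
    params_b: int           # parameter count in billions
    structured_outputs: bool


# Rows copied from the specifications table, in table order.
VARIANTS = [
    Variant("Gemma 2 9B SahabatAI Instruct", "2025-01", 9, False),
    Variant("Gemma 2 2B", "2024-07", 2, False),
    Variant("Gemma 2 2B Instruct", "2024-07", 2, False),
    Variant("ShieldGemma 9B", "2024-07", 9, False),
    Variant("Gemma 2 27B Instruct", "2024-06", 27, True),
    Variant("Gemma 2 9B Instruct", "2024-06", 9, True),
    Variant("Gemma 2 27B", "2024-06", 27, True),
    Variant("Gemma 2 9B", "2024-06", 9, True),
]


def pick_variants(max_params_b: int, need_structured_outputs: bool = False):
    """Return variants within the parameter budget, largest first."""
    matches = [
        v for v in VARIANTS
        if v.params_b <= max_params_b
        and (v.structured_outputs or not need_structured_outputs)
    ]
    # sorted() is stable, so ties keep their table order.
    return sorted(matches, key=lambda v: v.params_b, reverse=True)


for v in pick_variants(max_params_b=9, need_structured_outputs=True):
    print(v.name, v.released)
```

All eight variants share the same 8k context, so context length is not a useful filter within this family.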

Pricing

Gemma 2 model pricing by provider
Model | Provider | Input / 1M tokens | Output / 1M tokens | Type
Gemma 2 9B | GCP Vertex AI | $0.06 | $0.18 | Serverless
Gemma 2 9B | Bitdeer AI | $0.08 | $0.24 | Serverless
Gemma 2 27B | Bitdeer AI | $0.08 | $0.24 | Serverless
Gemma 2 9B Instruct | Chutes AI | $0.10 | $0.30 | Serverless
Gemma 2 9B Instruct | Replicate API | $0.10 | $0.10 | Serverless
Gemma 2 9B Instruct | Fireworks AI | $0.20 | $0.20 | Serverless
Gemma 2 9B | Fireworks AI | $0.20 | $0.20 | Serverless
Gemma 2 27B Instruct | Arcee AI | $0.25 | $0.75 | Serverless
Gemma 2 27B | GCP Vertex AI | $0.30 | $0.90 | Serverless
Gemma 2 27B Instruct | Replicate API | $0.40 | $0.40 | Serverless
Gemma 2 27B Instruct | OpenRouter | $0.65 | $0.65 | Serverless
Gemma 2 27B Instruct | Fireworks AI | $0.90 | $0.90 | Serverless
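
Per-million-token prices translate into request costs with simple arithmetic: tokens times price, divided by one million, summed over input and output. The sketch below applies that to the listed prices and shows that the provider with the lowest input price is not always the cheapest once output tokens are counted. The names `PRICES`, `request_cost`, and `cheapest_provider` are hypothetical helpers for illustration, and listed prices may change over time.

```python
# (model, provider) -> (input $ per 1M tokens, output $ per 1M tokens),
# copied from the pricing table.
PRICES = {
    ("Gemma 2 9B", "GCP Vertex AI"): (0.06, 0.18),
    ("Gemma 2 9B", "Bitdeer AI"): (0.08, 0.24),
    ("Gemma 2 27B", "Bitdeer AI"): (0.08, 0.24),
    ("Gemma 2 9B Instruct", "Chutes AI"): (0.10, 0.30),
    ("Gemma 2 9B Instruct", "Replicate API"): (0.10, 0.10),
    ("Gemma 2 9B Instruct", "Fireworks AI"): (0.20, 0.20),
    ("Gemma 2 9B", "Fireworks AI"): (0.20, 0.20),
    ("Gemma 2 27B Instruct", "Arcee AI"): (0.25, 0.75),
    ("Gemma 2 27B", "GCP Vertex AI"): (0.30, 0.90),
    ("Gemma 2 27B Instruct", "Replicate API"): (0.40, 0.40),
    ("Gemma 2 27B Instruct", "OpenRouter"): (0.65, 0.65),
    ("Gemma 2 27B Instruct", "Fireworks AI"): (0.90, 0.90),
}


def request_cost(model: str, provider: str,
                 input_tokens: int, output_tokens: int) -> float:
    """Cost in dollars for one request at the listed per-1M-token prices."""
    in_price, out_price = PRICES[(model, provider)]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def cheapest_provider(model: str, input_tokens: int, output_tokens: int):
    """Return (provider, cost) with the lowest cost for this token mix."""
    costs = {
        provider: request_cost(m, provider, input_tokens, output_tokens)
        for (m, provider) in PRICES
        if m == model
    }
    if not costs:
        raise KeyError(f"no listed provider for {model!r}")
    return min(costs.items(), key=lambda item: item[1])


# 6,000 input + 1,000 output tokens on Gemma 2 9B via GCP Vertex AI.
print(request_cost("Gemma 2 9B", "GCP Vertex AI", 6_000, 1_000))

# Balanced versus input-only workloads can favor different providers.
print(cheapest_provider("Gemma 2 27B Instruct", 1_000_000, 1_000_000))
print(cheapest_provider("Gemma 2 27B Instruct", 1_000_000, 0))
```

For workloads that generate a lot of output, providers with a flat input/output price can undercut those with a lower input price but a higher output price.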

Frequently Asked Questions

What is Gemma 2 used for?
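Gemma 2 is used for text generation tasks such as question answering, summarization, and reasoning, per the family description. The listed model capabilities add two narrower fits: safety (ShieldGemma 9B) and structured outputs (the 9B and 27B models released in 2024-06).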
How does Gemma 2 compare to Gemma 4?
Both families come from Google DeepMind. Gemma 2 is the better fit where you need safety, while Gemma 4 is the closest related family to check for multimodal workloads. Gemma 2 has 8 listed variants and reaches up to 8k context, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
Which Gemma 2 model should I use?
For the lowest listed input price, start with Gemma 2 9B through GCP Vertex AI at $0.06/1M input tokens. For the most capable choice, evaluate Gemma 2 27B Instruct, the largest instruction-tuned variant listed, with 8k context and structured outputs.