Gemma
About
The Gemma family of large language models (LLMs) represents a series of advanced open models developed by Google. These lightweight models harness the cutting-edge research and technologies utilized in the Gemini models and are tailored for diverse natural language processing tasks. Gemma offers two model sizes: a 2 billion parameter version compatible with CPU and on-device environments, and a 7 billion parameter model primed for GPU and TPU platforms. Both sizes come in pre-trained and instruction-tuned forms, ensuring flexibility in their deployment. Designed for accessibility, the models support major AI frameworks and hardware platforms, embodying Google's commitment to responsible AI development with integrated safety measures and risk mitigation tools 1 5 6.
Specifications(12 models)
| Model | Released | Context | Parameters | Structured Outputs |
|---|---|---|---|---|
| Gemma 7B Instruct | 2024-02 | 8K | 7B | Yes |
| Gemma 1.1 7B Instruct | 2024-02 | 8K | 7B | Yes |
| Gemma 1.1 2B Instruct | 2024-02 | 2K | 2B | No |
| Gemma 7B | 2024-02 | 8K | 7B | Yes |
| Gemma 2B | 2024-02 | 2K | 2B | No |
| Together AI Gemma-7B-it | 2024-02 | 8K | 7B | Yes |
| OctoML Gemma-7B-it | 2024-02 | 8K | 7B | No |
| OctoML Gemma-2B-it | 2024-02 | 8K | 2B | No |
| Gemma 7B on Google Vertex AI | 2024-02 | 8K | 7B | Yes |
| DeepInfra Google Gemma 7B | 2024-02 | 8K | 7B | Yes |
| DeepInfra Google Gemma 2B | 2024-02 | 8K | 2B | Yes |
Available From(10 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Gemma 1.1 7B Instruct | DeepInfra | $0.05 | $0.15 | Serverless |
| DeepInfra Google Gemma 7B | DeepInfra | $0.05 | $0.15 | Serverless |
| DeepInfra Google Gemma 2B | DeepInfra | $0.05 | $0.15 | Serverless |
| Gemma 7B Instruct | Replicate API | $0.05 | $0.25 | Serverless |
| Gemma 7B Instruct | Lepton AI API | $0.07 | $0.07 | Serverless |
| Gemma 7B Instruct | GCP Vertex AI | $0.1 | $0.3 | Serverless |
| OctoML Gemma-2B-it | OctoML | $0.1 | $0.15 | Serverless |
| Gemma 7B | GCP Vertex AI | $0.1 | $0.3 | Serverless |
| Gemma 7B on Google Vertex AI | GCP Vertex AI | $0.125 | $0.375 | Serverless |
| Together AI Gemma-7B-it | Together AI | $0.15 | $0.15 | Serverless |
| OctoML Gemma-7B-it | OctoML | $0.15 | $0.2 | Serverless |
| Gemma 7B Instruct | Fireworks AI | $0.2 | $0.2 | Provisioned |
| Gemma 7B Instruct | Together AI | $0.2 | $0.2 | Serverless |
| Gemma 7B | Fireworks AI | $0.2 | $0.2 | Serverless |
Frequently Asked Questions
- What is Gemma?
- The Gemma family of large language models (LLMs) represents a series of advanced open models developed by Google. These lightweight models harness the cutting-edge research and technologies utilized in the Gemini models and are tailored for diverse natural language processing tasks. Gemma offers two model sizes: a 2 billion parameter version compatible with CPU and on-device environments, and a 7 billion parameter model primed for GPU and TPU platforms. Both sizes come in pre-trained and instruction-tuned forms, ensuring flexibility in their deployment. Designed for accessibility, the models support major AI frameworks and hardware platforms, embodying Google's commitment to responsible AI development with integrated safety measures and risk mitigation tools 1 5 6.
- How many models are in the Gemma family?
- The Gemma family contains 12 models.
- What is the latest Gemma model?
- The latest model is Gemma 7B Instruct, released in 2024-02.
- How much does Gemma cost?
- Gemma models range from $0.04/1M to $0.2/1M input tokens depending on the model and provider.
Models(12)
Gemma 7B Instruct
Gemma 1.1 7B Instruct
Gemma 1.1 2B Instruct
Gemma 7B
Gemma 2B
Together AI Gemma-7B-it
OctoML Gemma-7B-it
OctoML Gemma-2B-it
Gemma 7B on Google Vertex AI
DeepInfra Google Gemma 7B
DeepInfra Google Gemma 2B






