LLM Reference

Gemma

Google DeepMind | Gemma | Open Source | Highlight
12 models | 2024 | Up to 8K ctx | From $0.04/1M input

About

Gemma is a family of lightweight open large language models (LLMs) developed by Google. The models build on the research and technology behind the Gemini models and target a range of natural language processing tasks. Gemma comes in two sizes: a 2 billion parameter model suited to CPU and on-device environments, and a 7 billion parameter model intended for GPU and TPU platforms. Both sizes are available in pre-trained and instruction-tuned variants. The models support major AI frameworks and hardware platforms, and they ship with safety measures and risk mitigation tools as part of Google's approach to responsible AI development [1][5][6].

Specifications (12 models)

Gemma model specifications comparison

Model | Released | Context | Parameters | Structured Outputs
Gemma 7B Instruct | 2024-02 | 8K | 7B | Yes
Gemma 1.1 7B Instruct | 2024-02 | 8K | 7B | Yes
Gemma 1.1 2B Instruct | 2024-02 | 2K | 2B | No
Gemma 7B | 2024-02 | 8K | 7B | Yes
Gemma 2B | 2024-02 | 2K | 2B | No
Together AI Gemma-7B-it | 2024-02 | 8K | 7B | Yes
OctoML Gemma-7B-it | 2024-02 | 8K | 7B | No
OctoML Gemma-2B-it | 2024-02 | 8K | 2B | No
Gemma 7B on Google Vertex AI | 2024-02 | 8K | 7B | Yes
DeepInfra Google Gemma 7B | 2024-02 | 8K | 7B | Yes
DeepInfra Google Gemma 2B | 2024-02 | 8K | 2B | Yes

Available From (10 providers)

Pricing

Gemma model pricing by provider (USD per 1M tokens)

Model | Provider | Input / 1M | Output / 1M | Type
Gemma 1.1 7B Instruct | DeepInfra | $0.05 | $0.15 | Serverless
DeepInfra Google Gemma 7B | DeepInfra | $0.05 | $0.15 | Serverless
DeepInfra Google Gemma 2B | DeepInfra | $0.05 | $0.15 | Serverless
Gemma 7B Instruct | Replicate API | $0.05 | $0.25 | Serverless
Gemma 7B Instruct | Lepton AI API | $0.07 | $0.07 | Serverless
Gemma 7B Instruct | GCP Vertex AI | $0.1 | $0.3 | Serverless
OctoML Gemma-2B-it | OctoML | $0.1 | $0.15 | Serverless
Gemma 7B | GCP Vertex AI | $0.1 | $0.3 | Serverless
Gemma 7B on Google Vertex AI | GCP Vertex AI | $0.125 | $0.375 | Serverless
Together AI Gemma-7B-it | Together AI | $0.15 | $0.15 | Serverless
OctoML Gemma-7B-it | OctoML | $0.15 | $0.2 | Serverless
Gemma 7B Instruct | Fireworks AI | $0.2 | $0.2 | Provisioned
Gemma 7B Instruct | Together AI | $0.2 | $0.2 | Serverless
Gemma 7B | Fireworks AI | $0.2 | $0.2 | Serverless
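
Because input and output tokens are billed at different rates, the cheapest provider for a given model depends on the mix of input and output tokens in a workload. The sketch below illustrates that arithmetic using a few Gemma 7B Instruct rows copied from the pricing table. The names `RATES`, `estimate_cost`, and `cheapest_provider` are hypothetical helpers invented for this example, not part of any provider's API, and the rates are only as current as the table itself.

```python
# Per-1M-token rates in USD as (input_rate, output_rate), copied from
# the Gemma 7B Instruct rows of the pricing table. Illustrative subset only.
RATES = {
    ("Gemma 7B Instruct", "Replicate API"): (0.05, 0.25),
    ("Gemma 7B Instruct", "Lepton AI API"): (0.07, 0.07),
    ("Gemma 7B Instruct", "GCP Vertex AI"): (0.10, 0.30),
    ("Gemma 7B Instruct", "Together AI"): (0.20, 0.20),
}


def estimate_cost(model, provider, input_tokens, output_tokens):
    """Estimate the USD cost of a workload (hypothetical helper)."""
    input_rate, output_rate = RATES[(model, provider)]
    # Rates are quoted per 1M tokens, so scale the token counts down.
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


def cheapest_provider(model, input_tokens, output_tokens):
    """Return the provider with the lowest estimated cost for this model."""
    costs = {
        provider: estimate_cost(model, provider, input_tokens, output_tokens)
        for (name, provider) in RATES
        if name == model
    }
    return min(costs, key=costs.get)


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500K output tokens.
    for (name, provider) in RATES:
        cost = estimate_cost(name, provider, 2_000_000, 500_000)
        print(f"{name} via {provider}: ${cost:.3f}")
    print("Cheapest:", cheapest_provider("Gemma 7B Instruct", 2_000_000, 500_000))
```

With these rates, a provider with a low input price but a higher output price can be the cheapest option for input-heavy workloads and lose that position as the share of output tokens grows, so it is worth estimating against a realistic token mix rather than comparing input prices alone.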

Frequently Asked Questions

What is Gemma?
Gemma is a family of lightweight open large language models (LLMs) developed by Google, built on the research and technology behind the Gemini models. It comes in 2 billion and 7 billion parameter sizes, each available in pre-trained and instruction-tuned variants [1][5][6].
How many models are in the Gemma family?
The Gemma family contains 12 models.
What is the latest Gemma model?
The latest model is Gemma 7B Instruct, released in 2024-02.
How much does Gemma cost?
Gemma input pricing ranges from $0.04 to $0.20 per 1M input tokens, depending on the model and provider.

Models (12)