Gemma 7B Instruct on Replicate API

Gemma · Google DeepMind

Serverless

Pricing

Type             Price (per 1M tokens)
Input tokens     $0.05
Output tokens    $0.25
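
As a rough illustration of how the per-token prices translate into a per-request cost, the following minimal sketch multiplies token counts by the rates listed above. The helper name `estimate_cost_usd` is hypothetical and not part of any Replicate SDK, and the sketch assumes billing is strictly linear in token count, with no minimum charge or rounding.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing table above.
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.25")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token counts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example: a 2,000-token prompt that produces a 500-token reply.
    print(estimate_cost_usd(2_000, 500))
```

Because output tokens cost five times as much as input tokens at these rates, long generations tend to dominate the bill even when prompts are large.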

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Tool Use, JSON Mode, Code Execution

About Gemma 7B Instruct

Gemma 7B Instruct is a 7-billion-parameter large language model developed by Google DeepMind. It belongs to the Gemma family, which draws on the same research that underpins Google's Gemini models. The model is instruction-tuned for text generation tasks such as question answering and summarization. It performs well on benchmarks for its size and is compact enough to deploy on a range of hardware, from laptops to cloud infrastructure. Its weights are openly available, and its development incorporates responsible AI practices, such as data filtering and human feedback, intended to support safe and ethical use.

Model Specs

Released            2024-02-21
Parameters          7B
Context             8K tokens
Architecture        Decoder Only
Knowledge cutoff    2023-04
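
In a decoder-only model, the prompt and the generated text share one context window, so the room left for output shrinks as the prompt grows. The sketch below shows that budgeting arithmetic. It assumes that "8K" means 8,192 tokens, and the helper `max_output_tokens` is hypothetical; actual limits enforced by a hosted endpoint may differ.

```python
# Assumption: the "8K" context in the specs corresponds to 8,192 tokens.
CONTEXT_WINDOW = 8192


def max_output_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for generation after the prompt.

    Hypothetical helper: prompt and output are assumed to share one window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # A prompt longer than the window leaves no room for output.
    return max(context_window - prompt_tokens, 0)


if __name__ == "__main__":
    # Remaining generation budget for a 6,000-token prompt.
    print(max_output_tokens(6_000))
```

A check like this can be run before sending a request to decide whether a long document needs to be truncated or split.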

Related Models on Replicate API