LLM Reference

Gemma 1.1 7B Instruct on DeepInfra

Gemma · Google DeepMind

Serverless

Pricing

TypePrice (per 1M)
Input tokens$0.05
Output tokens$0.15

Capabilities

VisionMultimodalReasoningFunction CallingTool UseJSON ModeCode Execution

About Gemma 1.1 7B Instruct

The Gemma 1.1 7B Instruct model is a cutting-edge, lightweight large language model developed by Google. As a part of the Gemma model family, it benefits from the same foundational research and technological advancements as Google's Gemini models. Unique to this model is its instruction-tuned training, which allows it to follow directives with greater precision than its base variants. Despite its compact size of 7 billion parameters, making it suitable for deployment on resource-constrained devices like desktops, it excels in diverse tasks including question answering, summarization, logical reasoning, and coding assistance. The model employs a transformer-based, decoder-only architecture, trained on an extensive dataset with an innovative use of Reinforcement Learning from Human Feedback (RLHF) to enhance its quality, factuality, and conversational capabilities. It supports multiple precision levels and is openly available, promoting collaboration in the AI community. Nonetheless, it shares common LLM limitations like potential data biases and factual inaccuracies, which are addressed through guidelines for responsible use.

Get Started

Model Specs

Released2024-02-21
Parameters7B
Context8K
ArchitectureDecoder Only
Knowledge cutoff2023-04