LLM ReferenceLLM Reference

Gemma 7B Instruct

gemma-7b-it

Open Source

About

Gemma 7B Instruct is a cutting-edge large language model developed by Google DeepMind, boasting 7 billion parameters. As part of the Gemma family, it benefits from the advanced research underpinning Google's Gemini models. This model is optimized for text generation tasks, excelling in areas like question answering and summarization, and it is finely tuned to follow instructions effectively. Despite its compact size, Gemma 7B Instruct performs impressively on benchmarks, making it versatile for deployment across various hardware platforms, from laptops to cloud infrastructure. Moreover, it is open-source, with accessible weights and incorporates responsible AI practices, such as data filtering and human feedback, to ensure safe and ethical use.

Gemma 7B Instruct has a 8K-token context window.

Gemma 7B Instruct input tokens at $0.05/1M, output at $0.25/1M.

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution

Providers(8)

Compare all →
ProviderInput (per 1M)Output (per 1M)Type
NVIDIA NIMProvisioned
Fireworks AI$0.20$0.20Provisioned
Together AI$0.2$0.2Serverless
GCP Vertex AI$0.10$0.30Serverless
Cloudflare Workers AIServerless
Alibaba Cloud PAI-EASServerless
Lepton AI API$0.07$0.07Serverless
Replicate API$0.05$0.25Serverless

Benchmark Scores(5)

BenchmarkScoreVersionSource
Google-Proof Q&A50.8diamondresearch
HellaSwag89.210-shotresearch
HumanEval70.1pass@1research
Massive Multitask Language Understanding75.35-shotresearch
Instruction-Following Evaluation42.6v2https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard

Rankings

Specifications

FamilyGemma
Released2024-02-21
Parameters7B
Context8K
ArchitectureDecoder Only
Knowledge cutoff2023-04
Specializationgeneral
Trainingfinetuned

Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014
Website