LLM Reference

Gemma 7B Instruct

About

Gemma 7B Instruct is a cutting-edge large language model developed by Google DeepMind, boasting 7 billion parameters. As part of the Gemma family, it benefits from the advanced research underpinning Google's Gemini models. This model is optimized for text generation tasks, excelling in areas like question answering and summarization, and it is finely tuned to follow instructions effectively. Despite its compact size, Gemma 7B Instruct performs impressively on benchmarks, making it versatile for deployment across various hardware platforms, from laptops to cloud infrastructure. Moreover, it is open-source, with accessible weights and incorporates responsible AI practices, such as data filtering and human feedback, to ensure safe and ethical use.

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(10)

ProviderInput (per 1M)Output (per 1M)Type
NVIDIA NIMProvisioned
GroqCloud$0.07$0.07Serverless
Snowflake Cortex$0.24$0.24Serverless
Fireworks AI PlatformProvisioned
Together AI API$0.2$0.2Serverless
Replicate APIServerless
GCP Vertex AIServerless
Cloudflare Workers AIServerless
Alibaba Cloud PAI-EASServerless
Lepton AI APIServerless

Benchmark Scores(4)

BenchmarkScoreVersionSource
Google-Proof Q&A50.8diamondresearch
HellaSwag89.210-shotresearch
HumanEval70.1pass@1research
Massive Multitask Language Understanding75.35-shotresearch

Specifications

FamilyGemma
Released2024-02-21
Parameters7B
Context8K
ArchitectureDecoder Only
Knowledge cutoff2023-04
Specializationgeneral