LLM Reference

Gemma 2 9B Instruct

gemma-2-9b-it

Open Source

About

Gemma 2 9B Instruct, developed by Google, is a large language model built from the same research and technology as the Gemini models. It is a decoder-only transformer with 9 billion parameters, offering a balance between size and performance. The model was trained on 8 trillion tokens of web documents, code, and mathematical text, roughly 30% more than its predecessor, Gemma 1.1. This breadth of training data lets it handle tasks such as question answering, creative writing, coding, and mathematical problem-solving. It shares the common limitations of large language models, including potential biases and the risk of generating inaccurate or outdated information. Architecturally, Gemma 2 9B Instruct uses Grouped-Query Attention (GQA) and the GeGLU activation function, and it is fine-tuned to follow instructions and take part in multi-turn dialogues.
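To make the GeGLU activation mentioned above more concrete, here is a minimal pure-Python sketch of the gating step: one projection of the input is passed through GELU and multiplied element-wise with a second projection. The function names `gelu` and `geglu` are illustrative, the inputs stand in for already-projected vectors, and the exact (erf-based) GELU is an assumption; the model's own implementation may use an approximation.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def geglu(gate: list[float], up: list[float]) -> list[float]:
    """Element-wise GeGLU gating: GELU(gate) * up.

    `gate` and `up` stand in for two linear projections of the same
    input vector (hypothetical values, not real model weights).
    """
    if len(gate) != len(up):
        raise ValueError("gate and up must have the same length")
    return [gelu(g) * u for g, u in zip(gate, up)]


# A zero gate suppresses the value; a positive gate lets most of it through.
hidden = geglu([0.0, 1.0, -1.0], [2.0, 2.0, 2.0])
```

In a full feed-forward block the gated vector would then pass through a further linear projection back to the model dimension; that step is omitted here.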

Gemma 2 9B Instruct has an 8K-token context window.
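Because the prompt and the generated reply share the same context window, it can help to budget tokens before sending a request. The sketch below assumes that "8K" means 8,192 tokens and that the prompt's token count is already known; `max_output_tokens` is a hypothetical helper, not part of any provider's API.

```python
CONTEXT_WINDOW = 8192  # assumption: "8K" is taken to mean 8,192 tokens


def max_output_tokens(prompt_tokens: int, context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for the reply after the prompt."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # Clamp at zero: a prompt that already fills the window leaves no room.
    return max(context_window - prompt_tokens, 0)


remaining = max_output_tokens(6000)
```

A result of zero signals that the prompt must be shortened or truncated before the model can generate anything.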

Gemma 2 9B Instruct pricing starts at $0.03 per 1M input tokens and $0.09 per 1M output tokens.

Capabilities

Vision · Multimodal · Reasoning · Function Calling · Tool Use · Structured Outputs · Code Execution

Providers (5)

Provider | Input (per 1M tokens) | Output (per 1M tokens) | Type
Fireworks AI | $0.20 | $0.20 | Serverless
NVIDIA NIM | - | - | Provisioned
OpenRouter | $0.03 | $0.09 | Serverless
Chutes AI | $0.10 | $0.30 | Serverless
Replicate API | $0.10 | $0.10 | Serverless
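As a rough illustration of how the per-million-token prices translate into request costs, the following sketch computes the cost of a workload for each serverless provider listed above and picks the cheapest. The prices are copied from the listing and may change; NVIDIA NIM is left out because no per-token price is given. The names `PRICES`, `request_cost`, and `cheapest` are illustrative only.

```python
# USD per 1M tokens as (input, output), copied from the provider listing.
PRICES = {
    "Fireworks AI": (0.20, 0.20),
    "OpenRouter": (0.03, 0.09),
    "Chutes AI": (0.10, 0.30),
    "Replicate API": (0.10, 0.10),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one workload at the given provider's listed prices."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Name of the provider with the lowest cost for this token mix."""
    return min(PRICES, key=lambda p: request_cost(p, input_tokens, output_tokens))


# Example workload: 2M input tokens and 500K output tokens.
for name in PRICES:
    print(f"{name}: ${request_cost(name, 2_000_000, 500_000):.3f}")
```

Since input and output are priced separately, the cheapest provider can depend on the ratio of prompt to completion tokens, so it is worth evaluating with a token mix that resembles the real workload.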

Benchmark Scores (1)

Benchmark | Score | Version | Source
Instruction-Following Evaluation | 65.5 | v2 | https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard

Specifications

Family: Gemma 2
Released: 2024-06-27
Parameters: 9B
Context: 8K
Architecture: Decoder Only
Specialization: general
Training: finetuned

Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014