LLM Reference

Gemma 2B Instruct

About

Gemma 2B Instruct is a large language model developed by Google, designed to balance performance and accessibility with its 2 billion parameters. Derived from the Gemini family, it excels in tasks such as text generation, code interpretation, and mathematical problem-solving. Built on a transformer decoder architecture, it features multi-query attention, RoPE, GeGLU activations, and RMSNorm. Trained on approximately 6 trillion tokens, including web documents, code, and mathematical content, it uses SFT and RLHF for instruction-tuning. Notable for its lightweight design permitting deployment on consumer-grade hardware, it's open-source and optimized for dialogue applications. Despite its capabilities, limitations include potential biases, factual inaccuracies, and challenges with complex reasoning.

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(6)

ProviderInput (per 1M)Output (per 1M)Type
Together AI API$0.1$0.1Serverless
GCP Vertex AIServerless
Cloudflare Workers AIServerless
NVIDIA NIMProvisioned
Alibaba Cloud PAI-EASServerless
Replicate APIServerless

Specifications

FamilyGemma
Released2024-02-21
Parameters2B
Context2K
ArchitectureDecoder Only
Knowledge cutoff2023-04
Specializationgeneral