LLM Reference

Gemma 2B Instruct

Released
2024-02-21
Last refreshed
2026-06-15
Status
Researched 60d ago
DeprecatedOpen weightsCommercial use: conditionalClassificationJSON / Tool use

Gemma 2B Instruct is a legacy integration reference; keep it only while you identify a current replacement.

Use it for

  • Teams maintaining an existing integration
  • Workloads that can use a 2k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • New production launches
  • Vision or document-understanding workloads
Specifications
Family
Gemma
Released
2024-02-21
Context
2k
Parameters
2B
Architecture
Decoder Only
Knowledge cutoff
2023-04
Specialization
general
Openness
Open weights
License
GemmaCommercial use: conditional
Training
Fine-tuned
Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014
Website
Pricing
Output / 1M
$0.100
Input / 1M
$0.100

Cheapest of 7 routes · Fireworks AI

About

Gemma 2B Instruct is a large language model developed by Google, designed to balance performance and accessibility with its 2 billion parameters. Derived from the Gemini family, it excels in tasks such as text generation, code interpretation, and mathematical problem-solving. Built on a transformer decoder architecture, it features multi-query attention, RoPE, GeGLU activations, and RMSNorm. Trained on approximately 6 trillion tokens, including web documents, code, and mathematical content, it uses SFT and RLHF for instruction-tuning. Notable for its lightweight design permitting deployment on consumer-grade hardware, it's open-source and optimized for dialogue applications. Despite its capabilities, limitations include potential biases, factual inaccuracies, and challenges with complex reasoning.

Gemma 2B Instruct is an open-weight model in the Gemma family. The structured metadata tracks a 2k-token context window and structured outputs. This page tracks provider routes through Together AI, GCP Vertex AI, Cloudflare Workers AI, and 4 more, with the cheapest tracked route listed at $0.04 input and $0.12 output per 1M tokens. No headline benchmark score is tracked for Gemma 2B Instruct yet.

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 7

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.100$0.100
Serverless
Together AI$0.100$0.100
Serverless
GCP Vertex AI$0.040$0.120
Serverless
Replicate API$0.050$0.250
Serverless

Available via routers & gateways(14)

Capabilities

Structured Outputs

Benchmark peer barsfor Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.