Gemma 3 4B Instruct

Name: Gemma 3 4B Instruct
Author: Google DeepMind

Released

2025-01-01

Last refreshed

2026-05-19

Status

Researched 44d ago

Open weightsCommercial use: conditionalLong context

Gemma 3 4B Instruct is worth evaluating for long context when its provider route and context window match the workload.

Use it for

Teams evaluating long context
Workloads that can use a 128k context window
Buyers comparing 1 tracked provider route

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: Gemma 3
Released: 2025-01-01
Context: 128k
Parameters: 4B
Architecture: Decoder Only
Knowledge cutoff: 2024-08
Specialization: general
Openness: Open weights
License: GemmaCommercial use: conditional
Training: Pretrained

Created by

Google DeepMind

Pioneering artificial intelligence research.

London, United Kingdom

Founded 2014

Website

Pricing

Output / 1M

$0.200

Input / 1M

$0.200

Cheapest of 1 route · Fireworks AI

Providers(1)

Fireworks AI

View 1 provider route

About

Gemma 3 4B Instruct is Google DeepMind's Gemma 3 model. It offers a 128K-token context window with weights openly available for self-hosting.

Gemma 3 4B Instruct is an open-weight model in the Gemma 3 family. The structured metadata tracks a 128k-token context window. This page tracks provider routes through Fireworks AI, with the cheapest tracked route listed at $0.2 input and $0.2 output per 1M tokens. No headline benchmark score is tracked for Gemma 3 4B Instruct yet.

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
Fireworks AI	$0.200	$0.200	Serverless

Available via routers & gateways(1)

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughFireworks AI