LLM Reference

Gemma 3 4B IT

Released: 2026-01-01
Last refreshed: 2026-06-15
Status: Researched 44d ago
Open weights · Commercial use: conditional · RAG · Long context · Classification · JSON / Tool use

Gemma 3 4B IT is worth evaluating for RAG, long-context, and classification workloads, provided an available provider route and the 128k context window fit the workload.

Use it for

  • Teams evaluating RAG, long-context, and classification workloads
  • Workloads that can use a 128k context window
  • Buyers comparing 3 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads

Specifications

Family: Gemma 3
Released: 2026-01-01
Context: 128k
Parameters: 4B
Knowledge cutoff: 2024-08
Openness: Open weights
License: Gemma (commercial use: conditional)
Weights: Unknown
Code: Unknown

Created by

Google DeepMind
Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014

Pricing

Input / 1M: $0.040
Output / 1M: $0.080

Cheapest of 3 routes · GCP Vertex AI

About

Gemma 3 4B IT is a 4B-parameter instruction-tuned model from Google DeepMind. Its knowledge cutoff is 2024-08.

Gemma 3 4B IT is an open-weight model in the Gemma 3 family. The structured metadata tracks a 128k-token context window and structured outputs. This page tracks provider routes through AWS Bedrock, OpenRouter, and GCP Vertex AI, with the cheapest tracked route listed at $0.04 input and $0.08 output per 1M tokens. No headline benchmark score is tracked for Gemma 3 4B IT yet.

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Classification

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

Provider        Input / 1M   Output / 1M   Route
GCP Vertex AI   $0.040       $0.080        Serverless
OpenRouter      $0.040       $0.080        Serverless
AWS Bedrock     $0.200       $0.200        Serverless
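
To turn the per-1M-token prices above into a workload estimate, the arithmetic can be sketched as follows. This is a minimal illustration, not a billing tool: the prices are copied from the ladder on this page, while the function names `estimate_cost` and `cheapest_route` and the example token volumes are hypothetical. It ignores batch and cached-read discounts, which this page does not quantify.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the price ladder on this page.
PRICES_PER_MILLION = {
    "GCP Vertex AI": (Decimal("0.040"), Decimal("0.080")),
    "OpenRouter": (Decimal("0.040"), Decimal("0.080")),
    "AWS Bedrock": (Decimal("0.200"), Decimal("0.200")),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost for one provider route (hypothetical helper)."""
    input_price, output_price = PRICES_PER_MILLION[provider]
    total = input_price * input_tokens + output_price * output_tokens
    return total / ONE_MILLION


def cheapest_route(input_tokens: int, output_tokens: int) -> str:
    """Return the provider with the lowest estimate.

    Ties go to the first provider listed, so GCP Vertex AI wins
    over OpenRouter when both are priced identically.
    """
    return min(
        PRICES_PER_MILLION,
        key=lambda provider: estimate_cost(provider, input_tokens, output_tokens),
    )


if __name__ == "__main__":
    # Assumed monthly volume, for illustration only.
    monthly_input, monthly_output = 10_000_000, 2_000_000
    for name in PRICES_PER_MILLION:
        cost = estimate_cost(name, monthly_input, monthly_output)
        print(f"{name}: ${cost:.2f}")
    print("Cheapest:", cheapest_route(monthly_input, monthly_output))
```

`Decimal` is used instead of floats so that small per-token prices do not accumulate rounding error across large volumes.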

Available via routers & gateways (14)

Capabilities

Structured Outputs

Benchmark peer bars for RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Frequently asked questions

What is the context window of Gemma 3 4B IT?

Gemma 3 4B IT has a context window of 128k tokens.
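
For RAG and long-context planning, a quick budget check shows whether a request fits the window. The sketch below rests on two assumptions not stated on this page: that "128k" means 128,000 tokens (some providers may expose 131,072 or a lower limit), and that prompt and generated tokens share the same window. The helper names and the example token counts are hypothetical; real token counts should come from the tokenizer of the chosen provider.

```python
# Assumption: "128k" is read as 128,000 tokens; adjust to the provider's documented limit.
CONTEXT_WINDOW_TOKENS = 128_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if the prompt plus the reserved output budget fits in the window."""
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


def max_retrieved_chunks(system_tokens: int, question_tokens: int,
                         chunk_tokens: int, max_output_tokens: int,
                         window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Return how many equally sized retrieved chunks fit after fixed costs."""
    if chunk_tokens <= 0:
        raise ValueError("chunk_tokens must be positive")
    remaining = window - system_tokens - question_tokens - max_output_tokens
    return max(0, remaining // chunk_tokens)


if __name__ == "__main__":
    # Illustrative numbers only: a 500-token system prompt, a 100-token question,
    # 800-token chunks, and 1,024 tokens reserved for the answer.
    print(fits_context(prompt_tokens=120_000, max_output_tokens=8_000))
    print(max_retrieved_chunks(500, 100, 800, 1024))
```

Reserving the output budget up front avoids requests that are accepted but truncated mid-answer.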

How much does Gemma 3 4B IT cost?

Gemma 3 4B IT input pricing ranges from $0.04 to $0.20 per 1M tokens, and output pricing from $0.08 to $0.20 per 1M tokens, depending on the provider.

When was Gemma 3 4B IT released?

Gemma 3 4B IT was released on 2026-01-01.

Which providers offer Gemma 3 4B IT?

Gemma 3 4B IT is available from 3 providers: AWS Bedrock, OpenRouter, GCP Vertex AI.