llmreference

Gemma 4 E4B

gemma-4-e4b

Researched 28d ago

Last refreshed 2026-05-11. Next refresh: weekly.

Open SourceMultimodalRAGAgentsLong contextVisionJSON / Tool use

Gemma 4 E4B is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.

Decision context: RAG task fit, 1 tracked provider route, and research from 2026-04-21.

Use it for

  • Teams evaluating rag, agents, and long context
  • Workloads that can use a 128k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Workloads where another current model has stronger sourced task evidence

Cheapest output

Free

GCP Vertex AI per 1M tokens

Provider routes

1

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-04-21

Researched 28d ago

fresh

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

ProviderInput / 1MOutput / 1MRoute
GCP Vertex AIFreeFree
Serverless

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

Efficient 4B model with native audio input support. Balances performance and efficiency for edge and on-device deployment with reasoning and coding capabilities.

Gemma 4 E4B has a 128K-token context window.

Gemma 4 E4B input tokens at $0/1M, output at $0/1M.

Capabilities

MultimodalFunction Calling

Rankings

Specifications

FamilyGemma 4
Released2026-03-31
Parameters4B
Context128k
Knowledge cutoff2025-01

Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014
Website

Providers(1)