Gemma 3 4B Instruct
Gemma 3 4B Instruct is worth evaluating for long context when its provider route and context window match the workload.
Use it for
- Teams evaluating long context
- Workloads that can use a 128k context window
- Buyers who need only the 1 tracked provider route (Fireworks AI)
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
Cheapest of 1 route · Fireworks AI
About
Gemma 3 4B Instruct is the 4B-parameter, instruction-tuned model in Google DeepMind's Gemma 3 family. It offers a 128K-token context window, and its weights are openly available for self-hosting.
This page tracks 1 provider route, through Fireworks AI, listed at $0.20 input and $0.20 output per 1M tokens. No headline benchmark score is tracked for Gemma 3 4B Instruct yet.
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
API pricing for the 1 tracked provider, covering input and output tokens, plus batch and cached-read pricing when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| Fireworks AI | $0.200 | $0.200 | Serverless |
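As a rough illustration of how the per-1M-token prices translate into a per-request cost, the sketch below multiplies token counts by the listed Fireworks AI serverless rates. The function name and the example token counts are hypothetical, and the rates are simply copied from the table above; actual billing may differ (for example, if batch or cached-read pricing applies).

```python
# Listed serverless prices from the table above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 0.20


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float = INPUT_PRICE_PER_M,
    output_price_per_m: float = OUTPUT_PRICE_PER_M,
) -> float:
    """Estimate the cost of one request from its token counts.

    This is a hypothetical helper for illustration only; it ignores
    any discounts, minimums, or rounding a provider might apply.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a long-context request with 100,000 input tokens
    # and 2,000 output tokens.
    cost = estimate_cost_usd(100_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Because input and output are priced identically on this route, the estimate depends only on the total token count; the two rates are kept as separate parameters so the same sketch works for routes where they differ.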
Available via routers & gateways (1)
Capabilities
No model capability flags are currently sourced.
Benchmark peer bars for Long context
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
Frequently asked questions
What is the context window of Gemma 3 4B Instruct?
Gemma 3 4B Instruct has a context window of 128k tokens.
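To check whether a workload fits that window, a minimal sketch follows. It assumes "128k" means 128,000 tokens (a conservative reading; some providers may use 131,072), that the prompt and the generated output share the same window, and that token counts come from the provider's own tokenizer. The function name and the example numbers are hypothetical.

```python
# Assumption: "128k" is read conservatively as 128,000 tokens.
CONTEXT_WINDOW_TOKENS = 128_000


def fits_in_context(
    prompt_tokens: int,
    max_output_tokens: int,
    context_window: int = CONTEXT_WINDOW_TOKENS,
) -> bool:
    """Return True if the prompt plus the reserved output budget fits.

    Hypothetical helper: it assumes prompt and output tokens count
    against a single shared window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= context_window


if __name__ == "__main__":
    # A 120,000-token prompt with 4,000 tokens reserved for output fits.
    print(fits_in_context(120_000, 4_000))
    # A 126,000-token prompt with the same output budget does not.
    print(fits_in_context(126_000, 4_000))
```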
How much does Gemma 3 4B Instruct cost?
Gemma 3 4B Instruct is available at $0.20 per 1M input tokens and $0.20 per 1M output tokens through Fireworks AI.
When was Gemma 3 4B Instruct released?
Gemma 3 4B Instruct was released on 2025-03-12, the date of the Gemma 3 launch.
Which providers offer Gemma 3 4B Instruct?
Gemma 3 4B Instruct is available from 1 provider: Fireworks AI.