Gemma 3 12B
gemma-3-12b-it
Last refreshed 2026-05-19. Next refresh: weekly.
Gemma 3 12B is worth evaluating for classification and JSON / tool use when its provider route and context window match the workload.
Decision context: Classification task fit, 3 tracked provider routes, and research from 2026-05-19.
Use it for
- Teams evaluating classification and JSON / tool use
- Workloads that can use a 33K context window
- Buyers comparing 3 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
Cheapest output
$0.130
GCP Vertex AI per 1M tokens
Provider routes
3
Tracked API hosts
Quality / dollar
Unknown
No task benchmark coverage yet
Freshness
2026-05-19
Researched 2d ago
Top use-case fit
Classification
Included by capability and metadata signals in the decision map.
JSON / Tool use
Included by capability and metadata signals in the decision map.
Provider price ladder
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| GCP Vertex AI | $0.040 | $0.130 | Serverless |
| OpenRouter | $0.040 | $0.130 | Serverless |
| AWS Bedrock | $0.300 | $0.300 | Serverless |
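To see how the price ladder affects a concrete workload, the sketch below estimates the cost of a token budget on each tracked route. The prices are copied from the table above; the function names `estimate_cost` and `rank_routes` and the example token volumes are hypothetical, chosen only for illustration, and the calculation ignores any provider-specific discounts, minimums, or fees.

```python
# Prices in USD per 1M tokens as (input, output), copied from the table above.
PRICES = {
    "GCP Vertex AI": (0.040, 0.130),
    "OpenRouter": (0.040, 0.130),
    "AWS Bedrock": (0.300, 0.300),
}


def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a workload on one provider route."""
    input_price, output_price = PRICES[provider]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000  # prices are quoted per 1M tokens


def rank_routes(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = {p: estimate_cost(p, input_tokens, output_tokens) for p in PRICES}
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical monthly volume: 10M input tokens, 1M output tokens.
    for provider, cost in rank_routes(10_000_000, 1_000_000):
        print(f"{provider}: ${cost:.2f}")
```

For classification workloads, which tend to be input-heavy with short outputs, the input price usually dominates the total, so the gap between the $0.040 and $0.300 input rates matters more than the output rates.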
Benchmark peer bars for Classification
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.
About
Gemma 3 12B is a 12B-parameter model in Google DeepMind's Gemma 3 family. It offers a 33K-token context window, and its weights are openly available for self-hosting. Its lowest tracked pricing is $0.04 per 1M input tokens and $0.13 per 1M output tokens.