LLM Reference

Gemma 3 12B

gemma-3-12b-it

Researched 2d ago

Last refreshed 2026-05-19. Next refresh: weekly.

Open SourceClassificationJSON / Tool use

Gemma 3 12B is worth evaluating for classification and json / tool use when its provider route and context window match the workload.

Decision context: Classification task fit, 3 tracked provider routes, and research from 2026-05-19.

Use it for

  • Teams evaluating classification and json / tool use
  • Workloads that can use a 33K context window
  • Buyers comparing 3 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads

Cheapest output

$0.130

GCP Vertex AI per 1M tokens

Provider routes

3

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-05-19

Researched 2d ago

fresh

Top use-case fit

Classification

Included by capability and metadata signals in the decision map.

JSON / Tool use

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 3
ProviderInput / 1MOutput / 1MRoute
GCP Vertex AI$0.040$0.130
Serverless
OpenRouter$0.040$0.130
Serverless
AWS Bedrock$0.300$0.300
Serverless

Benchmark peer barsfor Classification

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

Gemma 3 12B is Google DeepMind's Gemma 3 model. It offers a 33K-token context window with weights openly available for self-hosting.

Gemma 3 12B has a 33K-token context window.

Gemma 3 12B input tokens at $0.04/1M, output at $0.13/1M.

Capabilities

Structured Outputs

Rankings

Specifications

FamilyGemma 3
Released2026-01-01
Context33K
ArchitectureDecoder Only
Knowledge cutoff2024-08
Specializationgeneral

Created by

Pioneering artificial intelligence research.

London, United Kingdom
Founded 2014
Website