LLM Reference

Gemma 3 Models by Google DeepMind

19 models2024–2026Up to 131k ctxFrom $0.02/1M input

About

Gemma 3 is a family of 19 AI models by Google DeepMind, released between 2024 and 2026.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

19 in view

Use when the workload needs 8k context, 4B parameters, and tool use.

2026-038k context4B parameterstool use

Use when the workload needs 33k context, 12B parameters, and structured outputs.

2026-0133k context12B parametersstructured outputs

Use when the workload needs 128k context, 27B parameters, and structured outputs.

2026-01128k context27B parametersstructured outputs

Use when the workload needs 128k context, 4B parameters, and structured outputs.

2026-01128k context4B parametersstructured outputs

Use when the workload needs 8k context.

2025-048k context

Use when the workload needs 8k context and structured outputs.

2025-048k contextstructured outputs
Gemma 3Current

Use when the workload needs structured outputs.

2025-03structured outputs
Gemma 3nCurrent

Use when the workload needs 32k context and structured outputs.

2025-0332k contextstructured outputs

Use when the workload needs 131k context, 27B parameters, and structured outputs.

2025-03131k context27B parametersstructured outputs

Use when the workload needs 128k context and 12B parameters.

2025-01128k context12B parameters

Use when the workload needs 32k context and 1B parameters.

2025-0132k context1B parameters

Use when the workload needs 128k context and 27B parameters.

2025-01128k context27B parameters

Use when the workload needs 128k context and 4B parameters.

2025-01128k context4B parameters

Use when the workload needs safety, 4B parameters, and tool use.

2024-09safety4B parameterstool use
MedGemmaCurrent

Use when the workload needs 4B parameters, tool use, and function calling.

2024-074B parameterstool usefunction calling
MedSigLIPCurrent

Use when the workload needs 400M parameters, tool use, and function calling.

2024-07400M parameterstool usefunction calling
TxGemmaCurrent

Use when the workload needs 2B parameters, tool use, and function calling.

2024-062B parameterstool usefunction calling
T5GemmaCurrent

Use when the workload needs 2B parameters, tool use, and function calling.

2024-042B parameterstool usefunction calling
PaliGemmaCurrent

Use when the workload needs 3B parameters, tool use, and function calling.

2024-033B parameterstool usefunction calling

Release Timeline

10 release groups
2026-03
1 current
Together AI - Gemma 3n-e4B
8k context4B parameterstool use
Current
2026-01
3 current
Gemma 3 12B
33k context12B parametersstructured outputs
Current
Gemma 3 27B PT
128k context27B parametersstructured outputs
Current
Gemma 3 4B IT
128k context4B parametersstructured outputs
Current
2025-04
2 current
Current
Gemma 3n 4B (free)
8k contextstructured outputs
Current
2025-03
3 current
Gemma 3
structured outputs
Current
Gemma 3 27B
131k context27B parametersstructured outputs
Current
Gemma 3n
32k contextstructured outputs
Current
2025-01
4 current
Gemma 3 12B Instruct
128k context12B parameters
Current
Gemma 3 1B Instruct
32k context1B parameters
Current
Gemma 3 27B Instruct
128k context27B parameters
Current
Gemma 3 4B Instruct
128k context4B parameters
Current
2024-09
1 current
ShieldGemma 2
safety4B parameterstool use
Current
2024-07
2 current
MedGemma
4B parameterstool usefunction calling
Current
MedSigLIP
400M parameterstool usefunction calling
Current
2024-06
1 current
TxGemma
2B parameterstool usefunction calling
Current

Specifications(19 models)

Gemma 3 model specifications comparison
ModelReleasedContextParametersVisionMultimodalFn CallingTool UseStructured Outputs
Together AI - Gemma 3n-e4B2026-038k4BNoNoYesYesYes
Gemma 3 12B2026-0133k12BNoNoNoNoYes
Gemma 3 27B PT2026-01128k27BNoNoNoNoYes
Gemma 3 4B IT2026-01128k4BNoNoNoNoYes
Gemma 3n 2B (free)2025-048k5B (2B effective active)NoNoNoNoNo
Gemma 3n 4B (free)2025-048k8B (4B effective active)NoNoNoNoYes
Gemma 32025-03NoNoNoNoYes
Gemma 3n2025-0332kNoNoNoNoYes
Gemma 3 27B2025-03131k27BNoNoNoNoYes
Gemma 3 12B Instruct2025-01128k12BNoNoNoNoNo
Gemma 3 1B Instruct2025-0132k1BNoNoNoNoNo
Gemma 3 27B Instruct2025-01128k27BNoNoNoNoNo
Gemma 3 4B Instruct2025-01128k4BNoNoNoNoNo
ShieldGemma 22024-094BYesYesYesYesYes
MedGemma2024-074BYesYesYesYesYes
MedSigLIP2024-07400MYesYesYesYesYes
TxGemma2024-062BNoNoYesYesYes
T5Gemma2024-042BNoNoYesYesYes
PaliGemma2024-033BYesYesYesYesYes

Available From(9 providers)

Pricing

Gemma 3 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Together AI - Gemma 3n-e4BTogether AI$0.02$0.04Serverless
Gemma 3n 4B (free)Together AI$0.02$0.04Serverless
Gemma 3OpenRouter$0.04$0.08Serverless
Gemma 3 4B ITOpenRouter$0.04$0.08Serverless
Gemma 3 12BOpenRouter$0.04$0.13Serverless
Gemma 3 4B ITGCP Vertex AI$0.04$0.08Serverless
Gemma 3 12BGCP Vertex AI$0.04$0.13Serverless
Gemma 3 12BNovita AI$0.05$0.1Serverless
Gemma 3n 4B (free)OpenRouter$0.06$0.12Serverless
Gemma 3 27BOpenRouter$0.08$0.16Serverless
Gemma 3 27BGCP Vertex AI$0.08$0.16Serverless
Gemma 3 1B InstructFireworks AI$0.1$0.1Serverless
Gemma 3 27BNovita AI$0.119$0.2Serverless
Gemma 3 12B InstructFireworks AI$0.2$0.2Serverless
Gemma 3 4B InstructFireworks AI$0.2$0.2Serverless
Gemma 3 4B ITAWS Bedrock$0.2$0.2Serverless
Gemma 3 27B PTAWS Bedrock$0.23$0.38Serverless
Gemma 3 12BAWS Bedrock$0.3$0.3Serverless
Gemma 3 12BCloudflare Workers AI$0.345$0.556Serverless
Gemma 3 27BAWS Bedrock$0.5$0.5Serverless
Gemma 3 27B InstructFireworks AI$0.9$0.9Serverless

Comparisons

All comparisons →

Frequently Asked Questions

What is Gemma 3 used for?
Gemma 3 is used for safety, vision and multimodal work, and agent workflows and tool use. The family description and listed model capabilities point to those workloads as the best fit.
How does Gemma 3 compare to Gemma 4?
Gemma 3 by Google DeepMind is strongest where you need safety, while Gemma 4 by Google DeepMind is the closest related family to check for multimodal. Gemma 3 has 19 listed variants and reaches up to 131k context, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
Which Gemma 3 model should I use?
For the lowest listed input price, start with Together AI - Gemma 3n-e4B through Together AI at $0.02/1M input tokens. For the most capable/latest local choice, evaluate ShieldGemma 2 with tool use, function calling, structured outputs, and multimodal inputs.

Models(19)