Gemma 3 Models by Google DeepMind
About
Gemma 3 is a family of 19 AI models by Google DeepMind, released between 2024 and 2026.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs 8k context, 4B parameters, and tool use.
Use when the workload needs 33k context, 12B parameters, and structured outputs.
Use when the workload needs 128k context, 27B parameters, and structured outputs.
Use when the workload needs 128k context, 4B parameters, and structured outputs.
Use when the workload needs 8k context and structured outputs.
Use when the workload needs 32k context and structured outputs.
Use when the workload needs 131k context, 27B parameters, and structured outputs.
Use when the workload needs 128k context and 12B parameters.
Use when the workload needs 32k context and 1B parameters.
Use when the workload needs 128k context and 27B parameters.
Use when the workload needs 128k context and 4B parameters.
Use when the workload needs safety, 4B parameters, and tool use.
Use when the workload needs 4B parameters, tool use, and function calling.
Use when the workload needs 400M parameters, tool use, and function calling.
Use when the workload needs 2B parameters, tool use, and function calling.
Use when the workload needs 2B parameters, tool use, and function calling.
Use when the workload needs 3B parameters, tool use, and function calling.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| Together AI - Gemma 3n-e4B | Use when the workload needs 8k context, 4B parameters, and tool use. | 2026-03 | 8k context4B parameterstool use | Current |
| Gemma 3 12B | Use when the workload needs 33k context, 12B parameters, and structured outputs. | 2026-01 | 33k context12B parametersstructured outputs | Current |
| Gemma 3 27B PT | Use when the workload needs 128k context, 27B parameters, and structured outputs. | 2026-01 | 128k context27B parametersstructured outputs | Current |
| Gemma 3 4B IT | Use when the workload needs 128k context, 4B parameters, and structured outputs. | 2026-01 | 128k context4B parametersstructured outputs | Current |
| Gemma 3n 2B (free) | Use when the workload needs 8k context. | 2025-04 | 8k context | Current |
| Gemma 3n 4B (free) | Use when the workload needs 8k context and structured outputs. | 2025-04 | 8k contextstructured outputs | Current |
| Gemma 3 | Use when the workload needs structured outputs. | 2025-03 | structured outputs | Current |
| Gemma 3n | Use when the workload needs 32k context and structured outputs. | 2025-03 | 32k contextstructured outputs | Current |
| Gemma 3 27B | Use when the workload needs 131k context, 27B parameters, and structured outputs. | 2025-03 | 131k context27B parametersstructured outputs | Current |
| Gemma 3 12B Instruct | Use when the workload needs 128k context and 12B parameters. | 2025-01 | 128k context12B parameters | Current |
| Gemma 3 1B Instruct | Use when the workload needs 32k context and 1B parameters. | 2025-01 | 32k context1B parameters | Current |
| Gemma 3 27B Instruct | Use when the workload needs 128k context and 27B parameters. | 2025-01 | 128k context27B parameters | Current |
| Gemma 3 4B Instruct | Use when the workload needs 128k context and 4B parameters. | 2025-01 | 128k context4B parameters | Current |
| ShieldGemma 2 | Use when the workload needs safety, 4B parameters, and tool use. | 2024-09 | safety4B parameterstool use | Current |
| MedGemma | Use when the workload needs 4B parameters, tool use, and function calling. | 2024-07 | 4B parameterstool usefunction calling | Current |
| MedSigLIP | Use when the workload needs 400M parameters, tool use, and function calling. | 2024-07 | 400M parameterstool usefunction calling | Current |
| TxGemma | Use when the workload needs 2B parameters, tool use, and function calling. | 2024-06 | 2B parameterstool usefunction calling | Current |
| T5Gemma | Use when the workload needs 2B parameters, tool use, and function calling. | 2024-04 | 2B parameterstool usefunction calling | Current |
| PaliGemma | Use when the workload needs 3B parameters, tool use, and function calling. | 2024-03 | 3B parameterstool usefunction calling | Current |
Release Timeline
10 release groupsSpecifications(19 models)
| Model | Released | Context | Parameters | Vision | Multimodal | Fn Calling | Tool Use | Structured Outputs |
|---|---|---|---|---|---|---|---|---|
| Together AI - Gemma 3n-e4B | 2026-03 | 8k | 4B | No | No | Yes | Yes | Yes |
| Gemma 3 12B | 2026-01 | 33k | 12B | No | No | No | No | Yes |
| Gemma 3 27B PT | 2026-01 | 128k | 27B | No | No | No | No | Yes |
| Gemma 3 4B IT | 2026-01 | 128k | 4B | No | No | No | No | Yes |
| Gemma 3n 2B (free) | 2025-04 | 8k | 5B (2B effective active) | No | No | No | No | No |
| Gemma 3n 4B (free) | 2025-04 | 8k | 8B (4B effective active) | No | No | No | No | Yes |
| Gemma 3 | 2025-03 | — | — | No | No | No | No | Yes |
| Gemma 3n | 2025-03 | 32k | — | No | No | No | No | Yes |
| Gemma 3 27B | 2025-03 | 131k | 27B | No | No | No | No | Yes |
| Gemma 3 12B Instruct | 2025-01 | 128k | 12B | No | No | No | No | No |
| Gemma 3 1B Instruct | 2025-01 | 32k | 1B | No | No | No | No | No |
| Gemma 3 27B Instruct | 2025-01 | 128k | 27B | No | No | No | No | No |
| Gemma 3 4B Instruct | 2025-01 | 128k | 4B | No | No | No | No | No |
| ShieldGemma 2 | 2024-09 | — | 4B | Yes | Yes | Yes | Yes | Yes |
| MedGemma | 2024-07 | — | 4B | Yes | Yes | Yes | Yes | Yes |
| MedSigLIP | 2024-07 | — | 400M | Yes | Yes | Yes | Yes | Yes |
| TxGemma | 2024-06 | — | 2B | No | No | Yes | Yes | Yes |
| T5Gemma | 2024-04 | — | 2B | No | No | Yes | Yes | Yes |
| PaliGemma | 2024-03 | — | 3B | Yes | Yes | Yes | Yes | Yes |
Available From(9 providers)
Pricing
Frequently Asked Questions
- What is Gemma 3 used for?
- Gemma 3 is used for safety, vision and multimodal work, and agent workflows and tool use. The family description and listed model capabilities point to those workloads as the best fit.
- How does Gemma 3 compare to Gemma 4?
- Gemma 3 by Google DeepMind is strongest where you need safety, while Gemma 4 by Google DeepMind is the closest related family to check for multimodal. Gemma 3 has 19 listed variants and reaches up to 131k context, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
- Which Gemma 3 model should I use?
- For the lowest listed input price, start with Together AI - Gemma 3n-e4B through Together AI at $0.02/1M input tokens. For the most capable/latest local choice, evaluate ShieldGemma 2 with tool use, function calling, structured outputs, and multimodal inputs.
Models(19)
Together AI - Gemma 3n-e4B
Gemma 3 12B
Gemma 3 27B PT
Gemma 3 4B IT
Gemma 3n 2B (free)
Gemma 3n 4B (free)
Gemma 3
Gemma 3n
Gemma 3 27B
Gemma 3 12B Instruct
Gemma 3 1B Instruct
Gemma 3 27B Instruct
Gemma 3 4B Instruct
ShieldGemma 2
MedGemma
MedSigLIP
TxGemma
T5Gemma
PaliGemma






