Gemma 4 12B on Hugging Face Inference Endpoints

Name: Gemma 4 12B on Hugging Face Inference Endpoints
Brand: Google DeepMind
SKU: gemma-4-12b-huggingface-inference

Gemma 4 · Google DeepMind

Open Source

Last refreshed 2026-06-11. Next refresh: weekly.

Why use Gemma 4 12B on Hugging Face Inference Endpoints?

Hugging Face Inference Endpoints offers Gemma 4 12B with competitive pricing. Hugging Face is a leading AI community and platform dedicated to democratizing artificial intelligence.

Compare Gemma 4 12B across 2 providers to find the best fit for your use case

Input / 1M

Output / 1M

Cache

Not sourced

Batch

Not sourced

Setup recipe

Docs fallback

Install

Use the provider REST API or SDK

Auth

Create a provider API key

Call

model: google/gemma-4-12B

Model ID

google/gemma-4-12B

Request example

Curated snippets for this provider are not sourced yet. Use Hugging Face Inference Endpoints documentation with model ID google/gemma-4-12B.

Gotchas

Use provider model ID "google/gemma-4-12B", not the LLMReference slug "gemma-4-12b".

Compare Gemma 4 12B Across Providers

Provider	Input (per 1M)	Output (per 1M)
Hugging Face Inference Endpoints	—	—
Kaggle Models	—	—

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsAudio

About Gemma 4 12B

Google DeepMind's 12B open-weight multimodal model (Apache 2.0), designed to run on a 16GB laptop. First medium-sized model with native audio ingestion alongside text and image. Unified encoder-free decoder-only architecture. Supports 140+ languages. MMLU Pro: 77.2%.

FAQ

What is the context window for Gemma 4 12B on Hugging Face Inference Endpoints?

Gemma 4 12B supports a 262k token context window on Hugging Face Inference Endpoints.

What API model ID do I use for Gemma 4 12B on Hugging Face Inference Endpoints?

Use the model ID google/gemma-4-12B when calling Hugging Face Inference Endpoints's API.

Who created Gemma 4 12B?

Gemma 4 12B was created by Google DeepMind as part of the Gemma 4 model family.

Is Gemma 4 12B open source?

Gemma 4 12B is open source under Apache 2.0 according to the seed data.

Get Started

Model Card Docs Portal Playground Pricing

Model Specs

Released2026-06-03

Parameters12B

Context256k

ArchitectureDecoder Only

Knowledge cutoff2025-01

Hugging Face

All models on Hugging Face Inference Endpoints →Provider setup guide →