Gemma 4 12B IT on Hugging Face Inference Endpoints

Name: Gemma 4 12B IT on Hugging Face Inference Endpoints
Brand: Google DeepMind
SKU: gemma-4-12b-it-huggingface-inference

Gemma 4 · Google DeepMind

Open Source

Last refreshed 2026-06-10. Next refresh: weekly.

Why use Gemma 4 12B IT on Hugging Face Inference Endpoints?

Hugging Face Inference Endpoints offers Gemma 4 12B IT with competitive pricing. Hugging Face is a leading AI community and platform dedicated to democratizing artificial intelligence.

Compare Gemma 4 12B IT across 2 providers to find the best fit for your use case

Input / 1M

Output / 1M

Cache

Not sourced

Batch

Not sourced

Setup recipe

Docs fallback

Install

Use the provider REST API or SDK

Auth

Create a provider API key

Call

model: google/gemma-4-12B-it

Model ID

google/gemma-4-12B-it

Request example

Curated snippets for this provider are not sourced yet. Use Hugging Face Inference Endpoints documentation with model ID google/gemma-4-12B-it.

Gotchas

Use provider model ID "google/gemma-4-12B-it", not the LLMReference slug "gemma-4-12b-it".

Compare Gemma 4 12B IT Across Providers

Provider	Input (per 1M)	Output (per 1M)
Hugging Face Inference Endpoints	—	—
Kaggle Models	—	—

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsAudioFine-tuning

About Gemma 4 12B IT

Instruction-tuned version of Gemma 4 12B. Open weight (Apache 2.0), 12B parameters, encoder-free multimodal (text, image, audio). Optimized for chat and instruction-following. Runs on a 16GB laptop.

FAQ

What is the context window for Gemma 4 12B IT on Hugging Face Inference Endpoints?

Gemma 4 12B IT supports a 262k token context window on Hugging Face Inference Endpoints.

What API model ID do I use for Gemma 4 12B IT on Hugging Face Inference Endpoints?

Use the model ID google/gemma-4-12B-it when calling Hugging Face Inference Endpoints's API.

Who created Gemma 4 12B IT?

Gemma 4 12B IT was created by Google DeepMind as part of the Gemma 4 model family.

Is Gemma 4 12B IT open source?

Gemma 4 12B IT is open source under Apache 2.0 according to the seed data.

Get Started

Model Card Docs Portal Playground Pricing

Model Specs

Released2026-06-03

Parameters12B

Context256k

ArchitectureDecoder Only

Knowledge cutoff2025-01

Hugging Face

All models on Hugging Face Inference Endpoints →Provider setup guide →