LLM Reference

Using Gemma 4 12B IT on Kaggle Models

Implementation guide · Gemma 4 · Google DeepMind

Open Source

Quick Start

  1. 1
    Create an account at Kaggle Models and generate an API key.
  2. 2
    Use the Kaggle Models SDK or REST API to call google/gemma-4 — see the documentation for request format.

Code Examples

See Kaggle Models documentation for integration details.

About Kaggle Models

Kaggle Models hosts model collections with downloadable artifacts, examples, and notebook integration. For Gemma 4 12B, the confirmed source-backed availability is the official google/gemma-4 collection rather than a managed per-token API endpoint.

Kaggle Models is Google Kaggle's model hub for discovering and downloading public machine learning models, including official Google Gemma weights for local and notebook-based use.

Pricing on Kaggle Models

Capabilities

VisionMultimodalReasoningFunction CallingTool UseStructured OutputsAudio

About Gemma 4 12B IT

Instruction-tuned 12B Gemma 4 model with native text, image, video, and audio input through an encoder-free unified architecture. It runs on 16 GB VRAM in BF16, supports a 256K context window, configurable thinking mode, function calling, structured outputs, and 140+ languages, making it the mid-sized Gemma 4 option between E4B and the 26B MoE.

Model Specs

Released2026-06-03
Parameters11.9B
Context256k
Architectureencoder_free_unified_multimodal
Knowledge cutoff2025-01

More Models on Kaggle Models

Provider

Kaggle Models

Google