Llama Guard 4 12B

Name: Llama Guard 4 12B
Author: AI at Meta

Released

2025-04-05

Last refreshed

2026-06-15

Status

Researched 41d ago

Open weightsCommercial use: conditionalRAGLong contextClassificationJSON / Tool use

Llama Guard 4 12B is worth evaluating for rag, long context, and classification when its provider route and context window match the workload.

Use it for

Teams evaluating rag, long context, and classification
Workloads that can use a 164k context window
Buyers comparing 3 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Llama Guard
Released: 2025-04-05
Context: 164k
Parameters: 12B
Architecture: Decoder Only
Knowledge cutoff: 2024-08
Specialization: safety
Openness: Open weights
License: Llama 2 CommunityCommercial use: conditional

Created by

AI at Meta

Large-scale open-source AI for social technologies.

Menlo Park, California, United States

Founded 2013

Website

Pricing

Output / 1M

$0.180

Input / 1M

$0.180

Cheapest of 3 routes · OpenRouter

Providers(3)

NVIDIA NIM Replicate API OpenRouter

View 3 provider routes

Links

Website

About

Meta: Llama Guard 4 12B available via OpenRouter. Pricing: $0.18/1M input, $0.18/1M output.

Llama Guard 4 12B is an open-weight model in the Llama Guard family. The structured metadata tracks a 164k-token context window and structured outputs. This page tracks provider routes through NVIDIA NIM, Replicate API, and OpenRouter, with the cheapest tracked route listed at $0.18 input and $0.18 output per 1M tokens. No headline benchmark score is tracked for Llama Guard 4 12B yet.

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Classification

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 3

Compare API pricing across 3 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
OpenRouter	$0.180	$0.180	Serverless
Replicate API	$0.200	$0.200	Serverless
NVIDIA NIM	-	-	ServerlessPartial

Available via routers & gateways(1)

NVIDIA LLM Router Blueprint

Router

NVIDIA's open-source AI blueprint for LLM routing that selects the optimal model per prompt via intent classification or neural auto-routing; being deprecated 2026-06-20.

Free OSSNVIDIA NIM