LLM Reference

Llama Guard Models by AI at Meta

AI at MetaLlama 2 CommunitySafety
6 models2023–2025Up to 164k ctxFrom $0.05/1M input

About

The Llama Guard family of LLMs, developed by Meta AI, offers content safety classification capabilities for managing human-AI interactions. These models work by scrutinizing both inputs (prompts) and outputs (responses) to flag potentially unsafe content, utilizing a comprehensive safety risk taxonomy 14. Initially focused on text, the Llama Guard 3 Vision model extended this functionality to multimodal inputs, including image analysis 2. These models are known for their performance, which equals or surpasses current content moderation solutions on renowned benchmarks 1. Moreover, they are instruction-tuned, offering adaptability to various use cases and safety frameworks 14. Llama Guard models, including version 3-8B and its variants, are accessible via Hugging Face 4.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

6 in view

Use when the workload needs safety, 164k context, and 12B parameters.

2025-04safety164k context12B parameters

Use when the workload needs safety, 128k context, and 1B parameters.

2024-09safety128k context1B parameters

Use when the workload needs safety, 128k context, and 1B parameters.

2024-09safety128k context1B parameters

Use when the workload needs safety, 8k context, and 8B parameters.

2024-07safety8k context8B parameters

Use when the workload needs safety, 8k context, and 8B parameters.

2024-04safety8k context8B parameters

Use when the workload needs safety, 2k context, and 7B parameters.

2023-12safety2k context7B parameters

Release Timeline

5 release groups
2025-04
1 current
Llama Guard 4 12B
safety164k context12B parameters
Current
2024-09
2 current
Llama Guard 3 11B Vision
safety128k context1B parameters
Current
Llama Guard 3 1B
safety128k context1B parameters
Current
2024-07
1 current
Llama Guard 3 8B
safety8k context8B parameters
Current
2024-04
1 current
Llama Guard 2 8B
safety8k context8B parameters
Current
2023-12
1 current
Llama Guard 7B
safety2k context7B parameters
Current

Specifications(6 models)

Llama Guard model specifications comparison
ModelReleasedContextParametersVisionStructured Outputs
Llama Guard 4 12B2025-04164k12BNoYes
Llama Guard 3 1B2024-09128k1BNoNo
Llama Guard 3 11B Vision2024-09128k1BYesNo
Llama Guard 3 8B2024-078k8BNoYes
Llama Guard 2 8B2024-048k8BNoNo
Llama Guard 7B2023-122k7BNoYes

Available From(8 providers)

Pricing

Llama Guard model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Llama Guard 2 8BReplicate API$0.05$0.25Serverless
Llama Guard 3 1BFireworks AI$0.1$0.1Serverless
Llama Guard 2 8BOctoAI API (Deprecated)$0.15$0.15Serverless
Llama Guard 4 12BOpenRouter$0.18$0.18Serverless
Llama Guard 2 8BFireworks AI$0.2$0.2Provisioned
Llama Guard 7BTogether AI$0.2$0.2Serverless
Llama Guard 7BFireworks AI$0.2$0.2Provisioned
Llama Guard 3 8BFireworks AI$0.2$0.2Serverless
Llama Guard 4 12BReplicate API$0.2$0.2Serverless
Llama Guard 3 8BReplicate API$0.3$0.3Serverless
Llama Guard 3 8BMicrosoft Foundry$0.37$1.1Provisioned
Llama Guard 3 8BOpenRouter$0.48$0.03Serverless
Llama Guard 3 8BCloudflare Workers AI$0.484$0.03Serverless

Frequently Asked Questions

What is Llama Guard used for?
Llama Guard is used for safety, vision and multimodal work, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Llama Guard compare to Chameleon?
Llama Guard by AI at Meta is strongest where you need safety, while Chameleon by AI at Meta is the closest related family to check for coding. Llama Guard has 6 listed variants and reaches up to 164k context, while Chameleon reaches up to 4k context, so compare the specs and pricing tables before choosing a production model.
Which Llama Guard model should I use?
For the lowest listed input price, start with Llama Guard 2 8B through Replicate API at $0.05/1M input tokens. For the most capable/latest local choice, evaluate Llama Guard 4 12B with 164k context and structured outputs.

Models(6)