LLM ReferenceLLM Reference

Llama Guard

AI at MetaLlama 2 CommunitySafety
6 models2023–2025Up to 164K ctxFrom $0.05/1M input

About

The Llama Guard family of LLMs, developed by Meta AI, offers content safety classification capabilities for managing human-AI interactions. These models work by scrutinizing both inputs (prompts) and outputs (responses) to flag potentially unsafe content, utilizing a comprehensive safety risk taxonomy 14. Initially focused on text, the Llama Guard 3 Vision model extended this functionality to multimodal inputs, including image analysis 2. These models are known for their performance, which equals or surpasses current content moderation solutions on renowned benchmarks 1. Moreover, they are instruction-tuned, offering adaptability to various use cases and safety frameworks 14. Llama Guard models, including version 3-8B and its variants, are accessible via Hugging Face 4.

Specifications(6 models)

Llama Guard model specifications comparison
ModelReleasedContextParametersVisionStructured Outputs
Llama Guard 4 12B2025-04164KNoYes
Llama Guard 3 1B2024-091BNoNo
Llama Guard 3 11B Vision2024-091BYesNo
Llama Guard 3 8B2024-078K8BNoYes
Llama Guard 2 8B2024-048K8BNoNo
Llama Guard 7B2023-122K7BNoYes

Available From(8 providers)

Pricing

Llama Guard model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Llama Guard 2 8BReplicate API$0.05$0.25Serverless
Llama Guard 3 1BFireworks AI$0.1$0.1Serverless
Llama Guard 2 8BOctoAI API$0.15$0.15Serverless
Llama Guard 4 12BOpenRouter$0.18$0.18Serverless
Llama Guard 2 8BFireworks AI$0.2$0.2Provisioned
Llama Guard 7BTogether AI$0.2$0.2Serverless
Llama Guard 7BFireworks AI$0.2$0.2Provisioned
Llama Guard 3 8BFireworks AI$0.2$0.2Serverless
Llama Guard 4 12BReplicate API$0.2$0.2Serverless
Llama Guard 3 8BReplicate API$0.3$0.3Serverless
Llama Guard 3 8BMicrosoft Foundry$0.37$1.1Provisioned
Llama Guard 3 8BOpenRouter$0.48$0.03Serverless

Frequently Asked Questions

What is Llama Guard?
The Llama Guard family of LLMs, developed by Meta AI, offers content safety classification capabilities for managing human-AI interactions. These models work by scrutinizing both inputs (prompts) and outputs (responses) to flag potentially unsafe content, utilizing a comprehensive safety risk taxonomy 14. Initially focused on text, the Llama Guard 3 Vision model extended this functionality to multimodal inputs, including image analysis 2. These models are known for their performance, which equals or surpasses current content moderation solutions on renowned benchmarks 1. Moreover, they are instruction-tuned, offering adaptability to various use cases and safety frameworks 14. Llama Guard models, including version 3-8B and its variants, are accessible via Hugging Face 4.
How many models are in the Llama Guard family?
The Llama Guard family contains 6 models.
What is the latest Llama Guard model?
The latest model is Llama Guard 4 12B, released in 2025-04.
How much does Llama Guard cost?
Llama Guard models range from $0.05/1M to $0.48/1M input tokens depending on the model and provider.

Models(6)