LLM Reference

Prompt Guard Models by AI at Meta

3 models | 2024–2025 | Up to 512 ctx | From $0.03/1M input

About

Prompt Guard is a specialized text classification model from Meta for detecting malicious prompts such as jailbreaks and prompt injections. Built on the mDeBERTa-v3-base transformer architecture, this lightweight model sorts each input into one of three classes: benign, injection, or jailbreak. It works alongside a variety of large language models (LLMs) without requiring a specific prompt structure. With 86 million backbone parameters (the 279M figure in the listings below presumably also counts the embedding layer), Prompt Guard is small enough to add to an application as a screening step. It performs well on common attacks, but it may need fine-tuning on application-specific data to hold up against adaptive attacks.
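The three-class decision described above can be sketched without loading the model. The snippet below is a minimal illustration, not Meta's implementation: the label order, the 0.5 threshold, and the example logits are all assumptions chosen for the example, so check the label mapping shipped with whichever checkpoint you deploy.

```python
import math

# Assumed label order for illustration; verify against the checkpoint's own mapping.
LABELS = ("benign", "injection", "jailbreak")


def softmax(logits):
    """Convert raw classifier logits to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(logits, threshold=0.5):
    """Return (label, probability).

    The strongest attack class is reported only if its probability reaches
    the (hypothetical) threshold; otherwise the input is treated as benign.
    """
    scores = dict(zip(LABELS, softmax(logits)))
    attack = max(("injection", "jailbreak"), key=scores.get)
    if scores[attack] >= threshold:
        return attack, scores[attack]
    return "benign", scores["benign"]


# Hypothetical logits, not real model output.
print(classify([0.2, 0.1, 3.5]))
print(classify([4.0, 0.5, 0.3]))
```

Raising the threshold trades missed attacks for fewer false alarms, which is one of the knobs worth tuning alongside any application-specific fine-tuning.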

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.


Llama Prompt Guard 2 22M
Use when the workload needs safety, 512 context, and 22M parameters.
2025-04 | safety | 512 context | 22M parameters

Llama Prompt Guard 2 86M
Use when the workload needs safety, 512 context, and 86M parameters.
2025-04 | safety | 512 context | 86M parameters

Prompt Guard 86M
Use when the workload needs safety, 512 context, and 279M parameters.
2024-07 | safety | 512 context | 279M parameters

Release Timeline

2 release groups

2025-04 (2 current)
Llama Prompt Guard 2 22M | safety | 512 context | 22M parameters | Current
Llama Prompt Guard 2 86M | safety | 512 context | 86M parameters | Current

2024-07 (1 current)
Prompt Guard 86M | safety | 512 context | 279M parameters | Current

Specifications (3 models)

Prompt Guard model specifications comparison

Model                     Released  Context  Parameters  Structured Outputs
Llama Prompt Guard 2 22M  2025-04   512      22M         Yes
Llama Prompt Guard 2 86M  2025-04   512      86M         Yes
Prompt Guard 86M          2024-07   512      279M        No
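Every listed variant stops at 512 context, so longer inputs have to be split before classification. One common approach, sketched here under assumptions, is to scan overlapping windows and keep the highest attack score. The 64-token overlap, the reading of "512 context" as a token count, and the `score_window` callback (a stand-in for a real model call) are all hypothetical.

```python
from typing import Callable, List, Sequence

MAX_TOKENS = 512  # context limit listed for all three variants (assumed to be tokens)


def split_windows(token_ids: Sequence[int], size: int = MAX_TOKENS,
                  overlap: int = 64) -> List[List[int]]:
    """Split token ids into overlapping windows of at most `size` tokens."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    if len(token_ids) == 0:
        return []
    step = size - overlap
    windows = []
    for start in range(0, len(token_ids), step):
        windows.append(list(token_ids[start:start + size]))
        if start + size >= len(token_ids):
            break  # this window already reaches the end of the input
    return windows


def score_long_input(token_ids: Sequence[int],
                     score_window: Callable[[List[int]], float]) -> float:
    """Return the highest attack score over all windows (0.0 for empty input)."""
    return max((score_window(w) for w in split_windows(token_ids)), default=0.0)


# Toy scorer for demonstration only: flags any window containing token id 666.
def toy_scorer(window: List[int]) -> float:
    return 0.9 if 666 in window else 0.05


print(len(split_windows(list(range(1000)))))
print(score_long_input(list(range(1000)), toy_scorer))
```

Taking the maximum means a single suspicious window flags the whole input, which suits a screening step; the overlap reduces the chance that an attack straddling a window boundary is missed.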

Available From (2 providers)

Pricing

Prompt Guard model pricing by provider

Model                     Provider           Input / 1M  Output / 1M  Type
Llama Prompt Guard 2 22M  GroqCloud          $0.03       $0.03        Serverless
Llama Prompt Guard 2 86M  GroqCloud          $0.04       $0.04        Serverless
Prompt Guard 86M          Microsoft Foundry  $0.05       $0.05        Provisioned
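As a rough worked example of the per-million pricing above, the sketch below estimates input cost for a given token volume. The prices are copied from the table; the workload (10,000 prompts of 200 tokens each) is hypothetical, and only input tokens are counted.

```python
from decimal import Decimal

# Input prices in USD per 1M tokens, copied from the pricing table above.
INPUT_PRICE_PER_MILLION = {
    "Llama Prompt Guard 2 22M": Decimal("0.03"),
    "Llama Prompt Guard 2 86M": Decimal("0.04"),
    "Prompt Guard 86M": Decimal("0.05"),
}


def input_cost(model: str, tokens: int) -> Decimal:
    """Estimated input cost in USD for `tokens` tokens on the given model."""
    return INPUT_PRICE_PER_MILLION[model] * Decimal(tokens) / Decimal(1_000_000)


# Hypothetical workload: 10,000 prompts averaging 200 tokens each.
total_tokens = 10_000 * 200
for name in INPUT_PRICE_PER_MILLION:
    print(f"{name}: ${input_cost(name, total_tokens)}")
```

`Decimal` is used instead of floats so that small per-token amounts do not pick up rounding noise when summed over a billing period.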

Frequently Asked Questions

What is Prompt Guard used for?
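Prompt Guard is used for safety classification: screening prompts for jailbreaks and prompt injections. Both Prompt Guard 2 variants are also listed with structured outputs. The family description and listed model capabilities point to those workloads as the best fit.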
How does Prompt Guard compare to Chameleon?
Prompt Guard by AI at Meta is the stronger fit where you need safety, while Chameleon, also by AI at Meta, is the closest related family listed for coding. Prompt Guard has 3 listed variants with up to 512 context, while Chameleon reaches up to 4k context, so compare the specs and pricing tables before choosing a production model.
Which Prompt Guard model should I use?
For the lowest listed input price, start with Llama Prompt Guard 2 22M through GroqCloud at $0.03/1M input tokens. Both Prompt Guard 2 variants share the latest release date (2025-04), 512 context, and structured outputs; if you want the larger of the two, evaluate Llama Prompt Guard 2 86M at $0.04/1M input tokens.
