Prompt Guard
3 models2024–2025Up to 512 ctxFrom $0.03/1M input
About
Prompt Guard is a specialized text classification model created by Meta, focusing on the detection of malicious prompts, such as jailbreaks and prompt injections. Leveraging the mDeBERTa-v3-base transformer architecture, this lightweight model categorizes inputs into three distinct classes: benign, injection, and jailbreak. Its design ensures compatibility with a variety of large language models (LLMs) without requiring specific prompt structures. With a compact size of 86 million parameters, Prompt Guard integrates seamlessly into diverse applications. While it excels at identifying common attacks, it may need fine-tuning with application-specific data to improve its resilience against adaptive attacks 346.
Specifications(3 models)
| Model | Released | Context | Parameters | Structured Outputs |
|---|---|---|---|---|
| Llama Prompt Guard 2 22M | 2025-04 | 512 | 22M | Yes |
| Llama Prompt Guard 2 86M | 2025-04 | 512 | 86M | Yes |
| Prompt Guard 86M | 2024-07 | 512 | 279M | No |
Available From(2 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| Llama Prompt Guard 2 22M | GroqCloud | $0.03 | $0.03 | Serverless |
| Llama Prompt Guard 2 86M | GroqCloud | $0.04 | $0.04 | Serverless |
| Prompt Guard 86M | Microsoft Foundry | $0.05 | $0.05 | Provisioned |
Frequently Asked Questions
- What is Prompt Guard?
- Prompt Guard is a specialized text classification model created by Meta, focusing on the detection of malicious prompts, such as jailbreaks and prompt injections. Leveraging the mDeBERTa-v3-base transformer architecture, this lightweight model categorizes inputs into three distinct classes: benign, injection, and jailbreak. Its design ensures compatibility with a variety of large language models (LLMs) without requiring specific prompt structures. With a compact size of 86 million parameters, Prompt Guard integrates seamlessly into diverse applications. While it excels at identifying common attacks, it may need fine-tuning with application-specific data to improve its resilience against adaptive attacks 346.
- How many models are in the Prompt Guard family?
- The Prompt Guard family contains 3 models.
- What is the latest Prompt Guard model?
- The latest model is Llama Prompt Guard 2 22M, released in 2025-04.
- How much does Prompt Guard cost?
- Prompt Guard models range from $0.03/1M to $0.05/1M input tokens depending on the model and provider.






