LLM ReferenceLLM Reference

Prompt Guard

3 models2024–2025Up to 512 ctxFrom $0.03/1M input

About

Prompt Guard is a specialized text classification model created by Meta, focusing on the detection of malicious prompts, such as jailbreaks and prompt injections. Leveraging the mDeBERTa-v3-base transformer architecture, this lightweight model categorizes inputs into three distinct classes: benign, injection, and jailbreak. Its design ensures compatibility with a variety of large language models (LLMs) without requiring specific prompt structures. With a compact size of 86 million parameters, Prompt Guard integrates seamlessly into diverse applications. While it excels at identifying common attacks, it may need fine-tuning with application-specific data to improve its resilience against adaptive attacks 346.

Specifications(3 models)

Prompt Guard model specifications comparison
ModelReleasedContextParametersStructured Outputs
Llama Prompt Guard 2 22M2025-0451222MYes
Llama Prompt Guard 2 86M2025-0451286MYes
Prompt Guard 86M2024-07512279MNo

Available From(2 providers)

Pricing

Prompt Guard model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Llama Prompt Guard 2 22MGroqCloud$0.03$0.03Serverless
Llama Prompt Guard 2 86MGroqCloud$0.04$0.04Serverless
Prompt Guard 86MMicrosoft Foundry$0.05$0.05Provisioned

Frequently Asked Questions

What is Prompt Guard?
Prompt Guard is a specialized text classification model created by Meta, focusing on the detection of malicious prompts, such as jailbreaks and prompt injections. Leveraging the mDeBERTa-v3-base transformer architecture, this lightweight model categorizes inputs into three distinct classes: benign, injection, and jailbreak. Its design ensures compatibility with a variety of large language models (LLMs) without requiring specific prompt structures. With a compact size of 86 million parameters, Prompt Guard integrates seamlessly into diverse applications. While it excels at identifying common attacks, it may need fine-tuning with application-specific data to improve its resilience against adaptive attacks 346.
How many models are in the Prompt Guard family?
The Prompt Guard family contains 3 models.
What is the latest Prompt Guard model?
The latest model is Llama Prompt Guard 2 22M, released in 2025-04.
How much does Prompt Guard cost?
Prompt Guard models range from $0.03/1M to $0.05/1M input tokens depending on the model and provider.

Models(3)