Last refreshed 2026-05-01. Next refresh: weekly.
Why use Llama Guard 4 12B on NVIDIA NIM?
NVIDIA NIM offers Llama Guard 4 12B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Compare Llama Guard 4 12B across 3 providers to find the best fit for your use caseInput / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced
Setup recipe
Docs fallbackInstall
Use the provider REST API or SDKAuth
Create a provider API keyCall
model: meta/llama-guard-4-12bModel ID
meta/llama-guard-4-12bRequest example
Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID
meta/llama-guard-4-12b.Gotchas
- Use provider model ID "meta/llama-guard-4-12b", not the LLMReference slug "llama-guard-4-12b".
Compare Llama Guard 4 12B Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| NVIDIA NIM | — | — |
| Replicate API | $0.20 | $0.20 |
| OpenRouter | $0.18 | $0.18 |
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
Structured Outputs
About Llama Guard 4 12B
Meta: Llama Guard 4 12B available via OpenRouter. Pricing: $0.18/1M input, $0.18/1M output.
FAQ
What is the context window for Llama Guard 4 12B on NVIDIA NIM?
Llama Guard 4 12B supports a 164,000 token context window on NVIDIA NIM.
How does NVIDIA NIM compare to other Llama Guard 4 12B providers?
Llama Guard 4 12B is available from 3 providers. The cheapest input pricing is $0.18/1M tokens from OpenRouter.
What API model ID do I use for Llama Guard 4 12B on NVIDIA NIM?
Use the model ID meta/llama-guard-4-12b when calling NVIDIA NIM's API.
Who created Llama Guard 4 12B?
Llama Guard 4 12B was created by AI at Meta as part of the Llama Guard model family.
Is Llama Guard 4 12B open source?
Llama Guard 4 12B is open source according to the seed data.
Model Specs
Released2025-04-05
Context164K
ArchitectureDecoder Only