LLM ReferenceLLM Reference
NVIDIA NIM

Llama Guard 4 12B on NVIDIA NIM

Llama Guard · AI at Meta

ServerlessOpen Source

Last refreshed 2026-05-01. Next refresh: weekly.

Why use Llama Guard 4 12B on NVIDIA NIM?

NVIDIA NIM offers Llama Guard 4 12B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.

Compare Llama Guard 4 12B across 3 providers to find the best fit for your use case
Input / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: meta/llama-guard-4-12b
Model ID
meta/llama-guard-4-12b

Request example

Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID meta/llama-guard-4-12b.

Gotchas

  • Use provider model ID "meta/llama-guard-4-12b", not the LLMReference slug "llama-guard-4-12b".

Compare Llama Guard 4 12B Across Providers

ProviderInput (per 1M)Output (per 1M)
NVIDIA NIM
Replicate API$0.20$0.20
OpenRouter$0.18$0.18

Pricing

TypeRate
GPU Hour Rate$1.00/GPU·hr
GPU Config1xH100

Capabilities

Structured Outputs

About Llama Guard 4 12B

Meta: Llama Guard 4 12B available via OpenRouter. Pricing: $0.18/1M input, $0.18/1M output.

FAQ

What is the context window for Llama Guard 4 12B on NVIDIA NIM?

Llama Guard 4 12B supports a 164,000 token context window on NVIDIA NIM.

How does NVIDIA NIM compare to other Llama Guard 4 12B providers?

Llama Guard 4 12B is available from 3 providers. The cheapest input pricing is $0.18/1M tokens from OpenRouter.

What API model ID do I use for Llama Guard 4 12B on NVIDIA NIM?

Use the model ID meta/llama-guard-4-12b when calling NVIDIA NIM's API.

Who created Llama Guard 4 12B?

Llama Guard 4 12B was created by AI at Meta as part of the Llama Guard model family.

Is Llama Guard 4 12B open source?

Llama Guard 4 12B is open source according to the seed data.

Get Started

Model Specs

Released2025-04-05
Context164K
ArchitectureDecoder Only