Last refreshed 2026-05-01. Next refresh: weekly.
Why use DeepSeek R1 Distill Llama 8B on NVIDIA NIM?
NVIDIA NIM offers DeepSeek R1 Distill Llama 8B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Compare DeepSeek R1 Distill Llama 8B across 2 providers to find the best fit for your use caseInput / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced
Setup recipe
Docs fallbackInstall
Use the provider REST API or SDKAuth
Create a provider API keyCall
model: deepseek-ai/deepseek-r1-distill-llama-8bModel ID
deepseek-ai/deepseek-r1-distill-llama-8bRequest example
Curated snippets for this provider are not sourced yet. Use NVIDIA NIM documentation with model ID
deepseek-ai/deepseek-r1-distill-llama-8b.Gotchas
- Use provider model ID "deepseek-ai/deepseek-r1-distill-llama-8b", not the LLMReference slug "deepseek-r1-distill-llama-8b".
Compare DeepSeek R1 Distill Llama 8B Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| Fireworks AI | $0.20 | $0.20 |
| NVIDIA NIM | — | — |
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
Reasoning
About DeepSeek R1 Distill Llama 8B
Distilled DeepSeek R1 reasoning encoded into Llama 8B architecture.
FAQ
What is the context window for DeepSeek R1 Distill Llama 8B on NVIDIA NIM?
DeepSeek R1 Distill Llama 8B supports a 128,000 token context window on NVIDIA NIM.
How does NVIDIA NIM compare to other DeepSeek R1 Distill Llama 8B providers?
DeepSeek R1 Distill Llama 8B is available from 2 providers. The cheapest input pricing is $0.2/1M tokens from Fireworks AI.
What API model ID do I use for DeepSeek R1 Distill Llama 8B on NVIDIA NIM?
Use the model ID deepseek-ai/deepseek-r1-distill-llama-8b when calling NVIDIA NIM's API.
Who created DeepSeek R1 Distill Llama 8B?
DeepSeek R1 Distill Llama 8B was created by DeepSeek as part of the DeepSeek R1 model family.
Is DeepSeek R1 Distill Llama 8B open source?
DeepSeek R1 Distill Llama 8B is open source according to the seed data.
Model Specs
Released2025-01-20
Parameters8B
Context128K
ArchitectureDecoder Only