Last refreshed 2026-05-19. Next refresh: weekly.
Why use DeepSeek R1 Distill Llama 8B on NVIDIA NIM?
NVIDIA NIM offers DeepSeek R1 Distill Llama 8B with competitive pricing. NVIDIA NIM is NVIDIA's deployment platform for GPU-accelerated inference microservices.
Compare DeepSeek R1 Distill Llama 8B across 2 providers to find the best fit for your use caseSetup recipe
Docs fallbackUse the provider REST API or SDKCreate a provider API keymodel: deepseek-ai/deepseek-r1-distill-llama-8bdeepseek-ai/deepseek-r1-distill-llama-8bRequest example
deepseek-ai/deepseek-r1-distill-llama-8b.Gotchas
- Use provider model ID "deepseek-ai/deepseek-r1-distill-llama-8b", not the LLMReference slug "deepseek-r1-distill-llama-8b".
Compare DeepSeek R1 Distill Llama 8B Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| Fireworks AI | $0.20 | $0.20 |
| NVIDIA NIM | — | — |
Pricing
| Type | Rate |
|---|---|
| GPU Hour Rate | $1.00/GPU·hr |
| GPU Config | 1xH100 |
Capabilities
About DeepSeek R1 Distill Llama 8B
DeepSeek R1 Distill Llama 8B is DeepSeek's DeepSeek R1 model with an optional reasoning mode. It offers a 128K-token context window with weights openly available for self-hosting.
FAQ
What is the context window for DeepSeek R1 Distill Llama 8B on NVIDIA NIM?
DeepSeek R1 Distill Llama 8B supports a 128k token context window on NVIDIA NIM.
How does NVIDIA NIM compare to other DeepSeek R1 Distill Llama 8B providers?
DeepSeek R1 Distill Llama 8B is available from 2 providers. The cheapest input pricing is $0.2/1M tokens from Fireworks AI.
What API model ID do I use for DeepSeek R1 Distill Llama 8B on NVIDIA NIM?
Use the model ID deepseek-ai/deepseek-r1-distill-llama-8b when calling NVIDIA NIM's API.
Who created DeepSeek R1 Distill Llama 8B?
DeepSeek R1 Distill Llama 8B was created by DeepSeek as part of the DeepSeek R1 model family.
Is DeepSeek R1 Distill Llama 8B open source?
DeepSeek R1 Distill Llama 8B is open source under MIT according to the seed data.