DeepSeek R1 Distill Llama 8B on Fireworks AI

Name: DeepSeek R1 Distill Llama 8B on Fireworks AI
Brand: DeepSeek
SKU: deepseek-r1-distill-llama-8b-fireworks-ai
Price: 0.2 USD

DeepSeek R1 · DeepSeek

ServerlessOpen Source

Last refreshed 2026-05-19. Next refresh: weekly.

Why use DeepSeek R1 Distill Llama 8B on Fireworks AI?

Fireworks AI offers DeepSeek R1 Distill Llama 8B with pay-as-you-go pricing at $0.20/1M input tokens. Fireworks AI offers a generative AI platform as a service, focusing on rapid product iteration and cost-efficient AI deployment.

Compare DeepSeek R1 Distill Llama 8B across 2 providers to find the best fit for your use case

Input / 1M

$0.20

Output / 1M

$0.20

Cache

Not sourced

Batch

Not sourced

Setup recipe

Python + curl

Install

pip install openai

Auth

export FIREWORKS_API_KEY=...

Call

import os
from openai import OpenAI
client = OpenAI(
    api_key=os.environ["FIREWORKS_API_KEY"],

Model ID

deepseek-r1-distill-llama-8b

Request example

import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["FIREWORKS_API_KEY"],
    base_url="https://api.fireworks.ai/inference/v1"
)
response = client.chat.completions.create(
    model="deepseek-r1-distill-llama-8b",
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)

Gotchas

Fireworks model IDs use "accounts/fireworks/models/{model-name}" format, e.g. "accounts/fireworks/models/llama4-scout-instruct-basic" or "accounts/fireworks/models/deepseek-r1".
The examples expect FIREWORKS_API_KEY; rename it only if your application config maps the new variable.

Compare DeepSeek R1 Distill Llama 8B Across Providers

Provider	Input (per 1M)	Output (per 1M)
Fireworks AI	$0.20	$0.20
NVIDIA NIM	—	—

Pricing

Type	Price (per 1M)
Input tokens	$0.20
Output tokens	$0.20

Capabilities

Reasoning

About DeepSeek R1 Distill Llama 8B

DeepSeek R1 Distill Llama 8B is DeepSeek's DeepSeek R1 model with an optional reasoning mode. It offers a 128K-token context window with weights openly available for self-hosting.

FAQ

What does DeepSeek R1 Distill Llama 8B cost on Fireworks AI?

On Fireworks AI, DeepSeek R1 Distill Llama 8B costs $0.2 per 1M input tokens and $0.2 per 1M output tokens.

What is the context window for DeepSeek R1 Distill Llama 8B on Fireworks AI?

DeepSeek R1 Distill Llama 8B supports a 128k token context window on Fireworks AI.

How does Fireworks AI compare to other DeepSeek R1 Distill Llama 8B providers?

DeepSeek R1 Distill Llama 8B is available from 2 providers. The cheapest input pricing is $0.2/1M tokens from Fireworks AI.

Who created DeepSeek R1 Distill Llama 8B?

DeepSeek R1 Distill Llama 8B was created by DeepSeek as part of the DeepSeek R1 model family.

Is DeepSeek R1 Distill Llama 8B open source?

DeepSeek R1 Distill Llama 8B is open source under MIT according to the seed data.

Get Started

Model Card Docs Portal Pricing

Model Specs

Released2025-01-20

Parameters8B

Context128k

ArchitectureDecoder Only

Knowledge cutoff2023-12