Last refreshed 2026-04-24. Next refresh: weekly.
Why use DeepSeek R1 on DeepInfra?
DeepInfra offers DeepSeek R1 with competitive pricing. DeepInfra is a cloud inference platform offering cost-effective access to open-source AI models.
Compare DeepSeek R1 across 13 providers to find the best fit for your use caseInput / 1M
-
Output / 1M
-
Cache
Not sourced
Batch
Not sourced
Setup recipe
Python + curlInstall
pip install openaiAuth
export DEEPINFRA_API_KEY=...Call
import os
from openai import OpenAI
client = OpenAI(
api_key=os.environ["DEEPINFRA_API_KEY"],Model ID
deepseek-r1Request example
import os
from openai import OpenAI
client = OpenAI(
api_key=os.environ["DEEPINFRA_API_KEY"],
base_url="https://api.deepinfra.com/v1/openai"
)
response = client.chat.completions.create(
model="deepseek-r1",
messages=[{"role": "user", "content": "Hello"}]
)
print(response.choices[0].message.content)Gotchas
- DeepInfra uses "organization/model-name" format, e.g. "meta-llama/Meta-Llama-3-8B-Instruct" or "mistralai/Mistral-7B-Instruct-v0.3". See the DeepInfra model catalog for exact IDs.
- The examples expect DEEPINFRA_API_KEY; rename it only if your application config maps the new variable.
Compare DeepSeek R1 Across Providers
| Provider | Input (per 1M) | Output (per 1M) |
|---|---|---|
| DeepSeek Platform | $0.55 | $2.19 |
| OpenRouter | $0.70 | $2.50 |
| Together AI | $3.00 | $7.00 |
| Fireworks AI | $0.56 | $1.68 |
| NVIDIA NIM | — | — |
Capabilities
ReasoningStructured OutputsCode Execution
About DeepSeek R1
DeepSeek R1: Reasoning-optimized model with extended thinking capabilities. 128K context.
FAQ
What is the context window for DeepSeek R1 on DeepInfra?
DeepSeek R1 supports a 128,000 token context window on DeepInfra.
How does DeepInfra compare to other DeepSeek R1 providers?
DeepSeek R1 is available from 13 providers. The cheapest input pricing is $0.1/1M tokens from Bitdeer AI.
Who created DeepSeek R1?
DeepSeek R1 was created by DeepSeek as part of the DeepSeek R1 model family.
Is DeepSeek R1 open source?
DeepSeek R1 is open source according to the seed data.
Model Specs
Released2025-01-20
Parameters671B, 37B Active
Context128K
ArchitectureDecoder Only