What is DeepSeek R1 used for?

DeepSeek R1 is used for reasoning, structured outputs, and code execution. The family description and listed model capabilities point to those workloads as the best fit.

How does DeepSeek R1 compare to Janus?

DeepSeek R1 by DeepSeek is strongest where you need reasoning, while Janus by DeepSeek is the closest related family to check for image generation. DeepSeek R1 has 11 listed variants and reaches up to 160k context, so compare the specs and pricing tables before choosing a production model.

Which DeepSeek R1 model should I use?

For the lowest listed input price, start with DeepSeek R1 through Bitdeer AI at $0.1/1M input tokens. For the most capable/latest local choice, evaluate DeepSeek R1 0528 with 130k context and reasoning and structured outputs.

DeepSeek R1 Models by DeepSeek

DeepSeekMITOpen sourceReasoningHighlight

11 models2024–2025Up to 160k ctxFrom $0.1/1M input

Details

ResearcherDeepSeek

LicenseMITOSI-approved

Commercial useCommercial use: permitted

Models11

Released2024–2025

Max context160k

Capabilities

ReasoningAll models

Structured Outputs4 of 11 models

Code Execution2 of 11 models

Links

Website

About

DeepSeek R1 is a family of large language models designed specifically for advanced reasoning tasks by DeepSeek, a leading Chinese AI firm. The initial release in this model line, DeepSeek-R1-Lite-Preview, is tailored to excel in logical inference, mathematical reasoning, and real-time problem-solving. This model introduces a "chain-of-thought" reasoning capability, allowing users to track the model's reasoning steps in solving complex problems. Notably, it performs comparably to OpenAI's o1-preview model on certain benchmarks like AIME and MATH. However, at the time of writing, independent verification is pending, as there is no API access or full code release yet. DeepSeek aims to ultimately provide an open-source version of the R1 model along with an accessible API. Initial tests showcase impressive capabilities, although some challenges remain as the model occasionally encounters difficulties with specific logic problems 12348.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

11 in view

DeepSeek R1 0528Current

Use when the workload needs 130k context, reasoning, and structured outputs.

2025-05130k contextreasoningstructured outputs

DeepSeek R1Current

Use when the workload needs 128k context, reasoning, and structured outputs.

2025-01128k contextreasoningstructured outputs

DeepSeek R1 ZeroCurrent

Use when the workload needs 128k context and reasoning.

2025-01128k contextreasoning

DeepSeek R1 Distill Qwen-1.5BCurrent

Use when the workload needs 128k context, 1.5B parameters, and reasoning.

2025-01128k context1.5B parametersreasoning

DeepSeek R1 Distill Qwen-7BCurrent

Use when the workload needs 128k context, 7B parameters, and reasoning.

2025-01128k context7B parametersreasoning

DeepSeek R1 Distill Llama 8BCurrent

Use when the workload needs 128k context, 8B parameters, and reasoning.

2025-01128k context8B parametersreasoning

DeepSeek R1 Distill Qwen-14BCurrent

Use when the workload needs 128k context, 14B parameters, and reasoning.

2025-01128k context14B parametersreasoning

DeepSeek R1 Distill Qwen-32BCurrent

Use when the workload needs 128k context, 32B parameters, and reasoning.

2025-01128k context32B parametersreasoning

DeepSeek R1 Distill Llama 70BCurrent

Use when the workload needs 128k context, 70B parameters, and reasoning.

2025-01128k context70B parametersreasoning

DeepSeek R1 BasicCurrent

Use when the workload needs 160k context, 671B parameters, and reasoning.

2025-01160k context671B parametersreasoning

DeepSeek R1 LiteCurrent

Use when the workload needs 128k context and reasoning.

2024-11128k contextreasoning

Current DeepSeek R1 variants with use-when guidance and lifecycle status
Model	Use when	Released	Signals	Status
DeepSeek R1 0528	Use when the workload needs 130k context, reasoning, and structured outputs.	2025-05	130k contextreasoningstructured outputs	Current
DeepSeek R1	Use when the workload needs 128k context, reasoning, and structured outputs.	2025-01	128k contextreasoningstructured outputs	Current
DeepSeek R1 Zero	Use when the workload needs 128k context and reasoning.	2025-01	128k contextreasoning	Current
DeepSeek R1 Distill Qwen-1.5B	Use when the workload needs 128k context, 1.5B parameters, and reasoning.	2025-01	128k context1.5B parametersreasoning	Current
DeepSeek R1 Distill Qwen-7B	Use when the workload needs 128k context, 7B parameters, and reasoning.	2025-01	128k context7B parametersreasoning	Current
DeepSeek R1 Distill Llama 8B	Use when the workload needs 128k context, 8B parameters, and reasoning.	2025-01	128k context8B parametersreasoning	Current
DeepSeek R1 Distill Qwen-14B	Use when the workload needs 128k context, 14B parameters, and reasoning.	2025-01	128k context14B parametersreasoning	Current
DeepSeek R1 Distill Qwen-32B	Use when the workload needs 128k context, 32B parameters, and reasoning.	2025-01	128k context32B parametersreasoning	Current
DeepSeek R1 Distill Llama 70B	Use when the workload needs 128k context, 70B parameters, and reasoning.	2025-01	128k context70B parametersreasoning	Current
DeepSeek R1 Basic	Use when the workload needs 160k context, 671B parameters, and reasoning.	2025-01	160k context671B parametersreasoning	Current
DeepSeek R1 Lite	Use when the workload needs 128k context and reasoning.	2024-11	128k contextreasoning	Current

Release Timeline

3 release groups

2025-05

1 current

DeepSeek R1 0528

130k contextreasoningstructured outputs

Current

2025-01

9 current

DeepSeek R1

128k contextreasoningstructured outputs

Current

DeepSeek R1 Basic

160k context671B parametersreasoning

Current

DeepSeek R1 Distill Llama 70B

128k context70B parametersreasoning

Current

DeepSeek R1 Distill Llama 8B

128k context8B parametersreasoning

Current

DeepSeek R1 Distill Qwen-1.5B

128k context1.5B parametersreasoning

Current

DeepSeek R1 Distill Qwen-14B

128k context14B parametersreasoning

Current

DeepSeek R1 Distill Qwen-32B

128k context32B parametersreasoning

Current

DeepSeek R1 Distill Qwen-7B

128k context7B parametersreasoning

Current

DeepSeek R1 Zero

128k contextreasoning

Current

2024-11

1 current

DeepSeek R1 Lite

128k contextreasoning

Current

Specifications(11 models)

DeepSeek R1 model specifications comparison
Model	Released	Context	Parameters	Reasoning	Structured Outputs	Code Exec
DeepSeek R1 0528	2025-05	130k	685B total, 37B active (MoE)	Yes	Yes	Yes
DeepSeek R1	2025-01	128k	671B, 37B Active	Yes	Yes	Yes
DeepSeek R1 Zero	2025-01	128k	671B, 37B Active	Yes	No	No
DeepSeek R1 Distill Qwen-1.5B	2025-01	128k	1.5B	Yes	No	No
DeepSeek R1 Distill Qwen-7B	2025-01	128k	7B	Yes	No	No
DeepSeek R1 Distill Llama 8B	2025-01	128k	8B	Yes	No	No
DeepSeek R1 Distill Qwen-14B	2025-01	128k	14B	Yes	No	No
DeepSeek R1 Distill Qwen-32B	2025-01	128k	32B	Yes	Yes	No
DeepSeek R1 Distill Llama 70B	2025-01	128k	70B	Yes	Yes	No
DeepSeek R1 Basic	2025-01	160k	671B	Yes	No	No
DeepSeek R1 Lite	2024-11	128k	—	Yes	No	No

Available From(17 providers)

Arcee AI

AWS Bedrock

Bitdeer AI

Cloudflare Workers AI

Databricks Foundation Model Serving

DeepInfra

DeepSeek Platform

Fireworks AI +9 more

Pricing

DeepSeek R1 model pricing by provider
Model	Provider	Input / 1M	Output / 1M	Type
DeepSeek R1 Distill Qwen-1.5B	Fireworks AI	$0.1	$0.1	Serverless
DeepSeek R1	Bitdeer AI	$0.1	$0.3	Serverless
DeepSeek R1 Distill Qwen-14B	Novita AI	$0.15	$0.15	Serverless
DeepSeek R1 Distill Llama 8B	Fireworks AI	$0.2	$0.2	Serverless
DeepSeek R1 Distill Qwen-14B	Fireworks AI	$0.2	$0.2	Serverless
DeepSeek R1 Distill Qwen-7B	Fireworks AI	$0.2	$0.2	Serverless
DeepSeek R1	SiliconFlow	$0.25	$0.8	Serverless
DeepSeek R1 Distill Qwen-32B	OpenRouter	$0.29	$0.29	Serverless
DeepSeek R1 Distill Qwen-32B	Novita AI	$0.3	$0.3	Serverless
DeepSeek R1 Distill Llama 70B	Arcee AI	$0.35	$1.05	Serverless
DeepSeek R1 Distill Qwen-32B	Cloudflare Workers AI	$0.497	$4.881	Serverless
DeepSeek R1 0528	OpenRouter	$0.5	$2.15	Serverless
DeepSeek R1 0528	DeepInfra	$0.5	$2.15	Serverless
DeepSeek R1	DeepSeek Platform	$0.55	$2.19	Serverless
DeepSeek R1	Fireworks AI	$0.56	$1.68	Serverless
DeepSeek R1 0528	Fireworks AI	$0.56	$1.68	Serverless
DeepSeek R1 Basic	Fireworks AI	$0.56	$1.68	Serverless
DeepSeek R1 Distill Llama 70B	DeepInfra	$0.7	$0.8	Serverless
DeepSeek R1	OpenRouter	$0.7	$2.5	Serverless
DeepSeek R1 Distill Llama 70B	OpenRouter	$0.7	$0.8	Serverless
DeepSeek R1 0528	Novita AI	$0.7	$2.5	Serverless
DeepSeek R1 Distill Llama 70B	Novita AI	$0.8	$0.8	Serverless
DeepSeek R1 Distill Llama 70B	Fireworks AI	$0.9	$0.9	Serverless
DeepSeek R1 Distill Qwen-32B	Fireworks AI	$0.9	$0.9	Serverless
DeepSeek R1	AWS Bedrock	$1.35	$5.4	Serverless
DeepSeek R1 0528	GCP Vertex AI	$1.35	$5.4	Serverless
DeepSeek R1	GCP Vertex AI	$1.35	$5.4	Serverless
DeepSeek R1	Vercel AI Gateway	$1.35	$5.4	Serverless
DeepSeek R1	Together AI	$3	$7	Serverless
DeepSeek R1 0528	Together AI	$3	$7	Serverless
DeepSeek R1	Replicate API	$3.75	$10	Serverless

Popular comparisons in this family

Comparisons

All comparisons →

Frequently Asked Questions

What is DeepSeek R1 used for?: DeepSeek R1 is used for reasoning, structured outputs, and code execution. The family description and listed model capabilities point to those workloads as the best fit.
How does DeepSeek R1 compare to Janus?: DeepSeek R1 by DeepSeek is strongest where you need reasoning, while Janus by DeepSeek is the closest related family to check for image generation. DeepSeek R1 has 11 listed variants and reaches up to 160k context, so compare the specs and pricing tables before choosing a production model.
Which DeepSeek R1 model should I use?: For the lowest listed input price, start with DeepSeek R1 through Bitdeer AI at $0.1/1M input tokens. For the most capable/latest local choice, evaluate DeepSeek R1 0528 with 130k context and reasoning and structured outputs.

Models(11)

DeepSeek R1 0528

2025-05130k685B total, 37B active (MoE)7 providers

ReasoningOpen Source

DeepSeek R1

2025-01128k671B, 37B Active14 providers

ReasoningOpen Source

DeepSeek R1 Zero

2025-01128k671B, 37B Active

ReasoningOpen Source

DeepSeek R1 Distill Qwen-1.5B

2025-01128k1.5B1 provider

ReasoningOpen Source

DeepSeek R1 Distill Qwen-7B

2025-01128k7B2 providers

ReasoningOpen Source

DeepSeek R1 Distill Llama 8B

2025-01128k8B2 providers

ReasoningOpen Source

DeepSeek R1 Distill Qwen-14B

2025-01128k14B3 providers

ReasoningOpen Source

DeepSeek R1 Distill Qwen-32B

2025-01128k32B5 providers

ReasoningOpen Source

DeepSeek R1 Distill Llama 70B

2025-01128k70B5 providers

ReasoningOpen Source

DeepSeek R1 Basic

2025-01160k671B1 provider

ReasoningOpen Source

DeepSeek R1 Lite

2024-11128k

ReasoningOpen Source

DeepSeek R1 Models by DeepSeek

Details

Capabilities

Links

About

Current Variants

Release Timeline

Specifications(11 models)

Available From(17 providers)

Pricing

Popular comparisons in this family

Comparisons

Frequently Asked Questions

Related Model Families

Models(11)