LLM Reference

DeepSeek R1 Zero

Open Source

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution

Specifications

Released: 2025-01-20
Parameters: 671B total, 37B active
Context: 128K tokens
Architecture: Mixture of Experts
Specialization: General
Training: Multistage
Fine-tuning: Task-specific

Created by

Advancing artificial general intelligence (AGI).

Hangzhou, Zhejiang, China
Founded 2023