LLM Reference

DeepSeek R1 Zero

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Specifications

Released2025-01-20
Parameters671B, 37B Active
Context128K
ArchitectureMixture of Experts
Specializationgeneral