DeepSeek R1 Distill Qwen 14B
Open Source
About
Distilled DeepSeek R1 with reasoning in Qwen 14B for mid-scale inference.
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution
Providers(2)
Compare all →| Provider | Input (per 1M) | Output (per 1M) | Type | |
|---|---|---|---|---|
| Fireworks AI | $0.2 | $0.2 | Serverless | |
| NVIDIA NIM | — | — | Serverless |
Specifications
FamilyDeepSeek R1
Released2025-01-20
Parameters14B
Context128K
ArchitectureDecoder Only
Specializationgeneral
Trainingmultistage
Fine-tuningtask_specific
Created by
Advancing artificial general intelligence (AGI).
Hangzhou, Zhejiang, China
Founded 2023
Website