LLM Reference

DeepSeek V2

deepseek-v2

Open Source

About

DeepSeek-V2 is a Mixture-of-Experts (MoE) language model with 236B total parameters, of which 21B are active per token. It supports a 128K-token context window.
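
To put the total and active parameter counts in proportion, the short sketch below computes the share of parameters active per token. The function name `active_fraction` is a hypothetical helper introduced for illustration; the two constants are the figures quoted above.

```python
# Figures quoted in the About text, in billions of parameters.
TOTAL_PARAMS_B = 236
ACTIVE_PARAMS_B = 21


def active_fraction(active_b: float, total_b: float) -> float:
    """Hypothetical helper: share of total parameters that are active."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected 0 <= active_b <= total_b and total_b > 0")
    return active_b / total_b


# Roughly 9% of the parameters are active at a time.
print(f"{active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B):.1%}")  # prints 8.9%
```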

DeepSeek V2 input tokens are priced at $0.14 per 1M tokens and output tokens at $0.28 per 1M tokens.
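
As a rough illustration of how the per-1M prices translate into a per-request cost, here is a minimal sketch. It assumes billing is strictly linear in token count with no minimums, caching discounts, or rounding rules, which this page does not state. The helper name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens, taken from the figures above.
INPUT_USD_PER_1M = Decimal("0.14")
OUTPUT_USD_PER_1M = Decimal("0.28")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: linear cost estimate from the listed prices.

    Assumes cost scales linearly with token count (an assumption, not a
    documented billing rule).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (Decimal(input_tokens) * INPUT_USD_PER_1M
             + Decimal(output_tokens) * OUTPUT_USD_PER_1M)
    return total / TOKENS_PER_UNIT


# Example: a 100,000-token prompt with a 2,000-token reply.
print(estimate_cost_usd(100_000, 2_000))
```

Decimal arithmetic is used here rather than floats so that small per-request amounts do not pick up binary rounding error when summed over many requests.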

Capabilities

Vision, Multimodal, Reasoning, Function Calling, Tool Use, Structured Outputs, Code Execution

Providers (1)

Provider           Input (per 1M)  Output (per 1M)  Type
DeepSeek Platform  $0.14           $0.28            Serverless

Specifications

Released: 2024-05-06
Parameters: 236B
Context: 128K
Architecture: Mixture of Experts
Specialization: general
Training: finetuned
Fine-tuning: base

Created by

Advancing artificial general intelligence (AGI).

Hangzhou, Zhejiang, China
Founded 2023