DeepSeek Coder V2
About
DeepSeek Coder V2 is an open-source family of Mixture-of-Experts (MoE) code language models crafted specifically for code-related tasks. It builds on the advancements of the DeepSeek V2 model, featuring notable improvements in code-specific tasks and reasoning capabilities. These models were trained on an additional 6 trillion tokens, enhancing their skills in coding and mathematical reasoning while maintaining strong general language performance. Key enhancements include support for over 338 programming languages, a significant jump from the 86 supported in earlier iterations, and an increased context length of up to 128K tokens. The family offers a variety of models such as smaller "Lite" versions for projects with limited computational resources, and larger models for complex tasks. Available on Hugging Face, these models can be accessed through their API or chat interface, making them easily deployable across diverse coding environments 123.
Specifications(5 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| DeepSeek Coder V2 Instruct (0724) | 2024-07 | 128K | 236B |
| DeepSeek Coder V2 | 2024-06 | 128K | 236B |
| DeepSeek Coder V2 Lite | 2024-06 | 128K | 16B |
| DeepSeek Coder V2 Instruct | 2024-06 | 128K | 236B |
| DeepSeek Coder V2 Lite Instruct | 2024-06 | 128K | 16B |
Available From(3 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| DeepSeek Coder V2 Lite Instruct | Novita AI | $0.12 | $0.36 | Serverless |
| DeepSeek Coder V2 | DeepSeek Platform | $0.14 | $0.28 | Serverless |
| DeepSeek Coder V2 Lite Instruct | Fireworks AI | $0.2 | $0.2 | Serverless |
| DeepSeek Coder V2 Lite | Fireworks AI | $0.5 | $0.5 | Serverless |
| DeepSeek Coder V2 | Fireworks AI | $1.2 | $1.2 | Serverless |
| DeepSeek Coder V2 Instruct | Fireworks AI | $1.2 | $1.2 | Serverless |
Frequently Asked Questions
- What is DeepSeek Coder V2?
- DeepSeek Coder V2 is an open-source family of Mixture-of-Experts (MoE) code language models crafted specifically for code-related tasks. It builds on the advancements of the DeepSeek V2 model, featuring notable improvements in code-specific tasks and reasoning capabilities. These models were trained on an additional 6 trillion tokens, enhancing their skills in coding and mathematical reasoning while maintaining strong general language performance. Key enhancements include support for over 338 programming languages, a significant jump from the 86 supported in earlier iterations, and an increased context length of up to 128K tokens. The family offers a variety of models such as smaller "Lite" versions for projects with limited computational resources, and larger models for complex tasks. Available on Hugging Face, these models can be accessed through their API or chat interface, making them easily deployable across diverse coding environments 123.
- How many models are in the DeepSeek Coder V2 family?
- The DeepSeek Coder V2 family contains 5 models.
- What is the latest DeepSeek Coder V2 model?
- The latest model is DeepSeek Coder V2 Instruct (0724), released in 2024-07.
- How much does DeepSeek Coder V2 cost?
- DeepSeek Coder V2 models range from $0.12/1M to $1.2/1M input tokens depending on the model and provider.






