DeepSeek Coder V2 Models by DeepSeek
Details
About
DeepSeek Coder V2 is an open-source family of Mixture-of-Experts (MoE) code language models crafted specifically for code-related tasks. It builds on the advancements of the DeepSeek V2 model, featuring notable improvements in code-specific tasks and reasoning capabilities. These models were trained on an additional 6 trillion tokens, enhancing their skills in coding and mathematical reasoning while maintaining strong general language performance. Key enhancements include support for over 338 programming languages, a significant jump from the 86 supported in earlier iterations, and an increased context length of up to 128K tokens. The family offers a variety of models such as smaller "Lite" versions for projects with limited computational resources, and larger models for complex tasks. Available on Hugging Face, these models can be accessed through their API or chat interface, making them easily deployable across diverse coding environments 123.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
Use when the workload needs code, 128k context, and 236B parameters.
Use when the workload needs code, 128k context, and 236B parameters.
Use when the workload needs code, 128k context, and 16B parameters.
Use when the workload needs code, 128k context, and 236B parameters.
Use when the workload needs code, 128k context, and 16B parameters.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| DeepSeek Coder V2 Instruct (0724) | Use when the workload needs code, 128k context, and 236B parameters. | 2024-07 | code128k context236B parameters | Current |
| DeepSeek Coder V2 | Use when the workload needs code, 128k context, and 236B parameters. | 2024-06 | code128k context236B parameters | Current |
| DeepSeek Coder V2 Lite | Use when the workload needs code, 128k context, and 16B parameters. | 2024-06 | code128k context16B parameters | Current |
| DeepSeek Coder V2 Instruct | Use when the workload needs code, 128k context, and 236B parameters. | 2024-06 | code128k context236B parameters | Current |
| DeepSeek Coder V2 Lite Instruct | Use when the workload needs code, 128k context, and 16B parameters. | 2024-06 | code128k context16B parameters | Current |
Release Timeline
2 release groupsSpecifications(5 models)
| Model | Released | Context | Parameters |
|---|---|---|---|
| DeepSeek Coder V2 Instruct (0724) | 2024-07 | 128k | 236B |
| DeepSeek Coder V2 | 2024-06 | 128k | 236B |
| DeepSeek Coder V2 Lite | 2024-06 | 128k | 16B |
| DeepSeek Coder V2 Instruct | 2024-06 | 128k | 236B |
| DeepSeek Coder V2 Lite Instruct | 2024-06 | 128k | 16B |
Available From(3 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| DeepSeek Coder V2 Lite Instruct | Novita AI | $0.12 | $0.36 | Serverless |
| DeepSeek Coder V2 | DeepSeek Platform | $0.14 | $0.28 | Serverless |
| DeepSeek Coder V2 Lite Instruct | Fireworks AI | $0.2 | $0.2 | Serverless |
| DeepSeek Coder V2 Lite | Fireworks AI | $0.5 | $0.5 | Serverless |
| DeepSeek Coder V2 | Fireworks AI | $1.2 | $1.2 | Serverless |
| DeepSeek Coder V2 Instruct | Fireworks AI | $1.2 | $1.2 | Serverless |
Frequently Asked Questions
- What is DeepSeek Coder V2 used for?
- DeepSeek Coder V2 is used for coding, code, and math-heavy prompts. The family description and listed model capabilities point to those workloads as the best fit.
- How does DeepSeek Coder V2 compare to Janus?
- DeepSeek Coder V2 by DeepSeek is strongest where you need coding, while Janus by DeepSeek is the closest related family to check for image generation. DeepSeek Coder V2 has 5 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
- Which DeepSeek Coder V2 model should I use?
- For the lowest listed input price, start with DeepSeek Coder V2 Lite Instruct through Novita AI at $0.12/1M input tokens. For the most capable/latest local choice, evaluate DeepSeek Coder V2 Instruct (0724) with 128k context.
Models(5)
DeepSeek Coder V2 Instruct (0724)
DeepSeek Coder V2
DeepSeek Coder V2 Lite
DeepSeek Coder V2 Instruct
DeepSeek Coder V2 Lite Instruct





