LLM Reference

DeepSeek Coder V2 Models by DeepSeek

DeepSeek | DeepSeek License | Open weights | Coding
5 models | 2024 | Up to 128k context | From $0.12/1M input tokens

Details

Researcher: DeepSeek
Commercial use: Allowed
Models: 5
Released: 2024
Max context: 128k

About

DeepSeek Coder V2 is an open-source family of Mixture-of-Experts (MoE) code language models built for code-related tasks. It builds on DeepSeek V2, with notable improvements in coding and reasoning. The models were further trained on an additional 6 trillion tokens, which strengthens their coding and mathematical reasoning while maintaining strong general language performance. Key enhancements include support for 338 programming languages, up from 86 in the earlier iteration, and a context length of up to 128K tokens. The family includes smaller "Lite" versions for projects with limited computational resources and larger models for complex tasks. The weights are available on Hugging Face, and the models can also be accessed through DeepSeek's API or chat interface.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

DeepSeek Coder V2 Instruct (0724)
Use when the workload needs code, 128k context, and 236B parameters.
2024-07 | code | 128k context | 236B parameters

DeepSeek Coder V2
Use when the workload needs code, 128k context, and 236B parameters.
2024-06 | code | 128k context | 236B parameters

DeepSeek Coder V2 Lite
Use when the workload needs code, 128k context, and 16B parameters.
2024-06 | code | 128k context | 16B parameters

DeepSeek Coder V2 Instruct
Use when the workload needs code, 128k context, and 236B parameters.
2024-06 | code | 128k context | 236B parameters

DeepSeek Coder V2 Lite Instruct
Use when the workload needs code, 128k context, and 16B parameters.
2024-06 | code | 128k context | 16B parameters

Release Timeline

2 release groups

2024-07 (1 current)
DeepSeek Coder V2 Instruct (0724) | code | 128k context | 236B parameters | Current

2024-06 (4 current)
DeepSeek Coder V2 | code | 128k context | 236B parameters | Current
DeepSeek Coder V2 Instruct | code | 128k context | 236B parameters | Current
DeepSeek Coder V2 Lite | code | 128k context | 16B parameters | Current
DeepSeek Coder V2 Lite Instruct | code | 128k context | 16B parameters | Current

Specifications (5 models)

DeepSeek Coder V2 model specifications comparison
Model | Released | Context | Parameters
DeepSeek Coder V2 Instruct (0724) | 2024-07 | 128k | 236B
DeepSeek Coder V2 | 2024-06 | 128k | 236B
DeepSeek Coder V2 Lite | 2024-06 | 128k | 16B
DeepSeek Coder V2 Instruct | 2024-06 | 128k | 236B
DeepSeek Coder V2 Lite Instruct | 2024-06 | 128k | 16B

Available From (3 providers)

Pricing

DeepSeek Coder V2 model pricing by provider
Model | Provider | Input / 1M tokens | Output / 1M tokens | Type
DeepSeek Coder V2 Lite Instruct | Novita AI | $0.12 | $0.36 | Serverless
DeepSeek Coder V2 | DeepSeek Platform | $0.14 | $0.28 | Serverless
DeepSeek Coder V2 Lite Instruct | Fireworks AI | $0.20 | $0.20 | Serverless
DeepSeek Coder V2 Lite | Fireworks AI | $0.50 | $0.50 | Serverless
DeepSeek Coder V2 | Fireworks AI | $1.20 | $1.20 | Serverless
DeepSeek Coder V2 Instruct | Fireworks AI | $1.20 | $1.20 | Serverless
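
Per-million-token prices are easier to compare once they are applied to a concrete workload. The following is a minimal sketch, assuming the listed prices are in USD and apply linearly to token counts, with no minimums, caching discounts, or tiering. The `PRICING` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration; they are not part of any provider's API.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the pricing table.
PRICING = {
    ("DeepSeek Coder V2 Lite Instruct", "Novita AI"): (Decimal("0.12"), Decimal("0.36")),
    ("DeepSeek Coder V2", "DeepSeek Platform"): (Decimal("0.14"), Decimal("0.28")),
    ("DeepSeek Coder V2 Lite Instruct", "Fireworks AI"): (Decimal("0.20"), Decimal("0.20")),
    ("DeepSeek Coder V2 Lite", "Fireworks AI"): (Decimal("0.50"), Decimal("0.50")),
    ("DeepSeek Coder V2", "Fireworks AI"): (Decimal("1.20"), Decimal("1.20")),
    ("DeepSeek Coder V2 Instruct", "Fireworks AI"): (Decimal("1.20"), Decimal("1.20")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model, provider, input_tokens, output_tokens):
    """Return the estimated USD cost of a workload (hypothetical helper).

    Assumes simple linear pricing: tokens / 1M * listed price.
    """
    input_price, output_price = PRICING[(model, provider)]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # 2M input tokens and 500k output tokens on the cheapest listed input price.
    cost = estimate_cost("DeepSeek Coder V2 Lite Instruct", "Novita AI", 2_000_000, 500_000)
    print(f"${cost:.2f}")  # prints $0.42
```

`Decimal` is used instead of `float` so that small per-token prices do not pick up binary rounding error when summed across many requests.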

Frequently Asked Questions

What is DeepSeek Coder V2 used for?
DeepSeek Coder V2 is used for coding and math-heavy prompts. The family description and the listed model capabilities both point to those workloads as the best fit.
How does DeepSeek Coder V2 compare to Janus?
DeepSeek Coder V2 is DeepSeek's family for coding workloads, while Janus, also by DeepSeek, is the closest related family to consider for image generation. DeepSeek Coder V2 has 5 listed variants and reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which DeepSeek Coder V2 model should I use?
For the lowest listed input price, start with DeepSeek Coder V2 Lite Instruct through Novita AI at $0.12/1M input tokens. For the most capable and most recent variant, for example for self-hosted use, evaluate DeepSeek Coder V2 Instruct (0724), a 236B-parameter model with 128k context.
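
The selection logic in this answer can be sketched as a filter over the specifications table. This is an illustrative sketch only: the `Variant` record and the `shortlist` helper are hypothetical names, the data is copied from the specifications table, and the "newest release first" ordering is an assumption about what "latest" should mean here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    name: str
    released: str    # "YYYY-MM", as listed in the specifications table
    context_k: int   # context window, in thousands of tokens
    params_b: int    # parameter count, in billions


# Rows copied from the specifications table.
VARIANTS = [
    Variant("DeepSeek Coder V2 Instruct (0724)", "2024-07", 128, 236),
    Variant("DeepSeek Coder V2", "2024-06", 128, 236),
    Variant("DeepSeek Coder V2 Lite", "2024-06", 128, 16),
    Variant("DeepSeek Coder V2 Instruct", "2024-06", 128, 236),
    Variant("DeepSeek Coder V2 Lite Instruct", "2024-06", 128, 16),
]


def shortlist(max_params_b=None, instruct_only=False):
    """Return matching variants, newest release first (hypothetical helper)."""
    matches = [
        v for v in VARIANTS
        if (max_params_b is None or v.params_b <= max_params_b)
        and (not instruct_only or "Instruct" in v.name)
    ]
    # "YYYY-MM" strings sort chronologically, so a string sort is sufficient.
    return sorted(matches, key=lambda v: v.released, reverse=True)


if __name__ == "__main__":
    # Newest instruction-tuned variant, with no size limit.
    print(shortlist(instruct_only=True)[0].name)
    # Instruction-tuned variants that fit a 16B-parameter budget.
    print([v.name for v in shortlist(max_params_b=16, instruct_only=True)])
```

A real selection would also weigh provider availability and price, which the specifications table alone does not capture.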