LLM Reference

DeepSeek Coder Models by DeepSeek

DeepSeekCoding
9 models2023–2024Up to 16k ctxFrom $0.1/1M input

About

The DeepSeek Coder family includes a range of open-source code language models specifically designed for handling large codebases. Trained on an expansive dataset of 2 trillion tokens, primarily composed of code (87%) and a mix of English and Chinese natural language data (13%), these models are available in sizes from 1.3 billion to 33 billion parameters. This range gives users the flexibility to choose models that align with their computational resources and specific needs. With pre-training on a high-quality project-level code corpus and using a 16K window size, the models excel in code generation and infill tasks. They demonstrate state-of-the-art performance on various open-source code benchmarks and often outperform some proprietary models. Released under a permissive license, DeepSeek Coder models support both research and commercial applications, offering significant capabilities to developers in coding projects 145.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

8 in view1 retired

Use when the workload needs code, 16k context, and 33B parameters.

2024-03code16k context33B parameters

Use when the workload needs code, 16k context, and 33B parameters.

2024-03code16k context33B parameters

Use when the workload needs code, 16k context, and 6.7B parameters.

2024-03code16k context6.7B parameters

Use when the workload needs 16k context, 33B parameters, and structured outputs.

2024-0316k context33B parametersstructured outputs

Use when the workload needs code, 16k context, and 7B parameters.

2024-02code16k context7B parameters

Use when the workload needs code, 16k context, and 7B parameters.

2024-02code16k context7B parameters

Use when the workload needs code, 4k context, and 1.3B parameters.

2023-11code4k context1.3B parameters

Use when the workload needs code, 16k context, and 1.3B parameters.

2023-11code16k context1.3B parameters

Release Timeline

3 release groups
2024-03
4 current · 1 retired
DeepSeek Coder 33B
code16k context33B parameters
Current
DeepSeek Coder 33B Instruct
code16k context33B parameters
Current
DeepSeek Coder 6.7B
code4k context6.7B parameters
Archived
DeepSeek Coder 6.7B Instruct
code16k context6.7B parameters
Current
Together AI Deepseek-Coder-33B-Instruct
16k context33B parametersstructured outputs
Current
2024-02
2 current
DeepSeek Coder 7B V1.5
code16k context7B parameters
Current
DeepSeek Coder 7B V1.5 Instruct
code16k context7B parameters
Current
2023-11
2 current
DeepSeek Coder 1.3B
code4k context1.3B parameters
Current
DeepSeek Coder 1.3B Instruct
code16k context1.3B parameters
Current

Specifications(9 models)

DeepSeek Coder model specifications comparison
ModelReleasedContextParametersStructured Outputs
DeepSeek Coder 33B2024-0316k33BYes
DeepSeek Coder 33B Instruct2024-0316k33BNo
DeepSeek Coder 6.7B Instruct2024-0316k6.7BNo
Together AI Deepseek-Coder-33B-Instruct2024-0316k33BYes
DeepSeek Coder 7B V1.52024-0216k7BNo
DeepSeek Coder 7B V1.5 Instruct2024-0216k7BNo
DeepSeek Coder 1.3B2023-114k1.3BNo
DeepSeek Coder 1.3B Instruct2023-1116k1.3BNo

Available From(4 providers)

Pricing

DeepSeek Coder model pricing by provider
ModelProviderInput / 1MOutput / 1MType
DeepSeek Coder 1.3BFireworks AI$0.1$0.1Provisioned
DeepSeek Coder 7B V1.5Fireworks AI$0.2$0.2Provisioned
Together AI Deepseek-Coder-33B-InstructTogether AI$0.3$0.3Serverless
DeepSeek Coder 33BTogether AI$0.8$0.8Serverless
DeepSeek Coder 33BFireworks AI$0.9$0.9Provisioned
DeepSeek Coder 33B InstructFireworks AI$0.9$0.9Serverless

Frequently Asked Questions

What is DeepSeek Coder used for?
DeepSeek Coder is used for coding, code, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does DeepSeek Coder compare to Janus?
DeepSeek Coder by DeepSeek is strongest where you need coding, while Janus by DeepSeek is the closest related family to check for image generation. DeepSeek Coder has 9 listed variants and reaches up to 16k context, so compare the specs and pricing tables before choosing a production model.
Which DeepSeek Coder model should I use?
For the lowest listed input price, start with DeepSeek Coder 1.3B through Fireworks AI at $0.1/1M input tokens. For the most capable/latest local choice, evaluate DeepSeek Coder 33B with 16k context and structured outputs.

Models(9)