Pricing
| Type | Price (USD per 1M tokens) |
|---|---|
| Input tokens | $0.90 |
| Output tokens | $0.90 |
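
As a rough illustration of how the per-token rates above turn into a per-request cost, here is a minimal sketch. It assumes billing is strictly linear in token count, with no minimum charge or rounding rule, which this page does not confirm. The function name `estimate_cost` is hypothetical and only used for this example.

```python
from decimal import Decimal

# Prices taken from the table above, in USD per 1M tokens.
INPUT_PRICE_PER_M = Decimal("0.90")
OUTPUT_PRICE_PER_M = Decimal("0.90")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request.

    Assumption: cost scales linearly with token count (no minimums,
    no rounding); check the provider's billing terms before relying on it.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Example: a 12,000-token prompt with a 1,500-token completion.
print(estimate_cost(12_000, 1_500))
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when summed over many calls.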
About DeepSeek Coder 33B
DeepSeek Coder 33B is a 33-billion-parameter large language model built for code generation and completion. It was pre-trained on 2 trillion tokens, predominantly code, and then fine-tuned on 2 billion tokens of instruction data. It supports more than 80 programming languages. The model generates code snippets, completes and infills code within a 16K-token context window, and follows detailed instructions. Its architecture uses Grouped-Query Attention, which shrinks the key-value cache and speeds up inference. A model of this size requires dedicated infrastructure to self-host; its license permits both open-source and commercial use.
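
The context window bounds the prompt and the completion together, so it can help to check the remaining budget before sending a request. The sketch below assumes that "16K" means 16,384 tokens and that prompt and completion share one window, as is usual for decoder-only models. The function name `max_completion_tokens` is hypothetical, and the prompt token count is assumed to come from the model's own tokenizer.

```python
# Assumption: "16K" is 16 * 1024 = 16,384 tokens; confirm against the
# deployed model's configuration.
CONTEXT_WINDOW = 16 * 1024


def max_completion_tokens(prompt_tokens: int,
                          context_window: int = CONTEXT_WINDOW) -> int:
    """Return how many tokens remain for the completion.

    Returns 0 when the prompt alone fills or exceeds the window, in which
    case the prompt would need to be shortened.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(0, context_window - prompt_tokens)


# Example: a 12,000-token prompt leaves room for a shorter completion.
remaining = max_completion_tokens(12_000)
print(f"Tokens available for the completion: {remaining}")
```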