DeepSeek
About
The DeepSeek LLM family includes open-source large language models designed for exceptional language comprehension and diverse applications 410. These models shine in reasoning, coding, mathematics, and Chinese comprehension, often surpassing similar models in benchmarks 410. The lineup features base and chat models with parameter sizes of 7 billion and 67 billion, respectively 410. They are trained with a massive dataset of 2 trillion tokens in English and Chinese 410, and the architecture, based on the Llama model, enhances inference efficiency through Grouped-Query Attention in the 67B model 1. Available for research and commercial use, additional models like DeepSeek-Coder and DeepSeek-VL cater to code generation and vision-language tasks, respectively 89.
Specifications(9 models)
| Model | Released | Context | Parameters | Structured Outputs | Code Exec |
|---|---|---|---|---|---|
| Together DeepSeek-V3.1 | 2026-01 | 200k | 671B | No | No |
| DeepSeek V3.2 Speciale | 2025-04 | 164K | — | Yes | No |
| DeepSeek V3.2 Exp | 2025-04 | 164K | — | Yes | Yes |
| DeepSeek V3.1 Terminus | 2025-04 | 164K | — | Yes | No |
| Together AI Deepseek-LLM-67B-Chat | 2024-01 | 4K | 67B | Yes | No |
| DeepSeek 67B Chat | 2023-11 | — | 67B | Yes | No |
| DeepSeek 7B Chat | 2023-11 | — | 7B | No | No |
| DeepSeek 67B | 2023-11 | 4K | 67B | No | No |
| DeepSeek 7B | 2023-11 | 4K | 7B | No | No |
Available From(4 providers)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| DeepSeek V3.1 Terminus | OpenRouter | $0.21 | $0.79 | Serverless |
| DeepSeek V3.2 Exp | OpenRouter | $0.27 | $0.41 | Serverless |
| DeepSeek V3.2 Speciale | DeepSeek Platform | $0.28 | $0.42 | Serverless |
| DeepSeek V3.2 Exp | DeepSeek Platform | $0.28 | $0.42 | Serverless |
| DeepSeek V3.2 Speciale | OpenRouter | $0.4 | $1.2 | Serverless |
| Together AI Deepseek-LLM-67B-Chat | Together AI | $0.6 | $0.6 | Serverless |
| DeepSeek 67B Chat | Together AI | $0.9 | $0.9 | Serverless |
Frequently Asked Questions
- What is DeepSeek?
- The DeepSeek LLM family includes open-source large language models designed for exceptional language comprehension and diverse applications 410. These models shine in reasoning, coding, mathematics, and Chinese comprehension, often surpassing similar models in benchmarks 410. The lineup features base and chat models with parameter sizes of 7 billion and 67 billion, respectively 410. They are trained with a massive dataset of 2 trillion tokens in English and Chinese 410, and the architecture, based on the Llama model, enhances inference efficiency through Grouped-Query Attention in the 67B model 1. Available for research and commercial use, additional models like DeepSeek-Coder and DeepSeek-VL cater to code generation and vision-language tasks, respectively 89.
- How many models are in the DeepSeek family?
- The DeepSeek family contains 9 models.
- What is the latest DeepSeek model?
- The latest model is Together DeepSeek-V3.1, released in 2026-01.
- How much does DeepSeek cost?
- DeepSeek models range from $0.21/1M to $0.9/1M input tokens depending on the model and provider.
Models(9)
Together DeepSeek-V3.1
DeepSeek V3.2 Speciale
DeepSeek V3.2 Exp
DeepSeek V3.1 Terminus
Together AI Deepseek-LLM-67B-Chat
DeepSeek 67B Chat
DeepSeek 7B Chat
DeepSeek 67B
DeepSeek 7B






