DeepSeek VL
About
DeepSeek-VL is an open-source family of vision-language models built for real-world applications. It comes in 1.3B and 7B parameter sizes, each with a "base" and a "chat" variant. Its hybrid vision encoder processes high-resolution 1024 x 1024 images while keeping computational cost relatively low. Vision-language data is integrated strategically during training, with the aim of preserving the models' language performance. The pretraining dataset draws on Common Crawl, web code, e-books, and educational content, and DeepSeek-VL reports competitive or state-of-the-art results across various benchmarks. The models aim to narrow the performance gap between open-source and closed-source systems and are available on platforms such as Hugging Face.
Specifications (4 models)
| Model | Released | Parameters | Vision | Multimodal |
|---|---|---|---|---|
| DeepSeek VL 7B | 2024-03 | 7B | Yes | Yes |
| DeepSeek VL 1.3B | 2024-03 | 1.3B | Yes | Yes |
| DeepSeek VL 7B Chat | 2024-03 | 7B | Yes | Yes |
| DeepSeek VL 1.3B Chat | 2024-03 | 1.3B | Yes | Yes |
Available From (1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| DeepSeek VL 7B | Replicate API | $0.05 | $0.25 | Serverless |
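The per-token prices in the table can be turned into a rough cost estimate for a given workload. The sketch below is illustrative only: the prices are copied from the table, while the function name `estimate_cost` and the example token counts are hypothetical. How image inputs are counted as tokens is not stated here, so treat the result as an approximation and check the provider's billing details.

```python
from decimal import Decimal

# Prices copied from the pricing table (USD per 1M tokens, DeepSeek VL 7B).
INPUT_PRICE_PER_M = Decimal("0.05")
OUTPUT_PRICE_PER_M = Decimal("0.25")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return an estimated cost in USD for the given token counts.

    Hypothetical helper: assumes billing is strictly linear in tokens.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Hypothetical workload: 2M input tokens and 500K output tokens.
cost = estimate_cost(2_000_000, 500_000)
print(f"Estimated cost: ${cost:.4f}")
```

`Decimal` is used instead of `float` so that small per-token amounts add up without binary rounding error.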
Frequently Asked Questions
- What is DeepSeek VL?
- DeepSeek-VL is an open-source family of vision-language models offered in 1.3B and 7B parameter sizes, each with a "base" and a "chat" variant. Its hybrid vision encoder handles 1024 x 1024 high-resolution images at relatively low computational cost, and the models are available on platforms such as Hugging Face.
- How many models are in the DeepSeek VL family?
- The DeepSeek VL family contains 4 models: DeepSeek VL 7B, DeepSeek VL 1.3B, DeepSeek VL 7B Chat, and DeepSeek VL 1.3B Chat.
- What is the latest DeepSeek VL model?
- All four models are listed with the same release date, 2024-03. DeepSeek VL 7B is shown as the latest.
- How much does DeepSeek VL cost?
- DeepSeek VL 7B is listed at $0.05 per 1M input tokens and $0.25 per 1M output tokens through Replicate API (serverless), the only provider with listed pricing.






