LLM ReferenceLLM Reference

DeepSeek VL

4 models2024From $0.05/1M input

About

DeepSeek-VL is an advanced open-source family of vision-language models crafted for real-world applications, offering 1.3B and 7B parameter sizes with both "base" and "chat" variants. A standout feature is its hybrid vision encoder, which efficiently handles 1024 x 1024 high-resolution images, balancing performance with low computational needs. The models prioritize robust language abilities by integrating vision-language data strategically during training, preventing any compromise on language performance. With a vast pretraining dataset sourced from Common Crawl, web code, e-books, and educational content, DeepSeek-VL achieves competitive or state-of-the-art results across various benchmarks. These models aim to bridge the open-source and closed-source performance gap, enhancing both user experience and real-world applicability, and are available on platforms like Hugging Face for easy access.

Specifications(4 models)

DeepSeek VL model specifications comparison
ModelReleasedParametersVisionMultimodal
DeepSeek VL 7B2024-037BYesYes
DeepSeek VL 1.3B2024-031.3BYesYes
DeepSeek VL 7B Chat2024-037BYesYes
DeepSeek VL 1.3B Chat2024-031.3BYesYes

Available From(1 provider)

Pricing

DeepSeek VL model pricing by provider
ModelProviderInput / 1MOutput / 1MType
DeepSeek VL 7BReplicate API$0.05$0.25Serverless

Frequently Asked Questions

What is DeepSeek VL?
DeepSeek-VL is an advanced open-source family of vision-language models crafted for real-world applications, offering 1.3B and 7B parameter sizes with both "base" and "chat" variants. A standout feature is its hybrid vision encoder, which efficiently handles 1024 x 1024 high-resolution images, balancing performance with low computational needs. The models prioritize robust language abilities by integrating vision-language data strategically during training, preventing any compromise on language performance. With a vast pretraining dataset sourced from Common Crawl, web code, e-books, and educational content, DeepSeek-VL achieves competitive or state-of-the-art results across various benchmarks. These models aim to bridge the open-source and closed-source performance gap, enhancing both user experience and real-world applicability, and are available on platforms like Hugging Face for easy access.
How many models are in the DeepSeek VL family?
The DeepSeek VL family contains 4 models.
What is the latest DeepSeek VL model?
The latest model is DeepSeek VL 7B, released in 2024-03.
How much does DeepSeek VL cost?
DeepSeek VL models are available at $0.05/1M input tokens through providers like Replicate API.

Models(4)