LLM ReferenceLLM Reference

Qwen1.5

AlibabaQianwenOpen SourceHighlight
11 models2024Up to 128K ctxFrom $0.05/1M input

About

The Qwen1.5 family is an advanced series of large language models (LLMs) developed by Alibaba Cloud, serving as a beta precursor to the Qwen2 series 134. This collection includes eight model sizes, scaling from 0.5 billion to 72 billion parameters, and features a 14-billion parameter Mixture of Experts (MoE) model. Available in both base and fine-tuned chat variants, these models offer key advancements such as enhanced human-aligned responses, stronger multilingual support across varied languages, and an extended context length capability of up to 32,768 tokens. Designed for user convenience, the Qwen1.5 models integrate effortlessly with popular frameworks like Hugging Face Transformers, vLLM, and llama.cpp. Additionally, there is a specialized CodeQwen1.5 model focused on code generation with support for up to 64K tokens 913.

Specifications(11 models)

Qwen1.5 model specifications comparison
ModelReleasedContextParametersVisionStructured Outputs
Qwen-Max2024-05128KYesYes
Qwen1.5-110B2024-04110BNoYes
Qwen1.5-MoE-A2.7B2024-032.7BNoNo
Qwen1.5-72B2024-0272BNoYes
Qwen1.5-32B2024-0232BNoYes
Qwen1.5-14B2024-0214BNoNo
Qwen1.5-7B2024-027BNoYes
Qwen1.5-4B2024-024BNoYes
Qwen1.5-1.8B2024-021.8BNoYes
Qwen1.5-0.5B2024-020.5BNoYes
DeepInfra Qwen1.5-72B-Chat2024-0233K72BNoYes

Available From(8 providers)

Pricing

Qwen1.5 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Qwen1.5-7BReplicate API$0.05$0.25Serverless
Qwen1.5-4BReplicate API$0.05$0.25Serverless
Qwen1.5-1.8BReplicate API$0.05$0.25Serverless
Qwen1.5-0.5BReplicate API$0.05$0.25Serverless
Qwen1.5-0.5BTogether AI$0.1$0.1Serverless
Qwen1.5-1.8BTogether AI$0.1$0.1Serverless
Qwen1.5-4BTogether AI$0.1$0.1Serverless
Qwen1.5-14BReplicate API$0.1$0.5Serverless
Qwen1.5-7BTogether AI$0.2$0.2Serverless
Qwen1.5-32BReplicate API$0.2$1Serverless
DeepInfra Qwen1.5-72B-ChatDeepInfra$0.45$0.65Serverless
Qwen1.5-72BReplicate API$0.65$2.75Serverless
Qwen1.5-32BTogether AI$0.8$0.8Serverless
Qwen1.5-72BFireworks AI$0.9$0.9Provisioned
Qwen1.5-72BTogether AI$0.9$0.9Serverless
Qwen-MaxOpenRouter$1.04$4.16Serverless
Qwen1.5-110BMicrosoft Foundry$1.5$2.5Provisioned
Qwen1.5-110BTogether AI$1.8$1.8Serverless

Frequently Asked Questions

What is Qwen1.5?
The Qwen1.5 family is an advanced series of large language models (LLMs) developed by Alibaba Cloud, serving as a beta precursor to the Qwen2 series 134. This collection includes eight model sizes, scaling from 0.5 billion to 72 billion parameters, and features a 14-billion parameter Mixture of Experts (MoE) model. Available in both base and fine-tuned chat variants, these models offer key advancements such as enhanced human-aligned responses, stronger multilingual support across varied languages, and an extended context length capability of up to 32,768 tokens. Designed for user convenience, the Qwen1.5 models integrate effortlessly with popular frameworks like Hugging Face Transformers, vLLM, and llama.cpp. Additionally, there is a specialized CodeQwen1.5 model focused on code generation with support for up to 64K tokens 913.
How many models are in the Qwen1.5 family?
The Qwen1.5 family contains 11 models.
What is the latest Qwen1.5 model?
The latest model is Qwen-Max, released in 2024-05.
How much does Qwen1.5 cost?
Qwen1.5 models range from $0.05/1M to $1.8/1M input tokens depending on the model and provider.

Models(11)