LLM ReferenceLLM Reference

Qwen2.5

AlibabaHighlight
17 models2024–2025Up to 128K ctxFrom $0.04/1M input

About

The Qwen 2.5 large language model (LLM) family, developed by Alibaba Cloud's Qwen team, consists of decoder-only dense models that are open-sourced and come in seven different sizes ranging from 0.5 billion to 72 billion parameters 124. Built on a colossal dataset of up to 18 trillion tokens, these models showcase improvements from the Qwen 2 series, especially in knowledge, coding, and mathematical tasks 13. They boast enhanced capabilities in instruction following, long-text generation, and understanding structured data, including generating structured outputs like JSON 23. Other noteworthy features include improved system prompt handling for better role-playing and chatbot configuration 23. The models support over 29 languages, including Chinese and English, and include specialized versions like Qwen2.5-Coder and Qwen2.5-Math for specific tasks 236. Moreover, Qwen-Plus and Qwen-Turbo are accessible through Alibaba Cloud Model Studio APIs 3.

Specifications(17 models)

Qwen2.5 model specifications comparison
ModelReleasedContextParametersFn CallingTool UseStructured Outputs
Qwen2.5-72B2025-10128k72BYesYesNo
Qwen2.5-Max2025-01NoNoNo
Qwen2.5-VL-72B2025-0133K72BNoNoNo
Qwen2.5-0.5B2024-06128K490MNoNoNo
Qwen2.5-0.5B-Instruct2024-06128K490MNoNoNo
Qwen2.5-1.5B2024-06128K1.54BNoNoNo
Qwen2.5-1.5B-Instruct2024-06128K1.54BNoNoNo
Qwen2.5-3B2024-06128K3.09BNoNoNo
Qwen2.5-3B-Instruct2024-06128K3.09BNoNoNo
Qwen2.5-7B2024-06128K7.61BNoNoNo
Qwen2.5-7B-Instruct2024-06128K7.61BNoNoYes
Qwen2.5-14B2024-06128K14.7BNoNoNo
Qwen2.5-14B-Instruct2024-06128K14.7BNoNoYes
Qwen2.5-32B2024-06128K32.5BNoNoNo
Qwen2.5-32B-Instruct2024-06128K32.5BNoNoYes
Qwen2.5-72B2024-06128K72.7BNoNoNo
Qwen2.5-72B-Instruct2024-06128K72.7BNoNoYes

Available From(10 providers)

Pricing

Qwen2.5 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Qwen2.5-7B-InstructOpenRouter$0.04$0.1Serverless
Qwen2.5-7B-InstructSiliconFlow$0.04$0.04Serverless
Qwen2.5-14B-InstructSiliconFlow$0.08$0.08Serverless
Qwen2.5-0.5B-InstructFireworks AI$0.1$0.1Serverless
Qwen2.5-1.5B-InstructFireworks AI$0.1$0.1Serverless
Qwen2.5-72B-InstructOpenRouter$0.12$0.39Serverless
Qwen2.5-0.5BBitdeer AI$0.12$0.36Serverless
Qwen2.5-1.5BBitdeer AI$0.12$0.36Serverless
Qwen2.5-7BBitdeer AI$0.12$0.36Serverless
Qwen2.5-14BBitdeer AI$0.12$0.36Serverless
Qwen2.5-32BBitdeer AI$0.12$0.36Serverless
Qwen2.5-7B-InstructTogether AI$0.15$0.15Serverless
Qwen2.5-32B-InstructSiliconFlow$0.15$0.15Serverless
Qwen2.5-72B-InstructChutes AI$0.18$0.54Serverless
Qwen2.5-14B-InstructFireworks AI$0.2$0.2Serverless
Qwen2.5-7BFireworks AI$0.2$0.2Serverless
Qwen2.5-14BFireworks AI$0.2$0.2Serverless
Qwen2.5-7B-InstructFireworks AI$0.2$0.2Serverless
Qwen2.5-72BBitdeer AI$0.2$0.6Serverless
Qwen2.5-72B-InstructNovita AI$0.2$0.6Serverless
Qwen2.5-72B-InstructSiliconFlow$0.28$0.28Serverless
Qwen2.5-32B-InstructReplicate API$0.6$0.6Serverless
Qwen2.5-32BFireworks AI$0.9$0.9Serverless
Qwen2.5-32B-InstructFireworks AI$0.9$0.9Serverless
Qwen2.5-72BFireworks AI$0.9$0.9Serverless
Qwen2.5-72B-InstructFireworks AI$0.9$0.9Serverless
Qwen2.5-72B-InstructReplicate API$1.3$1.3Serverless
Qwen2.5-7B-InstructDeepInfra$3$3Serverless
Qwen2.5-14B-InstructDeepInfra$10$10Serverless
Qwen2.5-72B-InstructDeepInfra$23$23Serverless

Comparisons

All comparisons →

Frequently Asked Questions

What is Qwen2.5?
The Qwen 2.5 large language model (LLM) family, developed by Alibaba Cloud's Qwen team, consists of decoder-only dense models that are open-sourced and come in seven different sizes ranging from 0.5 billion to 72 billion parameters 124. Built on a colossal dataset of up to 18 trillion tokens, these models showcase improvements from the Qwen 2 series, especially in knowledge, coding, and mathematical tasks 13. They boast enhanced capabilities in instruction following, long-text generation, and understanding structured data, including generating structured outputs like JSON 23. Other noteworthy features include improved system prompt handling for better role-playing and chatbot configuration 23. The models support over 29 languages, including Chinese and English, and include specialized versions like Qwen2.5-Coder and Qwen2.5-Math for specific tasks 236. Moreover, Qwen-Plus and Qwen-Turbo are accessible through Alibaba Cloud Model Studio APIs 3.
How many models are in the Qwen2.5 family?
The Qwen2.5 family contains 17 models.
What is the latest Qwen2.5 model?
The latest model is Qwen2.5-72B, released in 2025-10.
How much does Qwen2.5 cost?
Qwen2.5 models range from $0.04/1M to $23/1M input tokens depending on the model and provider.

Models(17)