LLM Reference

Qwen2.5-Coder Models by Alibaba

AlibabaApache 2.0Open sourceCoding
12 models2024Up to 128k ctxFrom $0.1/1M input

Details

ResearcherAlibaba
LicenseApache 2.0(OSI)
Commercial useCommercial use allowed
Models12
Released2024
Max context128k

Capabilities

Structured Outputs3 of 12 models
Code Execution2 of 12 models

About

The Qwen 2.5 Coder family is a sophisticated language model family designed for programming tasks and general computational reasoning. Developed with scalability in mind, these models range from 0.5 billion to 32 billion parameters, supporting extensive contexts up to 128,000 tokens. They demonstrate proficiency across 92 programming languages and excel in tasks like code generation, repair, and multi-language programming challenges. Remarkably, the 7-billion parameter variant outperforms much larger models like DeepSeek-Coder-V2-Lite on specific benchmarks, illustrating its efficiency and innovation. The family includes both base and instruction-tuned models. The instruction-tuned "Coder-Instruct" models enhance performance on various tasks and showcase superior generalization. These models are rigorously benchmarked on datasets such as McEval for multi-language programming and CRUXEval for reasoning, yielding exceptional results in code inference and mathematical tasks. The integration of diverse datasets maintains strong general capabilities, ensuring these models are versatile across technical and non-technical domains. Qwen 2.5 Coder is open-sourced under the Apache 2.0 license, encouraging community experimentation and deployment. The series' next iteration, with a 32-billion parameter model, is in development, promising even greater advancements in code intelligence. Practical applications, including code assistants and artifact generation tools, highlight its readiness for real-world scenarios, empowering developers with an accessible, powerful coding solution.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

12 in view

Use when the workload needs code, 128k context, and 14B parameters.

2024-11code128k context14B parameters

Use when the workload needs code, 128k context, and 14B parameters.

2024-11code128k context14B parameters

Use when the workload needs code, 128k context, and 32B parameters.

2024-11code128k context32B parameters

Use when the workload needs code, 128k context, and 32B parameters.

2024-11code128k context32B parameters

Use when the workload needs code, 32k context, and 3B parameters.

2024-11code32k context3B parameters

Use when the workload needs code, 32k context, and 3B parameters.

2024-11code32k context3B parameters

Use when the workload needs code, 32k context, and 500M parameters.

2024-11code32k context500M parameters

Use when the workload needs code, 32k context, and 500M parameters.

2024-11code32k context500M parameters

Use when the workload needs code, 32k context, and 1.5B parameters.

2024-09code32k context1.5B parameters

Use when the workload needs code, 32k context, and 1.5B parameters.

2024-09code32k context1.5B parameters

Use when the workload needs code, 128k context, and 7.6B parameters.

2024-09code128k context7.6B parameters

Use when the workload needs code, 128k context, and 7.6B parameters.

2024-09code128k context7.6B parameters

Release Timeline

2 release groups
2024-11
8 current
Qwen2.5-Coder-0.5B
code32k context500M parameters
Current
Qwen2.5-Coder-0.5B-Instruct
code32k context500M parameters
Current
Qwen2.5-Coder-14B
code128k context14B parameters
Current
Qwen2.5-Coder-14B-Instruct
code128k context14B parameters
Current
Qwen2.5-Coder-32B
code128k context32B parameters
Current
Qwen2.5-Coder-32B-Instruct
code128k context32B parameters
Current
Qwen2.5-Coder-3B
code32k context3B parameters
Current
Qwen2.5-Coder-3B-Instruct
code32k context3B parameters
Current
2024-09
4 current
Qwen2.5-Coder-1.5B
code32k context1.5B parameters
Current
Qwen2.5-Coder-1.5B-Instruct
code32k context1.5B parameters
Current
Qwen2.5-Coder-7B
code128k context7.6B parameters
Current
Qwen2.5-Coder-7B-Instruct
code128k context7.6B parameters
Current

Specifications(12 models)

Qwen2.5-Coder model specifications comparison
ModelReleasedContextParametersStructured OutputsCode Exec
Qwen2.5-Coder-14B2024-11128k14BNoNo
Qwen2.5-Coder-14B-Instruct2024-11128k14BNoNo
Qwen2.5-Coder-32B2024-11128k32BYesYes
Qwen2.5-Coder-32B-Instruct2024-11128k32BYesYes
Qwen2.5-Coder-3B2024-1132k3BNoNo
Qwen2.5-Coder-3B-Instruct2024-1132k3BNoNo
Qwen2.5-Coder-0.5B2024-1132k0.5BNoNo
Qwen2.5-Coder-0.5B-Instruct2024-1132k0.5BNoNo
Qwen2.5-Coder-1.5B2024-0932k1.54BNoNo
Qwen2.5-Coder-1.5B-Instruct2024-0932k1.54BNoNo
Qwen2.5-Coder-7B2024-09128k7.61BNoNo
Qwen2.5-Coder-7B-Instruct2024-09128k7.61BYesNo

Pricing

Qwen2.5-Coder model pricing by provider
ModelProviderInput / 1MOutput / 1MType
Qwen2.5-Coder-1.5B-InstructFireworks AI$0.1$0.1Serverless
Qwen2.5-Coder-3B-InstructFireworks AI$0.1$0.1Serverless
Qwen2.5-Coder-32B-InstructSiliconFlow$0.18$0.18Serverless
Qwen2.5-Coder-32BDeepInfra$0.2$0.2Serverless
Qwen2.5-Coder-14B-InstructFireworks AI$0.2$0.2Serverless
Qwen2.5-Coder-7B-InstructFireworks AI$0.2$0.2Serverless
Qwen2.5-Coder-32B-InstructArcee AI$0.4$1.2Serverless
Qwen2.5-Coder-32B-InstructCloudflare Workers AI$0.66$1Serverless
Qwen2.5-Coder-32B-InstructOpenRouter$0.66$1Serverless
Qwen2.5-Coder-32B-InstructFireworks AI$0.9$0.9Serverless
Qwen2.5-Coder-32BFireworks AI$0.9$0.9Serverless

Frequently Asked Questions

What is Qwen2.5-Coder used for?
Qwen2.5-Coder is used for coding, code, and structured outputs. The family description and listed model capabilities point to those workloads as the best fit.
How does Qwen2.5-Coder compare to Tongyi DeepResearch?
Qwen2.5-Coder by Alibaba is strongest where you need coding, while Tongyi DeepResearch by Alibaba is the closest related family to check for adjacent model selection. Qwen2.5-Coder has 12 listed variants and reaches up to 128k context, while Tongyi DeepResearch reaches up to 131k context, so compare the specs and pricing tables before choosing a production model.
Which Qwen2.5-Coder model should I use?
For the lowest listed input price, start with Qwen2.5-Coder-1.5B-Instruct through Fireworks AI at $0.1/1M input tokens. For the most capable/latest local choice, evaluate Qwen2.5-Coder-32B with 128k context and structured outputs.