Qwen2.5-72B-Instruct

Name: Qwen2.5-72B-Instruct
Author: Alibaba

Released

2024-06-07

Last refreshed

2026-06-30

Status

Researched 90d ago

Open sourceCommercial use: permittedCodingRAGLong contextClassificationJSON / Tool use

Qwen2.5-72B-Instruct is worth evaluating for coding, rag, and long context when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and long context
Workloads that can use a 128k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: Qwen2.5
Released: 2024-06-07
Context: 128k
Parameters: 72.7B
Architecture: Decoder Only
Specialization: general
Openness: Open source
License: Apache 2.0OSI-approvedCommercial use: permitted
Weights: Available
Code: Unknown
Training: Fine-tuned

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$0.280

Input / 1M

$0.280

Cheapest of 7 routes · SiliconFlow

Providers(7)

DeepInfra OpenRouter Fireworks AI Novita AI Chutes AI SiliconFlow Replicate API

View 7 provider routes

About

Instruction-optimized flagship variant for demanding production applications requiring high-accuracy complex problem-solving across industries.

Qwen2.5-72B-Instruct is an open-source model in the Qwen2.5 family. The structured metadata tracks a 128k-token context window and structured outputs. This page tracks provider routes through DeepInfra, OpenRouter, Fireworks AI, and 4 more, with the cheapest tracked route listed at $0.18 input and $0.54 output per 1M tokens. Headline tracked benchmarks include Google-Proof Q&A 38.4, HellaSwag 95.6, and HumanEval 86.6.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ B

1 relevant benchmark in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 7

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
SiliconFlow	$0.280	$0.280	Serverless
DeepInfra	$0.360	$0.400	Serverless
OpenRouter	$0.360	$0.400	Serverless
Novita AI	$0.380	$0.400	Serverless

Available via routers & gateways(1)

OpenRouter

Hybrid

Unified hybrid gateway to 400+ models from 60+ providers via a single OpenAI-compatible API, with optional auto-routing that selects the best model per prompt.

PassthroughFireworks AI

Capabilities

Structured Outputs

Benchmark peer barsfor Coding

HumanEvalRank 26 of 97

Claude Sonnet 4.6

98.0

96.7

Claude Opus 4.6

95.0

Grok-3

94.5

Qwen2.5-72B-Instructcurrent

86.6

Benchmark scores(5)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Evaluation	Source
Google-Proof Q&A	38.4	diamondObserved 2024-09-01	—	Source
HellaSwag	95.6	standardObserved 2026-03-06	—	Source
HumanEval	86.6	pass@1Observed 2024-09-01	—	Source
Massive Multitask Language Understanding	88.2	5-shotObserved 2026-03-06	—	Source
Chatbot Arena	1270.0	—Observed 2026-04-15	—	Source

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(1)

Best LLMs for ClassificationListed

Compare Qwen2.5-72B-Instruct with other models

Comparison and alternatives

Browse all comparisons →

Show all 23 popular comparisonssorted by 7-day search impressions

Frequently asked questions

What is the context window of Qwen2.5-72B-Instruct?

Qwen2.5-72B-Instruct has a context window of 128k tokens.

How much does Qwen2.5-72B-Instruct cost?

Qwen2.5-72B-Instruct pricing ranges from $0.18/1M to $1.3/1M input tokens depending on the provider.

When was Qwen2.5-72B-Instruct released?

Qwen2.5-72B-Instruct was released on 2024-06-07.

Which providers offer Qwen2.5-72B-Instruct?

Qwen2.5-72B-Instruct is available from 7 providers: DeepInfra, OpenRouter, Fireworks AI, Novita AI, Chutes AI, SiliconFlow, Replicate API.

What benchmarks has Qwen2.5-72B-Instruct been tested on?

Qwen2.5-72B-Instruct has been evaluated on 5 benchmarks, including Google-Proof Q&A, HellaSwag, HumanEval, Massive Multitask Language Understanding, Chatbot Arena.

Created by

Alibaba

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China

Founded 2017

Website

Pricing

Output / 1M

$0.280

Input / 1M

$0.280

Cheapest of 7 routes · SiliconFlow

Providers(7)

DeepInfra OpenRouter Fireworks AI Novita AI Chutes AI SiliconFlow Replicate API

View 7 provider routes