LLM Reference

Qwen-72B

About

Qwen-72B is a large language model developed by Alibaba Cloud, with 72 billion parameters and a Transformer-based design. It incorporates enhancements such as the SwiGLU activation, attention QKV bias, and group query attention for efficient performance on complex language tasks. Trained on more than 3 trillion tokens, it performs well across language understanding and generation, code generation, and translation, in multiple languages including Chinese and English. With a context length of up to 32,000 tokens and a vocabulary of over 150,000 tokens, it handles long text inputs well. It does carry limitations, such as biases inherited from its training data and weaknesses in common-sense reasoning, but it remains a strong performer on a range of benchmarks; a successor, Qwen2-72B, offers further improvements.
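
The SwiGLU activation mentioned above gates one linear projection of the input with a Swish (SiLU) transform of another. A minimal NumPy sketch, with toy dimensions rather than the model's real hidden sizes:

```python
import numpy as np

def silu(x):
    # Swish/SiLU: x * sigmoid(x)
    return x / (1.0 + np.exp(-x))

def swiglu(x, W_gate, W_up):
    # SwiGLU: SiLU(x @ W_gate) elementwise-gates x @ W_up
    return silu(x @ W_gate) * (x @ W_up)

# Illustrative shapes only (batch of 2, hidden 8, intermediate 16)
rng = np.random.default_rng(0)
x = rng.standard_normal((2, 8))
W_gate = rng.standard_normal((8, 16))
W_up = rng.standard_normal((8, 16))
out = swiglu(x, W_gate, W_up)
print(out.shape)  # (2, 16)
```

In the actual feed-forward block a final down-projection maps the gated intermediate back to the hidden size; the sketch shows only the gating step.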

Capabilities

Multimodal, Function Calling, Tool Use, JSON Mode
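
Function calling, tool use, and JSON mode are exercised through the serving API's request schema. A sketch of a request body in the OpenAI-style tools format that many Qwen serving stacks accept; the tool name, model id, and the `response_format` field are illustrative assumptions, and the exact schema is provider-dependent:

```python
import json

# Hypothetical tool definition (OpenAI-style function-calling schema)
weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool name
        "description": "Look up current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

request_body = {
    "model": "qwen-72b",  # placeholder model id
    "messages": [{"role": "user", "content": "What's the weather in Hangzhou?"}],
    "tools": [weather_tool],
    # Some servers expose JSON mode via a response_format field
    "response_format": {"type": "json_object"},
}

print(json.dumps(request_body)[:40])
```

The model replies either with plain content or with a structured tool call naming `get_weather` and its arguments, which the client executes and feeds back as a follow-up message.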

Providers (1)

Provider | Input (per 1M) | Output (per 1M) | Type
Fireworks AI Platform | — | — | Provisioned
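
Fireworks AI serves models through an OpenAI-compatible chat-completions endpoint. A request to it can be sketched as below; the model identifier is an assumption, so check the provider's catalog for the exact id before use:

```python
import json

# Assumed endpoint path for Fireworks AI's OpenAI-compatible API
API_URL = "https://api.fireworks.ai/inference/v1/chat/completions"

payload = {
    "model": "accounts/fireworks/models/qwen-72b",  # assumed model id
    "messages": [
        {"role": "user", "content": "Summarize SwiGLU in one sentence."}
    ],
    "max_tokens": 128,
}
headers = {
    "Authorization": "Bearer $FIREWORKS_API_KEY",  # substitute a real key
    "Content-Type": "application/json",
}
body = json.dumps(payload)
print(API_URL)
# To send: requests.post(API_URL, headers=headers, data=body)
```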

Specifications

Family: Qwen
Released: 2023-11-30
Architecture: Decoder Only
Specialization: general