LLM Reference

Qwen-72B

Released
2023-11-30
Last refreshed
2026-05-19
Status
Researched 16d ago

Qwen-72B is worth evaluating for general LLM work when its provider route and context window match the workload.

Use it for

  • Teams evaluating general LLM work
  • Workloads that can use a 32k context window
  • Buyers comparing 1 tracked provider route

Do not use it for

  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Family
Qwen
Released
2023-11-30
Context
32k
Parameters
72B
Architecture
Decoder Only
Specialization
general
Training
finetuned
Created by

AI research institute of Alibaba Group.

Hangzhou, Zhejiang, China
Founded 2017
Website
Pricing
Output / 1M
$0.900
Input / 1M
$0.900

Cheapest of 1 route · Fireworks AI

About

The Qwen-72B is a potent large language model developed by Alibaba Cloud, featuring 72 billion parameters and built on the Transformer architecture. This model introduces enhancements such as SwiGLU activation, attention QKV bias, and group query attention for efficient performance on complex language tasks. Trained on a dataset of over 3 trillion tokens, it excels in multiple domains, including language understanding and generation, code generation, and translation, across multiple languages like Chinese and English. With an impressive context length of up to 32,000 tokens and a vocabulary exceeding 150,000 tokens, it manages extensive text inputs well. However, it bears limitations like inherited training data biases and lack of common sense reasoning. Despite these constraints, it remains a strong performer across various benchmarks, with a subsequent version, Qwen2-72B, offering further advancements.

Qwen-72B is a model in the Qwen family. The structured metadata tracks a 32k-token context window. This page tracks provider routes through Fireworks AI, with the cheapest tracked route listed at $0.9 input and $0.9 output per 1M tokens. No headline benchmark score is tracked for Qwen-72B yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

Compare API pricing across 1 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
Fireworks AI$0.900$0.900
Provisioned

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(6)