What is the context window of GLM-5?

GLM-5 has a context window of 200k tokens.

How much does GLM-5 cost?

GLM-5 pricing ranges from $0.6/1M to $1/1M input tokens depending on the provider.

When was GLM-5 released?

GLM-5 was released on 2026-02-11.

What benchmarks has GLM-5 been tested on?

GLM-5 has been evaluated on 4 benchmarks, including SWE-bench Pro, SWE-bench Verified, τ-bench, SWE-rebench.

GLM-5

Name: GLM-5
Author: Zhipu AI

Released

2026-02-11

Last refreshed

2026-05-22

Status

Researched 47d ago

Open SourceCommercial use allowedCodingRAGAgentsLong contextClassificationJSON / Tool use

GLM-5 is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

Teams evaluating coding, rag, and agents
Workloads that can use a 200k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Vision or document-understanding workloads

Specifications

Family: GLM-5
Released: 2026-02-11
Context: 200k
Parameters: 744B total, 40B active
Architecture: Mixture of Experts
Knowledge cutoff: 2025-11
Specialization: general
Openness: Open source
License: MIT(OSI)Commercial use allowed
Training: finetuned

Created by

Zhipu AI

Chinese AI research lab developing GLM language models.

Beijing, China

Founded 2019

Website

Pricing

Output / 1M

$2.08

Input / 1M

$0.600

Cheapest of 7 routes · OpenRouter

Providers(7)

Fireworks AI OpenRouter Together AI GCP Vertex AI NVIDIA NIM Vercel AI Gateway Novita AI

View 7 provider routes

Links

Website HuggingFace

About

Flagship open-weight foundation model from Zhipu AI with 744B parameters (40B active per token) in Mixture of Experts architecture. Trained on 28.5T tokens using DeepSeek Sparse Attention on Huawei Ascend hardware. Achieves state-of-the-art performance on coding and agentic benchmarks (SWE-bench Verified: 77.8%). Supports autonomous planning, multi-step tool use, and self-correction.

GLM-5 is an open-source model. The structured metadata tracks a 200k-token context window, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through Fireworks AI, OpenRouter, Together AI, and 4 more, with the cheapest tracked route listed at $0.6 input and $2.08 output per 1M tokens. Headline tracked benchmarks include SWE-bench Pro 55.1, SWE-bench Verified 77.8, and τ-bench 82.1.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ C

2 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ B

2 relevant benchmarks in the decision map.

Provider price ladder

Compare all 7

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
OpenRouter	$0.600	$2.08	Serverless
Fireworks AI	$1.00	$3.20	Serverless
GCP Vertex AI	$1.00	$3.20	Serverless
Novita AI	$1.00	$3.20	Serverless

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

Benchmark peer barsfor Coding

SWE-bench ProRank 15 of 27

69.2

64.3

60.6

59.0

55.1

SWE-bench VerifiedRank 18 of 53

Claude Mythos Preview

93.9

88.6

87.6

85.0

77.8

Benchmark scores(4)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.

Benchmark	Score	Version	Source
SWE-bench Pro	55.1	—	https://huggingface.co/zai-org/GLM-5.1
SWE-bench Verified	77.8	SWE-bench Verified	https://www.swebench.com/verified.html
τ-bench	82.1	τ-bench	https://taubench.com/
SWE-rebench	62.8	pass@1 (best of 5 runs)	https://swe-rebench.com/leaderboard