LLM Reference

GLM-5

Released
2026-02-11
Last refreshed
2026-05-22
Status
Researched 47d ago
Open SourceCommercial use allowedCodingRAGAgentsLong contextClassificationJSON / Tool use

GLM-5 is worth evaluating for coding, rag, and agents when its provider route and context window match the workload.

Use it for

  • Teams evaluating coding, rag, and agents
  • Workloads that can use a 200k context window
  • Buyers comparing 4 tracked provider routes

Do not use it for

  • Vision or document-understanding workloads
Specifications
Family
GLM-5
Released
2026-02-11
Context
200k
Parameters
744B total, 40B active
Architecture
Mixture of Experts
Knowledge cutoff
2025-11
Specialization
general
Openness
Open source
License
MIT(OSI)Commercial use allowed
Training
finetuned
Created by

Chinese AI research lab developing GLM language models.

Beijing, China
Founded 2019
Website
Pricing
Output / 1M
$2.08
Input / 1M
$0.600

Cheapest of 7 routes · OpenRouter

About

Flagship open-weight foundation model from Zhipu AI with 744B parameters (40B active per token) in Mixture of Experts architecture. Trained on 28.5T tokens using DeepSeek Sparse Attention on Huawei Ascend hardware. Achieves state-of-the-art performance on coding and agentic benchmarks (SWE-bench Verified: 77.8%). Supports autonomous planning, multi-step tool use, and self-correction.

GLM-5 is an open-source model. The structured metadata tracks a 200k-token context window, reasoning, function calling, tool use, and structured outputs. This page tracks provider routes through Fireworks AI, OpenRouter, Together AI, and 4 more, with the cheapest tracked route listed at $0.6 input and $2.08 output per 1M tokens. Headline tracked benchmarks include SWE-bench Pro 55.1, SWE-bench Verified 77.8, and τ-bench 82.1.

Top use-case fit: coding, agents, and build tasks

Coding

Q/$ C

2 relevant benchmarks in the decision map.

RAG

Included by capability and metadata signals in the decision map.

Agents

Q/$ B

2 relevant benchmarks in the decision map.

Provider price ladder

Compare all 7

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

ProviderInput / 1MOutput / 1MRoute
OpenRouter$0.600$2.08
Serverless
Fireworks AI$1.00$3.20
Serverless
GCP Vertex AI$1.00$3.20
Serverless
Novita AI$1.00$3.20
Serverless

Capabilities

ReasoningFunction CallingTool UseStructured Outputs

Benchmark peer barsfor Coding

Benchmark scores(4)

Scores are benchmark-specific and are direction-aware: the same numeric gap can mean very different outcomes across suites. Use the leaderboard context and this model's provider route to decide whether the winning margin is meaningful for your workload.
BenchmarkScoreVersionSource
SWE-bench Pro55.1https://huggingface.co/zai-org/GLM-5.1
SWE-bench Verified77.8SWE-bench Verifiedhttps://www.swebench.com/verified.html
τ-bench82.1τ-benchhttps://taubench.com/
SWE-rebench62.8pass@1 (best of 5 runs)https://swe-rebench.com/leaderboard

Migration checks

No linked migration route is available for this model yet.

Show all 43 popular comparisonssorted by 7-day search impressions