GLM-Z1 Models by Tsinghua Knowledge Engineering Group (THUDM)
3 models, released 2025, up to 128k context, from $0.2/1M input tokens
Details
License: MIT (OSI-approved)
Commercial use: permitted
Models: 3
Released: 2025
Max context: 128k
Capabilities
Reasoning: 1 of 3 models
About
GLM-Z1 is a family of 3 AI models by Tsinghua Knowledge Engineering Group (THUDM), released in 2025.
Current Variants
Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| GLM-Z1 9B | Use when the workload needs 9B parameters. | 2025-01 | 9B parameters | Current |
| GLM Z1 Rumination 32B | Use when the workload needs reasoning, 128k context, and 32B parameters. | 2025-01 | reasoning, 128k context, 32B parameters | Current |
| GLM Z1 32B | Use when the workload needs 128k context and 32B parameters. | 2025-01 | 128k context, 32B parameters | Current |
Release Timeline
1 release group (2025-01), 3 current models
- GLM Z1 32B: Current, 128k context, 32B parameters
- GLM Z1 Rumination 32B: Current, reasoning, 128k context, 32B parameters
- GLM-Z1 9B: Current, 9B parameters
Specifications (3 models)
| Model | Released | Context | Parameters | Reasoning |
|---|---|---|---|---|
| GLM-Z1 9B | 2025-01 | — | 9B | No |
| GLM Z1 Rumination 32B | 2025-01 | 128k | 32B | Yes |
| GLM Z1 32B | 2025-01 | 128k | 32B | No |
Available From (1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| GLM-Z1 9B | Fireworks AI | $0.2 | $0.2 | Serverless |
| GLM Z1 Rumination 32B | Fireworks AI | $0.9 | $0.9 | Serverless |
| GLM Z1 32B | Fireworks AI | $0.9 | $0.9 | Serverless |
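
To make the per-million-token prices concrete, here is a minimal sketch that estimates the cost of a workload at the listed rates. The prices are copied from the pricing table; the function name `estimate_cost` and the sample token counts are hypothetical and only illustrate the arithmetic.

```python
# Listed prices in USD per 1M tokens (Fireworks AI, serverless), copied from the pricing table.
PRICES_PER_MILLION = {
    "GLM-Z1 9B": {"input": 0.2, "output": 0.2},
    "GLM Z1 Rumination 32B": {"input": 0.9, "output": 0.9},
    "GLM Z1 32B": {"input": 0.9, "output": 0.9},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a workload at the listed rates."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 5M input tokens and 1M output tokens.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${estimate_cost(name, 5_000_000, 1_000_000):.2f}")
```

Because input and output are priced identically for every listed variant, the estimate depends only on the total token count; the two rates are kept separate in the sketch in case a provider prices them differently.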
Frequently Asked Questions
- What is GLM-Z1 used for?
- GLM-Z1 is positioned for reasoning workloads. Of the 3 listed variants, GLM Z1 Rumination 32B is the one tagged with the reasoning capability, so it is the closest fit for those workloads.
- How does GLM-Z1 compare to ChatGLM-4?
- GLM-Z1 by Tsinghua Knowledge Engineering Group (THUDM) is strongest where you need reasoning, while ChatGLM-4 by Tsinghua Knowledge Engineering Group (THUDM) is the closest related family to check for adjacent model selection. GLM-Z1 has 3 listed variants and reaches up to 128k context, while ChatGLM-4 reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
- Which GLM-Z1 model should I use?
- For the lowest listed input price, start with GLM-Z1 9B through Fireworks AI at $0.2/1M input tokens. For the most capable listed option, evaluate GLM Z1 Rumination 32B, which offers 128k context and reasoning.
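
The selection advice in the last answer can be sketched as a small filter over the listed specifications: keep the variants that meet the requirements, then take the cheapest. This is only an illustration; the `Variant` class and `pick` function are hypothetical names, the data is copied from the specifications and pricing tables, and a missing context value is treated as unknown rather than guessed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Variant:
    name: str
    context_k: Optional[int]  # None where the specifications table lists no context value
    params_b: int
    reasoning: bool
    input_price: float  # USD per 1M input tokens


VARIANTS = [
    Variant("GLM-Z1 9B", None, 9, False, 0.2),
    Variant("GLM Z1 Rumination 32B", 128, 32, True, 0.9),
    Variant("GLM Z1 32B", 128, 32, False, 0.9),
]


def pick(needs_reasoning: bool = False, min_context_k: int = 0) -> Optional[Variant]:
    """Return the cheapest listed variant that meets the requirements, or None."""
    candidates = []
    for v in VARIANTS:
        if needs_reasoning and not v.reasoning:
            continue
        # A variant with unknown context is excluded whenever a minimum is requested.
        if min_context_k > 0 and (v.context_k is None or v.context_k < min_context_k):
            continue
        candidates.append(v)
    return min(candidates, key=lambda v: v.input_price, default=None)


if __name__ == "__main__":
    print(pick())                      # cheapest overall
    print(pick(needs_reasoning=True))  # cheapest variant tagged with reasoning
```

When two variants tie on price, `min` returns whichever appears first in `VARIANTS`, so a real selection would still need to compare them on the workload itself.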





