GLM-Z1 Models by Tsinghua Knowledge Engineering Group (THUDM)
3 models · 2025 · Up to 128K context · From $0.2/1M input tokens
About
GLM-Z1 is a family of 3 AI models by Tsinghua Knowledge Engineering Group (THUDM), released in 2025.
Current Variants
Use-when guidance is derived from seed capabilities, context, release, and replacement fields.
| Model | Use when | Released | Signals | Status |
|---|---|---|---|---|
| GLM-Z1 9B | Use when the workload needs 9B parameters. | 2025-01 | 9B parameters | Current |
| GLM Z1 Rumination 32B | Use when the workload needs reasoning, 128K context, and 32B parameters. | 2025-01 | reasoning, 128K context, 32B parameters | Current |
| GLM Z1 32B | Use when the workload needs 128K context and 32B parameters. | 2025-01 | 128K context, 32B parameters | Current |
Release Timeline
1 release group: 2025-01 (3 current)
- GLM Z1 32B: Current, 128K context, 32B parameters
- GLM Z1 Rumination 32B: Current, reasoning, 128K context, 32B parameters
- GLM-Z1 9B: Current, 9B parameters
Specifications (3 models)
| Model | Released | Context | Parameters | Reasoning |
|---|---|---|---|---|
| GLM-Z1 9B | 2025-01 | — | 9B | No |
| GLM Z1 Rumination 32B | 2025-01 | 128K | 32B | Yes |
| GLM Z1 32B | 2025-01 | 128K | 32B | No |
Available From (1 provider)
Pricing
| Model | Provider | Input / 1M | Output / 1M | Type |
|---|---|---|---|---|
| GLM-Z1 9B | Fireworks AI | $0.2 | $0.2 | Serverless |
| GLM Z1 Rumination 32B | Fireworks AI | $0.9 | $0.9 | Serverless |
| GLM Z1 32B | Fireworks AI | $0.9 | $0.9 | Serverless |
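To make the per-token prices concrete, here is a minimal sketch that estimates the cost of a workload from the table above. The `PRICES` dictionary and the `estimate_cost` helper are hypothetical names introduced for illustration, and the token counts are an assumed example workload, not measured usage.

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
# PRICES and estimate_cost are illustrative names, not part of any provider SDK.
PRICES = {
    "GLM-Z1 9B": {"input": 0.2, "output": 0.2},
    "GLM Z1 Rumination 32B": {"input": 0.9, "output": 0.9},
    "GLM Z1 32B": {"input": 0.9, "output": 0.9},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for the given token counts."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 2M input tokens and 500K output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.2f}")
```

Because input and output are priced identically for every listed model, the estimate depends only on the total token count. Reasoning-style models may emit long outputs, so output volume is worth measuring before relying on an estimate like this.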
Frequently Asked Questions
- What is GLM-Z1 used for?
- GLM-Z1 is used for reasoning workloads. Both the family description and the listed model capabilities point to reasoning as the best fit.
- How does GLM-Z1 compare to ChatGLM-4?
- Both families come from Tsinghua Knowledge Engineering Group (THUDM). GLM-Z1 is strongest where you need reasoning, while ChatGLM-4 is the closest related family to consider when selecting among adjacent models. GLM-Z1 has 3 listed variants and reaches up to 128K context, so compare the specifications and pricing tables before choosing a production model.
- Which GLM-Z1 model should I use?
- For the lowest listed input price, start with GLM-Z1 9B through Fireworks AI at $0.2/1M input tokens. For the most capable listed option, evaluate GLM Z1 Rumination 32B, which offers 128K context and reasoning.
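
The selection guidance above can be sketched as a small filter over the listed specifications: keep the variants that meet the requirements, then take the cheapest. The `Variant` class and `pick_variant` function are hypothetical names for illustration. The values are copied from the specifications and pricing tables on this page, with `None` standing in for the context value the table does not list.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Variant:
    name: str
    context_k: Optional[int]  # context window in K tokens; None if not listed
    params_b: int             # parameter count in billions
    reasoning: bool           # "Reasoning" column of the specifications table
    input_price: float        # USD per 1M input tokens


# Values copied from the tables on this page.
VARIANTS = [
    Variant("GLM-Z1 9B", None, 9, False, 0.2),
    Variant("GLM Z1 Rumination 32B", 128, 32, True, 0.9),
    Variant("GLM Z1 32B", 128, 32, False, 0.9),
]


def pick_variant(need_reasoning: bool = False,
                 min_context_k: int = 0) -> Optional[Variant]:
    """Return the cheapest listed variant meeting the requirements, or None."""
    candidates = [
        v for v in VARIANTS
        if (v.reasoning or not need_reasoning)
        and (min_context_k == 0
             or (v.context_k is not None and v.context_k >= min_context_k))
    ]
    return min(candidates, key=lambda v: v.input_price, default=None)


if __name__ == "__main__":
    print(pick_variant())                     # no constraints: cheapest overall
    print(pick_variant(need_reasoning=True))  # only variants flagged as reasoning
```

A variant with an unlisted context window is excluded whenever a minimum context is requested. That is a conservative assumption of this sketch, not a statement about the model.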





