LLM Reference

GLM-Z1 Models by Tsinghua Knowledge Engineering Group (THUDM)

3 models2025Up to 128k ctxFrom $0.2/1M input

Details

LicenseMITOSI-approved
Commercial useCommercial use: permitted
Models3
Released2025
Max context128k

Capabilities

Reasoning1 of 3 models

About

GLM-Z1 is a family of 3 AI models by Tsinghua Knowledge Engineering Group (THUDM), released in 2025.

Current Variants

Use-when guidance is based on each model's tracked capabilities, context window, release date, and replacement status.

3 in view
GLM-Z1 9BCurrent

Use when the workload needs 9B parameters.

2025-019B parameters

Use when the workload needs reasoning, 128k context, and 32B parameters.

2025-01reasoning128k context32B parameters
GLM Z1 32BCurrent

Use when the workload needs 128k context and 32B parameters.

2025-01128k context32B parameters

Release Timeline

1 release group
2025-01
3 current
GLM Z1 32B
128k context32B parameters
Current
GLM Z1 Rumination 32B
reasoning128k context32B parameters
Current
GLM-Z1 9B
9B parameters
Current

Specifications(3 models)

GLM-Z1 model specifications comparison
ModelReleasedContextParametersReasoning
GLM-Z1 9B2025-019BNo
GLM Z1 Rumination 32B2025-01128k32BYes
GLM Z1 32B2025-01128k32BNo

Available From(1 provider)

Pricing

GLM-Z1 model pricing by provider
ModelProviderInput / 1MOutput / 1MType
GLM-Z1 9BFireworks AI$0.2$0.2Serverless
GLM Z1 Rumination 32BFireworks AI$0.9$0.9Serverless
GLM Z1 32BFireworks AI$0.9$0.9Serverless

Frequently Asked Questions

What is GLM-Z1 used for?
GLM-Z1 is used for reasoning. The family description and listed model capabilities point to those workloads as the best fit.
How does GLM-Z1 compare to ChatGLM-4?
GLM-Z1 by Tsinghua Knowledge Engineering Group (THUDM) is strongest where you need reasoning, while ChatGLM-4 by Tsinghua Knowledge Engineering Group (THUDM) is the closest related family to check for adjacent model selection. GLM-Z1 has 3 listed variants and reaches up to 128k context, while ChatGLM-4 reaches up to 128k context, so compare the specs and pricing tables before choosing a production model.
Which GLM-Z1 model should I use?
For the lowest listed input price, start with GLM-Z1 9B through Fireworks AI at $0.2/1M input tokens. For the most capable/latest local choice, evaluate GLM Z1 Rumination 32B with 128k context and reasoning.