GLM-4 9B
GLM-4 9B is worth evaluating for long context when its provider route and context window match the workload.
Use it for
- Teams evaluating long context
- Workloads that can use a 131k context window
- Buyers comparing 4 tracked provider routes
Do not use it for
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- GLM-4
- Released
- 2024-06-05
- Context
- 131k
- Parameters
- 9B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
Cheapest of 4 routes · AWS Bedrock
About
GLM-4 9B is Tsinghua Knowledge Engineering Group (THUDM)'s GLM-4 model. It offers a 128K-token context window.
GLM-4 9B is a model in the GLM-4 family. The structured metadata tracks a 131k-token context window. This page tracks provider routes through Fireworks AI, Bitdeer AI, AWS Bedrock, and 1 more, with the cheapest tracked route listed at $0.1 input and $0.1 output per 1M tokens. No headline benchmark score is tracked for GLM-4 9B yet.
Top use-case fit
Long context
Included by capability and metadata signals in the decision map.
Provider price ladder
Compare all 4Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.
| Provider | Input / 1M | Output / 1M | Route |
|---|---|---|---|
| AWS Bedrock | $0.100 | $0.100 | Serverless |
| GCP Vertex AI | $0.100 | $0.100 | Serverless |
| Fireworks AI | $0.200 | $0.200 | Serverless |
| Bitdeer AI | $0.140 | $0.420 | Serverless |
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Long context
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.