What is the context window of GLM-4 9B?

GLM-4 9B has a context window of 131k tokens.

How much does GLM-4 9B cost?

GLM-4 9B pricing ranges from $0.10 to $0.20 per 1M input tokens, depending on the provider.

When was GLM-4 9B released?

GLM-4 9B was released on 2024-06-05.

Which providers offer GLM-4 9B?

GLM-4 9B is available from 4 providers: Fireworks AI, Bitdeer AI, AWS Bedrock, and GCP Vertex AI.

GLM-4 9B

Name: GLM-4 9B
Author: Tsinghua Knowledge Engineering Group (THUDM)

Released

2024-06-05

Last refreshed

2026-05-19

Status

Researched 16d ago

Long context

GLM-4 9B is worth evaluating for long-context work when the workload fits within its 131k-token context window and one of the 4 tracked provider routes meets your requirements.

Use it for

Teams evaluating long context
Workloads that can use a 131k context window
Buyers comparing 4 tracked provider routes

Do not use it for

Vision or document-understanding workloads
Strict JSON or tool-calling flows

Specifications

Family: GLM-4
Released: 2024-06-05
Context: 131k
Parameters: 9B
Architecture: Decoder Only
Specialization: general
Training: finetuned

Created by

Tsinghua Knowledge Engineering Group (THUDM)

Leading China's LLM innovation surge

Beijing, China

Founded 2018

Pricing

Output / 1M

$0.100

Input / 1M

$0.100

Cheapest of 4 routes · AWS Bedrock (tied with GCP Vertex AI)

Providers (4)

Fireworks AI
Bitdeer AI
AWS Bedrock
GCP Vertex AI

About

GLM-4 9B is the 9B-parameter GLM-4 model from Tsinghua Knowledge Engineering Group (THUDM). It offers a 128K-token context window (128 × 1,024 = 131,072 tokens, shown as 131k in the structured metadata).

The structured metadata tracks a 131k-token context window. This page tracks 4 provider routes: Fireworks AI, Bitdeer AI, AWS Bedrock, and GCP Vertex AI. The cheapest tracked route is listed at $0.10 input and $0.10 output per 1M tokens. No headline benchmark score is tracked for GLM-4 9B yet.
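The 128K and 131k figures above can be turned into a quick pre-flight check before sending a long prompt. The sketch below is illustrative only: it assumes "128K" means 128 × 1,024 = 131,072 tokens, and that the prompt and the generated output share the same window. Individual provider routes may enforce different limits, and the function names are hypothetical rather than part of any provider SDK.

```python
# Assumption: "128K" means 128 * 1,024 tokens (131,072), which rounds to 131k.
CONTEXT_WINDOW = 128 * 1024


def fits_context(prompt_tokens, max_output_tokens, window=CONTEXT_WINDOW):
    """Return True if the prompt plus the reserved output budget fits the window.

    Assumes input and output tokens count against one shared window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


def remaining_tokens(prompt_tokens, max_output_tokens, window=CONTEXT_WINDOW):
    """Tokens left over after the prompt and output budget (negative if over)."""
    return window - prompt_tokens - max_output_tokens
```

A 120,000-token prompt with 8,000 tokens reserved for output fits under this assumption; a 130,000-token prompt with 2,000 reserved does not. Token counts should come from the tokenizer your provider route actually uses.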

Top use-case fit

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare API pricing across 4 providers for input and output tokens, batch, and cached reads when available.

Provider	Input / 1M	Output / 1M	Route
AWS Bedrock	$0.100	$0.100	Serverless
GCP Vertex AI	$0.100	$0.100	Serverless
Fireworks AI	$0.200	$0.200	Serverless
Bitdeer AI	$0.140	$0.420	Serverless
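
Because Bitdeer AI prices input and output differently, the cheapest route after the two $0.100 providers depends on the token mix. The sketch below estimates per-request cost from the table above. The prices are copied from this page and may be out of date; the function names are hypothetical, and batch or cached-read discounts are not modeled.

```python
# USD per 1M tokens as (input, output), copied from the provider table above.
PRICES = {
    "AWS Bedrock": (0.100, 0.100),
    "GCP Vertex AI": (0.100, 0.100),
    "Fireworks AI": (0.200, 0.200),
    "Bitdeer AI": (0.140, 0.420),
}


def request_cost(provider, input_tokens, output_tokens):
    """Estimated USD cost of one request on the given provider route."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_providers(input_tokens, output_tokens):
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = {p: request_cost(p, input_tokens, output_tokens) for p in PRICES}
    return sorted(costs.items(), key=lambda item: item[1])
```

For an input-heavy long-context request (for example 100,000 input tokens and 2,000 output tokens), Bitdeer AI comes out cheaper than Fireworks AI; for an output-heavy request (1,000 input, 100,000 output) the order flips.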

Capabilities

No model capability flags are currently sourced.

Benchmark peer bars for Long context

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks (6)

Best Small Language Models (SLMs): Listed
Cheapest LLM APIs You Can Call Right Now: Listed
Best Long Context LLMs: Listed
Best Mainstream LLM APIs, Ranked: Listed
Best LLMs for Writing: Listed
Best LLMs for Marketing: Listed