What is the context window of GLM-5V-Turbo?

GLM-5V-Turbo has a context window of 200k tokens.

How much does GLM-5V-Turbo cost?

GLM-5V-Turbo is available at $1.2/1M input tokens through OpenRouter.

When was GLM-5V-Turbo released?

GLM-5V-Turbo was released on 2026-04-01.

Which providers offer GLM-5V-Turbo?

GLM-5V-Turbo is available from 2 providers: OpenRouter, Vercel AI Gateway.

GLM-5V-Turbo

Name: GLM-5V-Turbo
Author: Zhipu AI

glm-5v-turbo

Researched 33d ago

Last refreshed 2026-05-22. Next refresh: weekly.

ProprietaryMultimodalRAGAgentsLong contextVisionJSON / Tool use

GLM-5V-Turbo is worth evaluating for rag, agents, and long context when its provider route and context window match the workload.
Decision context: RAG task fit, 2 tracked provider routes, and research from 2026-04-19.

Use it for

Teams evaluating rag, agents, and long context
Workloads that can use a 200k context window
Buyers comparing 2 tracked provider routes

Do not use it for

Workloads where another current model has stronger sourced task evidence

Cheapest output

$4.00

OpenRouter per 1M tokens

Provider routes

Tracked API hosts

Quality / dollar

Unknown

No task benchmark coverage yet

Freshness

2026-04-19

Researched 33d ago

aging

Top use-case fit

RAG

Included by capability and metadata signals in the decision map.

Agents

Included by capability and metadata signals in the decision map.

Long context

Included by capability and metadata signals in the decision map.

Provider price ladder

Compare all 2

Provider	Input / 1M	Output / 1M	Cache	Route
OpenRouter	$1.20	$4.00	-	Serverless
Vercel AI Gateway	$1.20	$4.00	read $0.240	Serverless

Benchmark peer barsfor RAG

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

About

First native multimodal variant of GLM-5 with CogViT visual encoder. Specialized for design-to-code tasks—converts mockups, screenshots, Figma exports, and hand-drawn sketches into HTML, CSS, and JavaScript. Trained with reinforcement learning across 30+ task types with INT8 quantization. Achieved 94.8 on Design2Code benchmark (vs Claude Opus 4.6: 77.3). Supports image, video, and text inputs natively.

GLM-5V-Turbo has a 200K-token context window.

GLM-5V-Turbo input tokens at $1.2/1M, output at $4/1M.