LLM Reference

GLM 4.6V on Novita AI

GLM-4 · Tsinghua Knowledge Engineering Group (THUDM)

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use GLM 4.6V on Novita AI?

Novita AI offers GLM 4.6V with pay-as-you-go pricing at $0.30/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Compare GLM 4.6V across 2 providers to find the best fit for your use case
Input / 1M
$0.30
Output / 1M
$0.90
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: glm-4.6v
Model ID
glm-4.6v

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Compare GLM 4.6V Across Providers

ProviderInput (per 1M)Output (per 1M)
Vercel AI Gateway$0.30$0.90
Novita AI$0.30$0.90

Pricing

TypePrice (per 1M)
Input tokens$0.30
Output tokens$0.90

Capabilities

VisionMultimodalFunction CallingTool Use

About GLM 4.6V

GLM-4.6V is Z.ai's large multimodal model for high-fidelity visual understanding and long-context reasoning across images, charts, and documents at 128K context.

FAQ

What does GLM 4.6V cost on Novita AI?

On Novita AI, GLM 4.6V costs $0.3 per 1M input tokens and $0.9 per 1M output tokens.

What is the context window for GLM 4.6V on Novita AI?

GLM 4.6V supports a 131,072 token context window on Novita AI.

How does Novita AI compare to other GLM 4.6V providers?

GLM 4.6V is available from 2 providers. The cheapest input pricing is $0.3/1M tokens from Vercel AI Gateway.

What API model ID do I use for GLM 4.6V on Novita AI?

Use the model ID glm-4.6v when calling Novita AI's API.

Who created GLM 4.6V?

GLM 4.6V was created by Tsinghua Knowledge Engineering Group (THUDM) as part of the GLM-4 model family.

Is GLM 4.6V open source?

GLM 4.6V is not open source; the seed data lists it as proprietary.

Get Started

Model Specs

Released2026-02-01
Parameters106B (12B active)
Context128K
ArchitectureDecoder Only