LLM Reference

GLM-4.6 on Novita AI

GLM-4 · Tsinghua Knowledge Engineering Group (THUDM)

Serverless

Last refreshed 2026-05-22. Next refresh: weekly.

Why use GLM-4.6 on Novita AI?

Novita AI offers GLM-4.6 with pay-as-you-go pricing at $0.55/1M input tokens. Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Compare GLM-4.6 across 4 providers to find the best fit for your use case
Input / 1M
$0.55
Output / 1M
$2.20
Cache
Not sourced
Batch
Not sourced

Setup recipe

Docs fallback
Install
Use the provider REST API or SDK
Auth
Create a provider API key
Call
model: glm-4.6
Model ID
glm-4.6

Request example

Curated snippets for this provider have not been sourced yet.

Gotchas

No curated gotchas have been sourced for this exact provider/model route yet.

Compare GLM-4.6 Across Providers

ProviderInput (per 1M)Output (per 1M)
Fireworks AI$0.90$0.90
OpenRouter$0.39$1.90
Vercel AI Gateway$0.60$2.20
Novita AI$0.55$2.20

Pricing

TypePrice (per 1M)
Input tokens$0.55
Output tokens$2.20

Capabilities

Structured Outputs

About GLM-4.6

GLM-4.6 is Tsinghua Knowledge Engineering Group (THUDM)'s GLM-4 model. It offers a 198K-token context window.

FAQ

What does GLM-4.6 cost on Novita AI?

On Novita AI, GLM-4.6 costs $0.55 per 1M input tokens and $2.2 per 1M output tokens.

What is the context window for GLM-4.6 on Novita AI?

GLM-4.6 supports a 204,800 token context window on Novita AI.

How does Novita AI compare to other GLM-4.6 providers?

GLM-4.6 is available from 4 providers. The cheapest input pricing is $0.39/1M tokens from OpenRouter.

What API model ID do I use for GLM-4.6 on Novita AI?

Use the model ID glm-4.6 when calling Novita AI's API.

Who created GLM-4.6?

GLM-4.6 was created by Tsinghua Knowledge Engineering Group (THUDM) as part of the GLM-4 model family.

Is GLM-4.6 open source?

GLM-4.6 is not open source; the seed data lists it as proprietary.

Get Started

Model Specs

Released2025-01-01
Parameters355B (32B active)
Context198K
ArchitectureDecoder Only