Using GLM-4.7 Flash on Novita AI

Implementation guide · GLM-4 · Tsinghua Knowledge Engineering Group (THUDM)

Serverless · Open Source

Quick Start

Code examples for this provider have not been sourced yet.

Novita AI offers a GPU-based inference API for image, video, and language model generation with a broad catalog of open-source models.

Type	Price (USD per 1M tokens)
Input tokens	$0.07
Output tokens	$0.40
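The per-token rates above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost_usd` is hypothetical, and it assumes billing is strictly linear in token counts with no minimums, rounding, or discounts.

```python
# Prices copied from the table above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.07
OUTPUT_PRICE_PER_M = 0.40


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD.

    Hypothetical helper: assumes cost scales linearly with token counts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

Because output tokens cost roughly six times as much as input tokens, long generations tend to dominate the bill even when prompts are large.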

Structured Outputs

GLM-4.7 Flash is a model in the GLM-4 family from Tsinghua Knowledge Engineering Group (THUDM). It offers a 198K-token context window.
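A context window of this size still needs budgeting when prompts are long. The sketch below checks whether a request fits, under two assumptions that the page does not confirm: that "198K" means 198,000 tokens (a conservative reading), and that prompt and generated tokens share the same window. The names `CONTEXT_WINDOW_TOKENS` and `fits_context` are hypothetical.

```python
# Assumption: "198K" is read conservatively as 198,000 tokens.
CONTEXT_WINDOW_TOKENS = 198_000


def fits_context(prompt_tokens: int, max_output_tokens: int,
                 window: int = CONTEXT_WINDOW_TOKENS) -> bool:
    """Return True if prompt plus requested output fits in the window.

    Assumes input and output tokens count against one shared window.
    """
    if prompt_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + max_output_tokens <= window


def max_output_budget(prompt_tokens: int,
                      window: int = CONTEXT_WINDOW_TOKENS) -> int:
    """Return how many output tokens remain after the prompt (never negative)."""
    return max(0, window - prompt_tokens)


if __name__ == "__main__":
    print(fits_context(150_000, 40_000))   # fits: 190,000 <= 198,000
    print(fits_context(190_000, 10_000))   # does not fit: 200,000 > 198,000
    print(max_output_budget(190_000))      # tokens left for generation
```

Token counts depend on the model's tokenizer, so any count produced by a different tokenizer should be treated as an estimate and given some headroom.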

Released: 2025-01-01

Parameters: 30B (3B active)

Context: 198K tokens

Architecture: Decoder Only