GLM 4.5V
glm-4.5v
ProprietaryMultimodal
About
GLM-4.5V is a vision-language MoE model from Z.ai designed for multimodal agent applications, handling both image understanding and text generation at 64K context.
GLM 4.5V has a 64K-token context window.
Capabilities
VisionMultimodalReasoningFunction CallingTool UseStructured OutputsCode Execution