Provider directory: 13 published models

Z.ai Models

Browse Z.ai models available through AICredits with INR pricing, supported APIs, and endpoint-specific details.

Use provider pages to compare model IDs, supported endpoints, context windows, and token pricing before integrating.
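To make the token pricing concrete, here is a minimal sketch of a per-request cost estimate. The prices are copied from the z-ai/glm-4.5-air entry on this page; the function name `estimate_cost_inr` and its parameters are hypothetical, and the sketch assumes cached and non-cached input tokens are billed separately at the listed rates. Because the input price is listed as "from", treat the result as a lower bound.

```python
from decimal import Decimal

# INR per 1M tokens, copied from the z-ai/glm-4.5-air listing on this page.
# The input price is a "from" price, so real charges may be higher.
PRICES_INR_PER_1M = {
    "input": Decimal("12.89"),
    "cached_input": Decimal("1.29"),
    "output": Decimal("84.29"),
}


def estimate_cost_inr(input_tokens, output_tokens, cached_input_tokens=0,
                      prices=PRICES_INR_PER_1M):
    """Estimate one request's cost in INR (hypothetical helper).

    Assumes input_tokens counts only non-cached input tokens and that
    cached input tokens are billed at the cached rate.
    """
    total = (
        Decimal(input_tokens) * prices["input"]
        + Decimal(cached_input_tokens) * prices["cached_input"]
        + Decimal(output_tokens) * prices["output"]
    )
    # Listed prices are per 1,000,000 tokens.
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Example: 10,000 input tokens and 2,000 output tokens.
    print(estimate_cost_inr(10_000, 2_000))
```

Decimal is used instead of float so that small per-token amounts do not pick up rounding noise when summed over many requests.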

z-ai/glm-4-32b

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...

Chat API
Context: 128K tokens
Input: from ₹9.92/1M tokens
Cached input: ₹0.99/1M tokens
Output: ₹9.92/1M tokens

z-ai/glm-4.5

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

Chat API
Context: 131K tokens
Input: from ₹59.50/1M tokens
Cached input: ₹5.95/1M tokens
Output: ₹218.15/1M tokens

z-ai/glm-4.5-air

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

Chat API
Context: 131K tokens
Input: from ₹12.89/1M tokens
Cached input: ₹1.29/1M tokens
Output: ₹84.29/1M tokens

z-ai/glm-4.5v

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

Chat API, Vision
Context: 66K tokens
Input: from ₹59.50/1M tokens
Cached input: ₹5.95/1M tokens
Output: ₹178.49/1M tokens

z-ai/glm-4.6

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

Chat API
Context: 203K tokens
Input: from ₹42.64/1M tokens
Cached input: ₹4.26/1M tokens
Output: ₹172.54/1M tokens

z-ai/glm-4.6v

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...

Chat API, Vision
Context: 131K tokens
Input: from ₹29.75/1M tokens
Cached input: ₹2.97/1M tokens
Output: ₹89.24/1M tokens

z-ai/glm-4.7

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while...

Chat API
Context: 203K tokens
Input: from ₹39.66/1M tokens
Cached input: ₹3.97/1M tokens
Output: ₹173.53/1M tokens

z-ai/glm-4.7-flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Chat API
Context: 203K tokens
Input: from ₹5.95/1M tokens
Cached input: ₹0.59/1M tokens
Output: ₹39.66/1M tokens

z-ai/glm-5

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading...

Chat API
Context: 203K tokens
Input: from ₹59.50/1M tokens
Cached input: ₹5.95/1M tokens
Output: ₹190.39/1M tokens

z-ai/glm-5-turbo

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...

Chat API
Context: 262K tokens
Input: from ₹118.99/1M tokens
Cached input: ₹11.90/1M tokens
Output: ₹396.64/1M tokens

z-ai/glm-5.1

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Chat API
Context: 203K tokens
Input: from ₹97.18/1M tokens
Cached input: ₹9.72/1M tokens
Output: ₹305.41/1M tokens

z-ai/glm-5.2

GLM 5.2 is a large-scale reasoning model from Z.ai. It supports text input and output with a 1M-token context window, and is suited for long-horizon agent workflows, project-level software engineering,...

Chat API
Context: 1.0M tokens
Input: from ₹138.82/1M tokens
Output: ₹436.30/1M tokens

z-ai/glm-5v-turbo

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...

Chat API, Vision
Context: 203K tokens
Input: from ₹118.99/1M tokens
Cached input: ₹11.90/1M tokens
Output: ₹396.64/1M tokens
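
Comparing entries by context window, vision support, and price can also be done programmatically. The sketch below hard-codes a few rows copied from this page and picks the cheapest model by output price that meets a minimum context size. The `Model` type, the `cheapest` function, and the choice of output price as the ranking key are illustrative assumptions, and context sizes are the rounded figures shown in the listing.

```python
from typing import NamedTuple, Optional, Sequence


class Model(NamedTuple):
    model_id: str
    context_k: int     # context window in thousands of tokens, as rounded on this page
    input_inr: float   # "from" input price, INR per 1M tokens
    output_inr: float  # output price, INR per 1M tokens
    vision: bool       # True when the listing carries the Vision tag


# A subset of the listing above, copied by hand (not fetched from any API).
CATALOG = [
    Model("z-ai/glm-4-32b", 128, 9.92, 9.92, False),
    Model("z-ai/glm-4.5v", 66, 59.50, 178.49, True),
    Model("z-ai/glm-4.6v", 131, 29.75, 89.24, True),
    Model("z-ai/glm-4.7-flash", 203, 5.95, 39.66, False),
    Model("z-ai/glm-5.2", 1000, 138.82, 436.30, False),
    Model("z-ai/glm-5v-turbo", 203, 118.99, 396.64, True),
]


def cheapest(min_context_k: int, need_vision: bool = False,
             catalog: Sequence[Model] = CATALOG) -> Optional[Model]:
    """Return the lowest-output-price model meeting the constraints, or None."""
    candidates = [
        m for m in catalog
        if m.context_k >= min_context_k and (m.vision or not need_vision)
    ]
    return min(candidates, key=lambda m: m.output_inr, default=None)
```

For example, `cheapest(200)` selects among the 203K and 1.0M entries, while `cheapest(100, need_vision=True)` considers only the Vision-tagged models with at least a 100K context.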