Z.ai Models with INR Pricing

z-ai/glm-4-32b

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...

Chat API

Context

128K

Input

Input from ₹9.92/1M

Cached input

Cached ₹0.99/1M

Output

Output ₹9.92/1M

View details

Z.ai: GLM 4.5Z.ai

z-ai/glm-4.5

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

Chat API

Context

131K

Input

Input from ₹59.50/1M

Cached input

Cached ₹5.95/1M

Output

Output ₹218.15/1M

View details

Z.ai: GLM 4.5 AirZ.ai

z-ai/glm-4.5-air

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

Chat API

Context

131K

Input

Input from ₹12.89/1M

Cached input

Cached ₹1.29/1M

Output

Output ₹84.29/1M

View details

Z.ai: GLM 4.5VZ.ai

z-ai/glm-4.5v

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

Chat APIVision

Context

66K

Input

Input from ₹59.50/1M

Cached input

Cached ₹5.95/1M

Output

Output ₹178.49/1M

View details

Z.ai: GLM 4.6Z.ai

z-ai/glm-4.6

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

Chat API

Context

203K

Input

Input from ₹42.64/1M

Cached input

Cached ₹4.26/1M

Output

Output ₹172.54/1M

View details

Z.ai: GLM 4.6VZ.ai

z-ai/glm-4.6v

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...

Chat APIVision

Context

131K

Input

Input from ₹29.75/1M

Cached input

Cached ₹2.97/1M

Output

Output ₹89.24/1M

View details

Z.ai: GLM 4.7Z.ai

z-ai/glm-4.7

GLM-4.7 is Z.ai’s latest flagship model, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reasoning/execution. It demonstrates significant improvements in executing complex agent tasks while...

Chat API

Context

203K

Input

Input from ₹39.66/1M

Cached input

Cached ₹3.97/1M

Output

Output ₹173.53/1M

View details

Z.ai: GLM 4.7 FlashZ.ai

z-ai/glm-4.7-flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Chat API

Context

203K

Input

Input from ₹5.95/1M

Cached input

Cached ₹0.59/1M

Output

Output ₹39.66/1M

View details

Z.ai: GLM 5Z.ai

z-ai/glm-5

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading...

Chat API

Context

203K

Input

Input from ₹59.50/1M

Cached input

Cached ₹5.95/1M

Output

Output ₹190.39/1M

View details

Z.ai: GLM 5 TurboZ.ai

z-ai/glm-5-turbo

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...

Chat API

Context

262K

Input

Input from ₹118.99/1M

Cached input

Cached ₹11.90/1M

Output

Output ₹396.64/1M

View details

Z.ai: GLM 5.1Z.ai

z-ai/glm-5.1

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Chat API

Context

203K

Input

Input from ₹97.18/1M

Cached input

Cached ₹9.72/1M

Output

Output ₹305.41/1M

View details

Z.ai: GLM 5.2Z.ai

z-ai/glm-5.2

GLM 5.2 is a large-scale reasoning model from Z.ai. It supports text input and output with a 1M-token context window, and is suited for long-horizon agent workflows, project-level software engineering,...

Chat API

Context

1.0M

Input

Input from ₹138.82/1M

Output

Output ₹436.30/1M

View details

Z.ai: GLM 5V TurboZ.ai

z-ai/glm-5v-turbo

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...

Chat APIVision

Context

203K

Input

Input from ₹118.99/1M

Cached input

Cached ₹11.90/1M

Output

Output ₹396.64/1M

View details