Z.AI: GLM 4.6 (exacto)

Text input Text output
Author's Description

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

Key Specifications
Cost
$$$$
Context
202K
Released
Sep 30, 2025
Supported Parameters

This model supports the following parameters:

Reasoning Stop Frequency Penalty Top P Response Format Temperature Include Reasoning Min P Max Tokens Tools Presence Penalty Tool Choice Seed
Features

This model supports the following features:

Response Format Reasoning Tools
Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.45
Completion $1.9

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | z-ai/glm-4.6:exacto 202K $0.45 / 1M tokens $1.9 / 1M tokens
Novita
Novita | z-ai/glm-4.6:exacto 204K $0.6 / 1M tokens $2.2 / 1M tokens
Z.AI
Z.AI | z-ai/glm-4.6:exacto 200K $0.6 / 1M tokens $2.2 / 1M tokens
DeepInfra
DeepInfra | z-ai/glm-4.6:exacto 202K $0.45 / 1M tokens $1.9 / 1M tokens
Other Models by z-ai