Z.ai: GLM 4.6 (exacto)

Name: Z.ai: GLM 4.6 (exacto)
Brand: z-ai
Availability: OutOfStock

Back

Text input Text output Unavailable

Author's Description

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex agentic tasks. Superior coding performance: The model achieves higher scores on code benchmarks and demonstrates better real-world performance in applications such as Claude Code、Cline、Roo Code and Kilo Code, including improvements in generating visually polished front-end pages. Advanced reasoning: GLM-4.6 shows a clear improvement in reasoning performance and supports tool use during inference, leading to stronger overall capability. More capable agents: GLM-4.6 exhibits stronger performance in tool using and search-based agents, and integrates more effectively within agent frameworks. Refined writing: Better aligns with human preferences in style and readability, and performs more naturally in role-playing scenarios.

Key Specifications

Cost

$$$$

Context

202K

Released

Sep 30, 2025

Supported Parameters

This model supports the following parameters:

Frequency Penalty Min P Reasoning Presence Penalty Stop Top P Response Format Include Reasoning Temperature Seed Max Tokens Tools Tool Choice

Features

This model supports the following features:

Tools Reasoning Response Format

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.44
Completion	$1.76
Input Cache Read	$0.11

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
DeepInfra	DeepInfra \| z-ai/glm-4.6:exacto	202K	$0.44 / 1M tokens	$1.76 / 1M tokens
Novita	Novita \| z-ai/glm-4.6:exacto	204K	$0.44 / 1M tokens	$1.76 / 1M tokens
Z.AI	Z.AI \| z-ai/glm-4.6:exacto	200K	$0.6 / 1M tokens	$2.2 / 1M tokens
DeepInfra	DeepInfra \| z-ai/glm-4.6:exacto	202K	$0.44 / 1M tokens	$1.76 / 1M tokens

No benchmark execution results are available for this model yet.

Other Models by z-ai

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
Z.ai: GLM 5.2	Jun 16, 2026	~5.2B	1M	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 5.1	Apr 07, 2026	—	202K	Text input Text output	★	★★★★★	$$$$$
Z.ai: GLM 5V Turbo	Apr 01, 2026	—	202K	Image input Video input Text input Text output	★★	★★★★	$$$$$
Z.ai: GLM 5 Turbo Unavailable	Mar 15, 2026	—	202K	Text input Text output	—	—	$$$$$
Z.ai: GLM 5 Turbo	Mar 15, 2026	—	262K	Text input Text output	★★	★★★★★	$$$$$
Z.ai: GLM 5	Feb 11, 2026	—	202K	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 5 Unavailable	Feb 11, 2026	—	204K	Text input Text output	★	★★★★	$
Z.ai: GLM 4.7 Flash	Jan 19, 2026	~30B	202K	Text input Text output	★	★★★	$$$$
Z.ai: GLM 4.7	Dec 21, 2025	—	202K	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 4.6V	Dec 08, 2025	—	131K	Image input Video input Text input Text output	★★	★★★★★	$$$$
Z.ai: GLM 4.6	Sep 30, 2025	—	202K	Text input Text output	★	★★★	$$$$$
Z.ai: GLM 4.5V	Aug 11, 2025	~106B	65K	Image input Text input Text output	★★	★★★	$$$$
Z.ai: GLM 4.5	Jul 25, 2025	—	131K	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 4.5 Air	Jul 25, 2025	—	131K	Text input Text output	★	★★	$$$$
Z.ai: GLM 4 32B Unavailable	Jul 24, 2025	32B	128K	Text input Text output	★★★	★	$$