Author's Description
GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Z.ai's GLM-4.5-Air, a lightweight, agent-centric model, demonstrates exceptional speed, consistently ranking among the fastest models across nine benchmarks. Its pricing is moderate, placing it in the 27th percentile. The model exhibits outstanding reliability with a 98% success rate, indicating minimal technical failures. In terms of performance across categories, GLM-4.5-Air shows strong capabilities in Reasoning (95.6% accuracy, 81st percentile) and General Knowledge (99.5% accuracy, 72nd percentile). It also performs well in Instruction Following (68.7% accuracy, 72nd percentile) and Mathematics (92.9% accuracy, 67th percentile). A notable strength is its Ethics performance, achieving 99.0% accuracy. However, the model shows a significant weakness in Email Classification, with only 80.0% accuracy (7th percentile), suggesting an area for improvement in nuanced categorization tasks. Its hallucination rate is moderate at 90.0% accuracy, indicating room for improvement in acknowledging uncertainty. The model's "thinking mode" for advanced reasoning and tool use, alongside its hybrid inference capabilities, positions it as a versatile option for agent-centric applications.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.2 |
| Completion | $1.1 |
| Input Cache Read | $0.03 |
Price History
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
|
Z.AI
|
Z.AI | z-ai/glm-4.5-air | 131K | $0.2 / 1M tokens | $1.1 / 1M tokens |
|
DeepInfra
|
DeepInfra | z-ai/glm-4.5-air | 131K | $0.13 / 1M tokens | $0.85 / 1M tokens |
|
GMICloud
|
GMICloud | z-ai/glm-4.5-air | 131K | $0.13 / 1M tokens | $0.85 / 1M tokens |
|
SiliconFlow
|
SiliconFlow | z-ai/glm-4.5-air | 131K | $0.14 / 1M tokens | $0.86 / 1M tokens |
|
AtlasCloud
|
AtlasCloud | z-ai/glm-4.5-air | 32K | $0.13 / 1M tokens | $0.85 / 1M tokens |
|
Nebius
|
Nebius | z-ai/glm-4.5-air | 131K | $0.13 / 1M tokens | $0.85 / 1M tokens |
|
Novita
|
Novita | z-ai/glm-4.5-air | 131K | $0.13 / 1M tokens | $0.85 / 1M tokens |
|
Chutes
|
Chutes | z-ai/glm-4.5-air | 131K | $0.13 / 1M tokens | $0.85 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|
Other Models by z-ai
|
|
Released | Params | Context |
|
Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| Z.ai: GLM 5.1 | Apr 07, 2026 | — | 202K |
Text input
Text output
|
★ | ★★★★★ | $$$$$ |
| Z.ai: GLM 5V Turbo | Apr 01, 2026 | — | 202K |
Text input
Image input
Video input
Text output
|
★★ | ★★★★★ | $$$$$ |
| Z.ai: GLM 5 Turbo Unavailable | Mar 15, 2026 | — | 202K |
Text input
Text output
|
— | — | $$$$$ |
| Z.ai: GLM 5 Turbo | Mar 15, 2026 | — | 202K |
Text input
Text output
|
★★ | ★★★★★ | $$$$$ |
| Z.ai: GLM 5 | Feb 11, 2026 | — | 202K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
| Z.ai: GLM 5 Unavailable | Feb 11, 2026 | — | 204K |
Text input
Text output
|
★ | ★★★★ | $ |
| Z.ai: GLM 4.7 Flash | Jan 19, 2026 | ~30B | 200K |
Text input
Text output
|
★ | ★★★ | $$$$ |
| Z.ai: GLM 4.7 | Dec 21, 2025 | — | 200K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
| Z.ai: GLM 4.6V | Dec 08, 2025 | — | 131K |
Text input
Image input
Video input
Text output
|
★★ | ★★★★★ | $$$$ |
| Z.ai: GLM 4.6 | Sep 30, 2025 | — | 200K |
Text input
Text output
|
★ | ★★★ | $$$$$ |
| Z.ai: GLM 4.6 (exacto) Unavailable | Sep 30, 2025 | — | 202K |
Text input
Text output
|
— | — | $$$$ |
| Z.ai: GLM 4.5V | Aug 11, 2025 | ~106B | 65K |
Text input
Image input
Text output
|
★★ | ★★★ | $$$$ |
| Z.ai: GLM 4.5 | Jul 25, 2025 | — | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
| Z.ai: GLM 4 32B | Jul 24, 2025 | 32B | 128K |
Text input
Text output
|
★★★ | ★ | $$ |