Z.ai: GLM 4.5 Air

Text input Text output Free Option
Author's Description

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

Key Specifications
Cost
$$$$$
Context
131K
Released
Jul 25, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Temperature Include Reasoning Reasoning Max Tokens Top P
Features

This model supports the following features:

Reasoning Tools
Performance Summary

Z.ai's GLM-4.5-Air, a lightweight, agent-centric model, demonstrates exceptional speed, consistently ranking among the fastest models across nine benchmarks. Its pricing is moderate, placing it in the 27th percentile. The model exhibits outstanding reliability with a 98% success rate, indicating minimal technical failures. In terms of performance across categories, GLM-4.5-Air shows strong capabilities in Reasoning (95.6% accuracy, 81st percentile) and General Knowledge (99.5% accuracy, 72nd percentile). It also performs well in Instruction Following (68.7% accuracy, 72nd percentile) and Mathematics (92.9% accuracy, 67th percentile). A notable strength is its Ethics performance, achieving 99.0% accuracy. However, the model shows a significant weakness in Email Classification, with only 80.0% accuracy (7th percentile), suggesting an area for improvement in nuanced categorization tasks. Its hallucination rate is moderate at 90.0% accuracy, indicating room for improvement in acknowledging uncertainty. The model's "thinking mode" for advanced reasoning and tool use, alongside its hybrid inference capabilities, positions it as a versatile option for agent-centric applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.2
Completion $1.1
Input Cache Read $0.03

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Z.AI
Z.AI | z-ai/glm-4.5-air 131K $0.2 / 1M tokens $1.1 / 1M tokens
DeepInfra
DeepInfra | z-ai/glm-4.5-air 131K $0.13 / 1M tokens $0.85 / 1M tokens
GMICloud
GMICloud | z-ai/glm-4.5-air 131K $0.13 / 1M tokens $0.85 / 1M tokens
SiliconFlow
SiliconFlow | z-ai/glm-4.5-air 131K $0.14 / 1M tokens $0.86 / 1M tokens
AtlasCloud
AtlasCloud | z-ai/glm-4.5-air 32K $0.13 / 1M tokens $0.85 / 1M tokens
Nebius
Nebius | z-ai/glm-4.5-air 131K $0.13 / 1M tokens $0.85 / 1M tokens
Novita
Novita | z-ai/glm-4.5-air 131K $0.13 / 1M tokens $0.85 / 1M tokens
Chutes
Chutes | z-ai/glm-4.5-air 131K $0.13 / 1M tokens $0.85 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by z-ai