Z.ai: GLM 5

Text input Text output
Author's Description

GLM-5 is Z.ai’s flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. Built for expert developers, it delivers production-grade performance on large-scale programming tasks, rivaling leading closed-source models. With advanced agentic planning, deep backend reasoning, and iterative self-correction, GLM-5 moves beyond code generation to full-system construction and autonomous execution.

Key Specifications
Cost
$$$$$
Context
202K
Released
Feb 11, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Response Format Stop Frequency Penalty Include Reasoning Seed Min P Structured Outputs Top P Presence Penalty Temperature Logit Bias Tool Choice Reasoning Max Tokens Tools
Features

This model supports the following features:

Tools Response Format Reasoning Structured Outputs
Performance Summary

Z.ai's GLM-5, a flagship open-source foundation model, demonstrates exceptional reliability, achieving a 95% success rate across benchmarks, indicating minimal technical failures. However, it tends to have longer response times, ranking in the 7th percentile for speed, and is positioned at premium pricing levels (11th percentile). Performance across categories reveals a strong aptitude for complex tasks. GLM-5 excels in Instruction Following (84th percentile accuracy), Reasoning (77th percentile), and Mathematics (75th percentile), showcasing its engineering for complex systems design and agent workflows. Its General Knowledge and Ethics scores are also high at 98.9% and 99.0% respectively, though their percentile rankings are more moderate. A notable weakness is its performance on Hallucinations, with only 80.0% accuracy (24th percentile), suggesting it may not consistently acknowledge uncertainty. While its Coding accuracy is solid at 87.9%, its duration for this task is among the longest. Overall, GLM-5 is a robust model for expert developers requiring high reliability and advanced reasoning, despite its slower processing and higher cost.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.95
Completion $3.15
Input Cache Read $0.19

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
AtlasCloud
AtlasCloud | z-ai/glm-5-20260211 202K $0.95 / 1M tokens $3.15 / 1M tokens
Novita
Novita | z-ai/glm-5-20260211 202K $1 / 1M tokens $3.2 / 1M tokens
Z.AI
Z.AI | z-ai/glm-5-20260211 202K $1 / 1M tokens $3.2 / 1M tokens
Phala
Phala | z-ai/glm-5-20260211 202K $1.2 / 1M tokens $3.5 / 1M tokens
GMICloud
GMICloud | z-ai/glm-5-20260211 202K $1 / 1M tokens $3.2 / 1M tokens
Parasail
Parasail | z-ai/glm-5-20260211 202K $1 / 1M tokens $3.2 / 1M tokens
Friendli
Friendli | z-ai/glm-5-20260211 202K $1 / 1M tokens $3.2 / 1M tokens
Venice
Venice | z-ai/glm-5-20260211 198K $1.1 / 1M tokens $4.15 / 1M tokens
Fireworks
Fireworks | z-ai/glm-5-20260211 202K $1 / 1M tokens $3.2 / 1M tokens
Together
Together | z-ai/glm-5-20260211 202K $1 / 1M tokens $3.2 / 1M tokens
SiliconFlow
SiliconFlow | z-ai/glm-5-20260211 204K $0.95 / 1M tokens $2.55 / 1M tokens
DeepInfra
DeepInfra | z-ai/glm-5-20260211 202K $0.8 / 1M tokens $2.56 / 1M tokens
Ambient
Ambient | z-ai/glm-5-20260211 80K $0.72 / 1M tokens $2.3 / 1M tokens
Io Net
Io Net | z-ai/glm-5-20260211 202K $0.72 / 1M tokens $2.3 / 1M tokens
BaseTen
BaseTen | z-ai/glm-5-20260211 202K $0.95 / 1M tokens $3.15 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by z-ai