Z.ai: GLM 5.1

Text input Text output
Author's Description

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Key Specifications
Cost
$$$$$
Context
202K
Released
Apr 07, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Temperature Include Reasoning Reasoning Presence Penalty Max Tokens Seed Response Format Frequency Penalty Top P Stop
Features

This model supports the following features:

Reasoning Response Format
Performance Summary

Z.ai's GLM-5.1, released on April 7, 2026, demonstrates moderate speed performance, ranking in the 20th percentile across benchmarks. It is positioned at a premium price point, falling into the 10th percentile for cost. A standout feature is its exceptional reliability, achieving a 100% success rate across all 8 benchmarks, indicating consistent and dependable operation. The model excels in several key areas. It achieves perfect accuracy in Hallucinations, General Knowledge, and Reasoning, often being the most accurate model at its price point and speed. Its coding capabilities are particularly strong, scoring 96.0% accuracy and ranking in the 96th percentile, aligning with its description of a major leap in coding. Instruction Following and Mathematics also show strong performance, with 83.0% and 95.0% accuracy respectively, placing them in high percentiles. While its Ethics and Email Classification scores are respectable at 98.0% and 97.0%, they rank lower in comparison to other models in those specific categories. The model's primary strength lies in its high accuracy across complex cognitive tasks and its robust reliability, making it a powerful tool despite its moderate speed and premium pricing.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.06
Completion $4.4
Input Cache Read $0.26

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Io Net
Io Net | z-ai/glm-5.1-20260406 202K $1.06 / 1M tokens $4.4 / 1M tokens
DeepInfra
DeepInfra | z-ai/glm-5.1-20260406 202K $0.95 / 1M tokens $3.15 / 1M tokens
Parasail
Parasail | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
Z.AI
Z.AI | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
Friendli
Friendli | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
AtlasCloud
AtlasCloud | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
Novita
Novita | z-ai/glm-5.1-20260406 204K $1.4 / 1M tokens $4.4 / 1M tokens
Fireworks
Fireworks | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
Venice
Venice | z-ai/glm-5.1-20260406 200K $1.75 / 1M tokens $5.5 / 1M tokens
GMICloud
GMICloud | z-ai/glm-5.1-20260406 202K $1.12 / 1M tokens $3.52 / 1M tokens
Together
Together | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
SiliconFlow
SiliconFlow | z-ai/glm-5.1-20260406 204K $1.4 / 1M tokens $4.4 / 1M tokens
Chutes
Chutes | z-ai/glm-5.1-20260406 202K $0.95 / 1M tokens $3.15 / 1M tokens
Inceptron
Inceptron | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
DeepInfra
DeepInfra | z-ai/glm-5.1-20260406 202K $1.4 / 1M tokens $4.4 / 1M tokens
Morph
Morph | z-ai/glm-5.1-20260406 202K $1.9 / 1M tokens $4.9 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by z-ai