Z.ai: GLM 4.7 Flash

Name: Z.ai: GLM 4.7 Flash
Brand: z-ai
Price: 6.05e-8 USD
Availability: InStock
Rating: 2.9 (8 reviews)

Back

Text input Text output

Author's Description

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Key Specifications

Cost

$$$$

Context

202K

Parameters

30B (Rumoured)

Released

Jan 19, 2026

Speed

★

Ability

★★★

Reliability

★★

Hugging Face

Supported Parameters

This model supports the following parameters:

Include Reasoning Max Tokens Tools Tool Choice Response Format Reasoning Top P Temperature

Features

This model supports the following features:

Response Format Tools Reasoning

Performance Summary

Z.ai's GLM-4.7-Flash, a 30B-class model created on January 19, 2026, is designed to balance performance and efficiency, particularly for agentic coding use cases. With a substantial context length of 200,000, it demonstrates strong reliability, achieving a 93% success rate across benchmarks. However, the model exhibits a significant weakness in speed, consistently showing longer response times, ranking in the 1st percentile. Its pricing is moderate, falling within the 38th percentile. In terms of performance across categories, GLM-4.7-Flash excels in Email Classification, achieving perfect 100% accuracy, making it the most accurate model at its price point and speed. It also shows strong capabilities in Coding (90.7% accuracy) and Ethics (98.0% accuracy). While its General Knowledge (96.5%) and Reasoning (78.0%) are respectable, its performance in Hallucinations (67.3%) and Mathematics (74.7%) is less competitive, ranking in the lower percentiles. The model's instruction following is average at 61.5%. Overall, GLM-4.7-Flash is a reliable model with notable strengths in classification and coding, but its slow processing speed and moderate hallucination and math scores are areas for potential improvement.

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.0605
Completion	$0.4

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
Z.AI	Z.AI \| z-ai/glm-4.7-flash-20260119	200K	$0.0605 / 1M tokens	$0.4 / 1M tokens
Novita	Novita \| z-ai/glm-4.7-flash-20260119	200K	$0.07 / 1M tokens	$0.4 / 1M tokens
Phala	Phala \| z-ai/glm-4.7-flash-20260119	202K	$0.0605 / 1M tokens	$0.4 / 1M tokens
DeepInfra	DeepInfra \| z-ai/glm-4.7-flash-20260119	202K	$0.06 / 1M tokens	$0.4 / 1M tokens
Venice	Venice \| z-ai/glm-4.7-flash-20260119	128K	$0.125 / 1M tokens	$0.5 / 1M tokens
Cloudflare	Cloudflare \| z-ai/glm-4.7-flash-20260119	131K	$0.0605 / 1M tokens	$0.4 / 1M tokens

Benchmark Results

Benchmark	Category	Reasoning	Strategy	Free	Executions	Accuracy	Cost	Duration

Other Models by z-ai

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
Z.ai: GLM 5.2	Jun 16, 2026	~5.2B	1M	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 5.1	Apr 07, 2026	—	202K	Text input Text output	★	★★★★★	$$$$$
Z.ai: GLM 5V Turbo	Apr 01, 2026	—	202K	Image input Text input Video input Text output	★★	★★★★	$$$$$
Z.ai: GLM 5 Turbo Unavailable	Mar 15, 2026	—	202K	Text input Text output	—	—	$$$$$
Z.ai: GLM 5 Turbo	Mar 15, 2026	—	202K	Text input Text output	★★	★★★★★	$$$$$
Z.ai: GLM 5	Feb 11, 2026	—	202K	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 5 Unavailable	Feb 11, 2026	—	204K	Text input Text output	★	★★★★	$
Z.ai: GLM 4.7	Dec 21, 2025	—	202K	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 4.6V	Dec 08, 2025	—	131K	Image input Text input Video input Text output	★★	★★★★★	$$$$
Z.ai: GLM 4.6	Sep 30, 2025	—	202K	Text input Text output	★	★★★	$$$$$
Z.ai: GLM 4.6 (exacto) Unavailable	Sep 30, 2025	—	202K	Text input Text output	—	—	$$$$
Z.ai: GLM 4.5V	Aug 11, 2025	~106B	65K	Image input Text input Text output	★★	★★★	$$$$
Z.ai: GLM 4.5	Jul 25, 2025	—	131K	Text input Text output	★	★★★★	$$$$$
Z.ai: GLM 4.5 Air	Jul 25, 2025	—	131K	Text input Text output	★	★★	$$$$
Z.ai: GLM 4 32B Unavailable	Jul 24, 2025	32B	128K	Text input Text output	★★★	★	$$