Author's Description
THUDM: GLM Z1 Rumination 32B is a 32B-parameter deep reasoning model from the GLM-4-Z1 series, optimized for complex, open-ended tasks requiring prolonged deliberation. It builds upon glm-4-32b-0414 with additional reinforcement learning phases and multi-stage alignment strategies, introducing “rumination” capabilities designed to emulate extended cognitive processing. This includes iterative reasoning, multi-hop analysis, and tool-augmented workflows such as search, retrieval, and citation-aware synthesis. The model excels in research-style writing, comparative analysis, and intricate question answering. It supports function calling for search and navigation primitives (`search`, `click`, `open`, `finish`), enabling use in agent-style pipelines. Rumination behavior is governed by multi-turn loops with rule-based reward shaping and delayed decision mechanisms, benchmarked against Deep Research frameworks such as OpenAI’s internal alignment stacks. This variant is suitable for scenarios requiring depth over speed.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
THUDM: GLM Z1 Rumination 32B is a deep reasoning model designed for complex, open-ended tasks requiring prolonged deliberation. Its performance profile indicates a strategic trade-off between speed and depth. The model exhibits significantly longer response times, ranking in the 2nd percentile for speed across benchmarks, indicating it performs among the slowest models. Conversely, it offers competitive pricing, ranking in the 44th percentile. A notable strength is its strong reliability, achieving the 86th percentile, meaning it consistently provides usable responses with few technical issues. In terms of benchmark performance, the model demonstrates exceptional capabilities in complex Reasoning tasks, achieving 88.0% accuracy and ranking in the 87th percentile. This aligns with its design for iterative reasoning and multi-hop analysis. While its General Knowledge accuracy is high at 94.5%, its percentile rank (45th) suggests a solid but not top-tier performance in this area. Performance in Ethics (81.0% accuracy, 23rd percentile) and Email Classification (88.0% accuracy, 13th percentile) is less competitive, indicating areas for potential improvement. Coding performance is average at 81.0% accuracy (52nd percentile). Overall, its strength lies in deep, deliberative tasks, making it suitable for scenarios prioritizing depth over speed, such as research-style writing and intricate question answering.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.24 |
Completion | $0.24 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Novita
|
Novita | thudm/glm-z1-rumination-32b-0414 | 32K | $0.24 / 1M tokens | $0.24 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by thudm
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
THUDM: GLM 4.1V 9B Thinking | Jul 11, 2025 | 9B | 65K |
Text input
Image input
Text output
|
★ | ★★ | $$$ |
THUDM: GLM Z1 32B | Apr 17, 2025 | 32B | 32K |
Text input
Text output
|
★ | ★★★★ | $$$ |
THUDM: GLM 4 32B | Apr 17, 2025 | 32B | 32K |
Text input
Text output
|
★★ | ★★ | $$$ |