THUDM: GLM Z1 Rumination 32B

Text input Text output Unavailable
Author's Description

THUDM: GLM Z1 Rumination 32B is a 32B-parameter deep reasoning model from the GLM-4-Z1 series, optimized for complex, open-ended tasks requiring prolonged deliberation. It builds upon glm-4-32b-0414 with additional reinforcement learning phases and multi-stage alignment strategies, introducing “rumination” capabilities designed to emulate extended cognitive processing. This includes iterative reasoning, multi-hop analysis, and tool-augmented workflows such as search, retrieval, and citation-aware synthesis. The model excels in research-style writing, comparative analysis, and intricate question answering. It supports function calling for search and navigation primitives (`search`, `click`, `open`, `finish`), enabling use in agent-style pipelines. Rumination behavior is governed by multi-turn loops with rule-based reward shaping and delayed decision mechanisms, benchmarked against Deep Research frameworks such as OpenAI’s internal alignment stacks. This variant is suitable for scenarios requiring depth over speed.

Key Specifications
Cost
$$$$
Context
32K
Parameters
32B
Released
Apr 25, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Stop Presence Penalty Logit Bias Top P Temperature Seed Min P Reasoning Frequency Penalty Max Tokens
Features

This model supports the following features:

Reasoning
Performance Summary

THUDM: GLM Z1 Rumination 32B is a deep reasoning model designed for complex, open-ended tasks requiring prolonged deliberation. Its performance profile indicates a strategic trade-off between speed and depth. The model exhibits significantly longer response times, ranking in the 2nd percentile for speed across benchmarks, indicating it performs among the slowest models. Conversely, it offers competitive pricing, ranking in the 44th percentile. A notable strength is its strong reliability, achieving the 86th percentile, meaning it consistently provides usable responses with few technical issues. In terms of benchmark performance, the model demonstrates exceptional capabilities in complex Reasoning tasks, achieving 88.0% accuracy and ranking in the 87th percentile. This aligns with its design for iterative reasoning and multi-hop analysis. While its General Knowledge accuracy is high at 94.5%, its percentile rank (45th) suggests a solid but not top-tier performance in this area. Performance in Ethics (81.0% accuracy, 23rd percentile) and Email Classification (88.0% accuracy, 13th percentile) is less competitive, indicating areas for potential improvement. Coding performance is average at 81.0% accuracy (52nd percentile). Overall, its strength lies in deep, deliberative tasks, making it suitable for scenarios prioritizing depth over speed, such as research-style writing and intricate question answering.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.24
Completion $0.24

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Novita
Novita | thudm/glm-z1-rumination-32b-0414 32K $0.24 / 1M tokens $0.24 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by thudm