Author's Description
GLM-4.1V-9B-Thinking is a 9B parameter vision-language model developed by THUDM, based on the GLM-4-9B foundation. It introduces a reasoning-centric "thinking paradigm" enhanced with reinforcement learning to improve multimodal reasoning, long-context understanding (up to 64K tokens), and complex problem solving. It achieves state-of-the-art performance among models in its class, outperforming even larger models like Qwen-2.5-VL-72B on a majority of benchmark tasks.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
THUDM: GLM 4.1V 9B Thinking, a 9B parameter vision-language model, demonstrates exceptional speed and competitive pricing, consistently ranking among the fastest and most cost-effective models across various benchmarks. Its reliability is strong, with an 89th percentile ranking indicating few technical issues. While excelling in cost and speed, its accuracy performance varies significantly across categories. The model achieved strong accuracy in Ethics (94.0%) and Email Classification (94.0%), showcasing its capability in structured classification and ethical reasoning. However, it exhibited a critical weakness in Instruction Following, scoring 0.0% accuracy, suggesting a significant area for improvement in complex directive adherence. Performance in Coding (74.0%), Reasoning (66.0%), and General Knowledge (72.5%) was moderate, with particularly slow durations for Reasoning and General Knowledge tasks. Overall, GLM 4.1V 9B Thinking stands out for its efficiency and affordability, but its multimodal reasoning and complex problem-solving capabilities, particularly in instruction following, require further development to fully align with its "thinking paradigm" description.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.035 |
Completion | $0.138 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Novita
|
Novita | thudm/glm-4.1v-9b-thinking | 65K | $0.035 / 1M tokens | $0.138 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by thudm
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
THUDM: GLM Z1 Rumination 32B Unavailable | Apr 25, 2025 | 32B | 32K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
THUDM: GLM Z1 32B | Apr 17, 2025 | 32B | 32K |
Text input
Text output
|
★ | ★★★★ | $$$ |
THUDM: GLM 4 32B | Apr 17, 2025 | 32B | 32K |
Text input
Text output
|
★★ | ★★ | $$$ |