THUDM: GLM 4.1V 9B Thinking

Text input Image input Text output
Author's Description

GLM-4.1V-9B-Thinking is a 9B parameter vision-language model developed by THUDM, based on the GLM-4-9B foundation. It introduces a reasoning-centric "thinking paradigm" enhanced with reinforcement learning to improve multimodal reasoning, long-context understanding (up to 64K tokens), and complex problem solving. It achieves state-of-the-art performance among models in its class, outperforming even larger models like Qwen-2.5-VL-72B on a majority of benchmark tasks.

Key Specifications
Cost
$$$
Context
65K
Parameters
9B
Released
Jul 11, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Stop Presence Penalty Logit Bias Top P Temperature Seed Min P Reasoning Frequency Penalty Max Tokens
Features

This model supports the following features:

Reasoning
Performance Summary

THUDM: GLM 4.1V 9B Thinking, a 9B parameter vision-language model, demonstrates exceptional speed and competitive pricing, consistently ranking among the fastest and most cost-effective models across various benchmarks. Its reliability is strong, with an 89th percentile ranking indicating few technical issues. While excelling in cost and speed, its accuracy performance varies significantly across categories. The model achieved strong accuracy in Ethics (94.0%) and Email Classification (94.0%), showcasing its capability in structured classification and ethical reasoning. However, it exhibited a critical weakness in Instruction Following, scoring 0.0% accuracy, suggesting a significant area for improvement in complex directive adherence. Performance in Coding (74.0%), Reasoning (66.0%), and General Knowledge (72.5%) was moderate, with particularly slow durations for Reasoning and General Knowledge tasks. Overall, GLM 4.1V 9B Thinking stands out for its efficiency and affordability, but its multimodal reasoning and complex problem-solving capabilities, particularly in instruction following, require further development to fully align with its "thinking paradigm" description.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.035
Completion $0.138

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Novita
Novita | thudm/glm-4.1v-9b-thinking 65K $0.035 / 1M tokens $0.138 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by thudm