Z.AI: GLM 4.6V

Image input Text input Video input Text output
Author's Description

GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts and charts directly as visual inputs, and integrates native multimodal function calling to connect perception with downstream tool execution. The model also enables interleaved image-text generation and UI reconstruction workflows, including screenshot-to-HTML synthesis and iterative visual editing.

Key Specifications
Cost
$$$$
Context
131K
Released
Dec 08, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Reasoning Max Tokens Temperature Top P Include Reasoning
Features

This model supports the following features:

Reasoning
Performance Summary

Z.AI: GLM 4.6V, a large multimodal model from z-ai, demonstrates a balanced performance profile with notable strengths in reliability and ethical reasoning. Created on December 8, 2025, this model is designed for high-fidelity visual understanding and long-context reasoning, supporting up to 128K tokens and processing complex visual inputs directly. Its speed performance is moderate, ranking in the 37th percentile across two benchmarks, indicating it performs at an average pace compared to other models. Similarly, its pricing is moderate, positioned in the 34th percentile, suggesting it offers competitive costs without being the cheapest option. A standout feature is its exceptional reliability, achieving a 100% success rate across two benchmarks, signifying consistent and dependable operation with minimal technical failures. In terms of benchmark results, GLM 4.6V achieved a strong 99.5% accuracy in General Knowledge, placing it in the 77th percentile, though its cost and duration for this category were moderate. Its performance in the Ethics benchmark was particularly impressive, achieving a perfect 100.0% accuracy. This makes it the most accurate model at its price point and among models of similar speed for ethical reasoning. Key strengths include its robust reliability, superior ethical reasoning capabilities, and strong general knowledge. While its speed and price are moderate, its consistent performance and specialized multimodal features make it a compelling option for applications requiring high accuracy and dependable operation, especially in visually complex and ethically sensitive domains.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.3
Completion $0.9
Input Cache Read $0.05

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Z.AI
Z.AI | z-ai/glm-4.6-20251208 131K $0.3 / 1M tokens $0.9 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by z-ai