Qwen: Qwen3 VL 235B A22B Thinking

Text input Image input Text output
Author's Description

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

Key Specifications
Cost
$$$$$
Context
131K
Parameters
235B
Released
Sep 23, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top P Seed Presence Penalty Reasoning Include Reasoning Temperature Response Format Tools Tool Choice Max Tokens
Features

This model supports the following features:

Tools Reasoning Response Format
Performance Summary

Qwen3 VL 235B A22B Thinking consistently ranks among the fastest models and offers highly competitive pricing across various benchmarks. This multimodal model, designed for strong text generation and visual understanding, demonstrates notable strengths in specific areas. It achieves a high accuracy of 95.2% in Hallucinations (Baseline) testing, indicating a strong ability to acknowledge uncertainty, and an impressive 81.6% in Instruction Following (Baseline), showcasing its precision in adhering to complex directives. The model also performs well in Email Classification (Baseline) with 97.8% accuracy. However, the model exhibits significant weaknesses in core cognitive benchmarks. It scores 0.0% accuracy in Coding (Baseline), General Knowledge (Baseline), and Ethics (Baseline), suggesting a lack of foundational understanding or an inability to perform well in these specific multiple-choice formats. Its performance in Reasoning (Baseline) and Mathematics (Baseline) is also very low, at 3.4% and 3.1% accuracy respectively, despite its description emphasizing optimization for multimodal reasoning in STEM and math. While its speed and cost efficiency are exceptional, the model's current performance across several critical reasoning and knowledge-based tasks indicates areas requiring substantial improvement.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.26
Completion $2.6

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3-vl-235b-a22b-thinking 131K $0.26 / 1M tokens $2.6 / 1M tokens
Novita
Novita | qwen/qwen3-vl-235b-a22b-thinking 131K $0.26 / 1M tokens $2.6 / 1M tokens
Parasail
Parasail | qwen/qwen3-vl-235b-a22b-thinking 65K $0.26 / 1M tokens $2.6 / 1M tokens
Parasail
Parasail | qwen/qwen3-vl-235b-a22b-thinking 262K $0.26 / 1M tokens $2.6 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwen3-vl-235b-a22b-thinking 262K $0.26 / 1M tokens $2.6 / 1M tokens
Chutes
Chutes | qwen/qwen3-vl-235b-a22b-thinking 262K $0.26 / 1M tokens $2.6 / 1M tokens
Novita
Novita | qwen/qwen3-vl-235b-a22b-thinking 131K $0.98 / 1M tokens $3.95 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen