Author's Description
Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table extraction, multilingual OCR). The series emphasizes robust perception (recognition of diverse real-world and synthetic categories), spatial understanding (2D/3D grounding), and long-form visual comprehension, with competitive results on public multimodal benchmarks for both perception and reasoning. Beyond analysis, Qwen3-VL supports agentic interaction and tool use: it can follow complex instructions over multi-image, multi-turn dialogues; align text to video timelines for precise temporal queries; and operate GUI elements for automation tasks. The models also enable visual coding workflows—turning sketches or mockups into code and assisting with UI debugging—while maintaining strong text-only performance comparable to the flagship Qwen3 language models. This makes Qwen3-VL suitable for production scenarios spanning document AI, multilingual OCR, software/UI assistance, spatial/embodied tasks, and research on vision-language agents.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Qwen3-VL-235B-A22B Instruct demonstrates competitive response times, performing among the faster models with a 55th percentile speed ranking. It also offers competitive pricing, ranking in the 58th percentile. Notably, the model exhibits exceptional reliability, achieving a 100% success rate across all benchmarks, indicating minimal technical failures. The model excels in several critical areas, achieving perfect accuracy in Hallucinations (100%), General Knowledge (100%), Reasoning (100%), and Ethics (100%). Its performance in these categories is often among the most accurate at its price point and speed. It also shows strong capabilities in Mathematics (92.3% accuracy) and Email Classification (98.0% accuracy). While its Instruction Following (65.7% accuracy) and Coding (80.0% accuracy) scores are respectable, they represent areas with potential for further improvement compared to its top-tier performance in other domains. Overall, Qwen3-VL-235B-A22B Instruct is a robust multimodal model with a strong foundation in perception, reasoning, and ethical understanding, making it suitable for diverse applications.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.7 |
| Completion | $2.8 |
Price History
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
|
Alibaba
|
Alibaba | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.7 / 1M tokens | $2.8 / 1M tokens |
|
Novita
|
Novita | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.22 / 1M tokens | $0.88 / 1M tokens |
|
Parasail
|
Parasail | qwen/qwen3-vl-235b-a22b-instruct | 262K | $0.22 / 1M tokens | $0.88 / 1M tokens |
|
Chutes
|
Chutes | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.22 / 1M tokens | $0.88 / 1M tokens |
|
Parasail
|
Parasail | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.5 / 1M tokens | $2.75 / 1M tokens |
|
SiliconFlow
|
SiliconFlow | qwen/qwen3-vl-235b-a22b-instruct | 262K | $0.3 / 1M tokens | $1.5 / 1M tokens |
|
DeepInfra
|
DeepInfra | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.22 / 1M tokens | $0.88 / 1M tokens |
|
DeepInfra
|
DeepInfra | qwen/qwen3-vl-235b-a22b-instruct | 262K | $0.3 / 1M tokens | $1.49 / 1M tokens |
|
Novita
|
Novita | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.3 / 1M tokens | $1.5 / 1M tokens |
|
Chutes
|
Chutes | qwen/qwen3-vl-235b-a22b-instruct | 262K | $0.3 / 1M tokens | $1.2 / 1M tokens |
|
Phala
|
Phala | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.22 / 1M tokens | $0.88 / 1M tokens |
|
Fireworks
|
Fireworks | qwen/qwen3-vl-235b-a22b-instruct | 262K | $0.22 / 1M tokens | $0.88 / 1M tokens |
|
AtlasCloud
|
AtlasCloud | qwen/qwen3-vl-235b-a22b-instruct | 131K | $0.3 / 1M tokens | $1.5 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|
Other Models by qwen
|
|
Released | Params | Context |
|
Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| Qwen: Qwen3 VL 32B Instruct Unavailable | Oct 23, 2025 | 32B | 262K |
Image input
Text input
Text output
|
★★★ | ★★★★★ | $$ |
| Qwen: Qwen3 VL 8B Thinking | Oct 14, 2025 | 8B | 256K |
Image input
Text input
Text output
|
★ | ★ | $$$$$ |
| Qwen: Qwen3 VL 8B Instruct | Oct 14, 2025 | 8B | 256K |
Image input
Text input
Text output
|
★ | ★★ | $$$ |
| Qwen: Qwen3 VL 30B A3B Thinking | Oct 06, 2025 | 30B | 262K |
Image input
Text input
Text output
|
★ | ★★★ | $$$$ |
| Qwen: Qwen3 VL 30B A3B Instruct | Oct 06, 2025 | 30B | 131K |
Image input
Text input
Text output
|
— | — | $$$ |
| Qwen: Qwen3 VL 235B A22B Thinking | Sep 23, 2025 | 235B | 131K |
Image input
Text input
Text output
|
★ | ★ | $$$$$ |
| Qwen: Qwen3 Max | Sep 23, 2025 | — | 256K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
| Qwen: Qwen3 Coder Plus | Sep 23, 2025 | ~480B | 128K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
| Qwen: Qwen3 Coder Flash | Sep 17, 2025 | — | 128K |
Text input
Text output
|
★★★★ | ★★★ | $$$ |
| Qwen: Qwen3 Next 80B A3B Thinking | Sep 11, 2025 | 80B | 262K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
| Qwen: Qwen3 Next 80B A3B Instruct | Sep 11, 2025 | 80B | 262K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
| Qwen: Qwen Plus 0728 | Sep 08, 2025 | ~20B | 1M |
Text input
Text output
|
★★★★★ | ★★★ | $$$ |
| Qwen: Qwen3 30B A3B Thinking 2507 | Aug 28, 2025 | 30B | 262K |
Text input
Text output
|
★★ | ★★★ | $$$$ |
| Qwen: Qwen3 Coder 30B A3B Instruct | Jul 31, 2025 | 30B | 262K |
Text input
Text output
|
★★★★ | ★★★ | $$ |
| Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
| Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
| Qwen: Qwen3 Coder 480B A35B | Jul 22, 2025 | 480B | 1M |
Text input
Text output
|
★★ | ★★★ | $$$ |
| Qwen: Qwen3 Coder 480B A35B (exacto) | Jul 22, 2025 | 480B | 262K |
Text input
Text output
|
— | — | $$$$ |
| Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K |
Text input
Text output
|
★★ | ★★★ | $$$ |
| Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$$ |
| Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K |
Text input
Text output
|
★ | ★★★ | $$$ |
| Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K |
Text input
Text output
|
★★ | ★★★★ | $$$ |
| Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
| Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
| Qwen: Qwen2.5 Coder 7B Instruct | Apr 15, 2025 | 7B | 32K |
Text input
Text output
|
— | — | $ |
| Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K |
Image input
Text input
Text output
|
★ | ★★★ | $$$ |
| Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★★ | $$$ |
| Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K |
Image input
Text input
Text output
|
★★★★ | ★★ | $$$ |
| Qwen: Qwen VL Max | Feb 01, 2025 | — | 131K |
Image input
Text input
Text output
|
★★★ | ★★★ | $$$$ |
| Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M |
Text input
Text output
|
★★★★★ | ★★★★ | $$ |
| Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K |
Image input
Text input
Text output
|
★★★★ | ★★★★ | $$ |
| Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
| Qwen: Qwen-Max | Feb 01, 2025 | — | 32K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
| Qwen: QwQ 32B Preview Unavailable | Nov 27, 2024 | 32B | 32K |
Text input
Text output
|
— | ★ | $$ |
| Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★★ | ★★★★★ | $ |
| Qwen: Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K |
Text input
Text output
|
★ | ★★ | $ |
| Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K |
Text input
Text output
|
★★★ | ★★ | $$$ |
| Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K |
Image input
Text input
Text output
|
★★★★ | ★★ | $$ |
| Qwen 2 72B Instruct Unavailable | Jun 06, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |