Qwen: Qwen3 VL 235B A22B Instruct

Text input Image input Text output
Author's Description

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

Key Specifications
Cost
$$$
Context
262K
Parameters
235B
Released
Sep 23, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Temperature Presence Penalty Max Tokens Seed Structured Outputs Response Format Top P
Features

This model supports the following features:

Structured Outputs Response Format Tools
Performance Summary

Qwen3 VL 235B A22B Instruct, a multimodal model from Qwen, demonstrates a strong and reliable performance profile. It exhibits competitive response times, performing among the faster models with a 58th percentile speed ranking across benchmarks. Furthermore, it offers cost-effective solutions, ranking in the 60th percentile for price. A standout feature is its exceptional reliability, achieving a perfect 100% success rate across all evaluated benchmarks, indicating minimal technical failures. The model excels in several critical areas, achieving perfect 100% accuracy in Hallucinations (correctly identifying fictional concepts), General Knowledge, Reasoning, and Ethics. This highlights its robust understanding and ability to acknowledge uncertainty, recall broad information, perform complex logical deductions, and adhere to ethical principles. While its Instruction Following accuracy is solid at 65.7%, there's room for improvement in handling highly complex, multi-layered instructions. Its Mathematics performance is strong at 92.3%, and it shows decent capability in Email Classification (98.0%). Coding is a relative weakness at 80.0% accuracy, placing it in the lower third of models for this category. Overall, Qwen3 VL 235B A22B Instruct is a highly reliable and accurate model, particularly strong in knowledge, reasoning, and ethical considerations, making it suitable for diverse production scenarios.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.2
Completion $0.88
Input Cache Read $0.11

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3-vl-235b-a22b-instruct 131K $0.26 / 1M tokens $1.04 / 1M tokens
Novita
Novita | qwen/qwen3-vl-235b-a22b-instruct 131K $0.2 / 1M tokens $0.88 / 1M tokens
Parasail
Parasail | qwen/qwen3-vl-235b-a22b-instruct 262K $0.2 / 1M tokens $0.88 / 1M tokens
Chutes
Chutes | qwen/qwen3-vl-235b-a22b-instruct 131K $0.2 / 1M tokens $0.88 / 1M tokens
Parasail
Parasail | qwen/qwen3-vl-235b-a22b-instruct 131K $0.21 / 1M tokens $1.9 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwen3-vl-235b-a22b-instruct 262K $0.3 / 1M tokens $1.5 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-vl-235b-a22b-instruct 131K $0.2 / 1M tokens $0.88 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-vl-235b-a22b-instruct 262K $0.2 / 1M tokens $0.88 / 1M tokens
Novita
Novita | qwen/qwen3-vl-235b-a22b-instruct 131K $0.3 / 1M tokens $1.5 / 1M tokens
Chutes
Chutes | qwen/qwen3-vl-235b-a22b-instruct 262K $0.2 / 1M tokens $0.88 / 1M tokens
Phala
Phala | qwen/qwen3-vl-235b-a22b-instruct 131K $0.2 / 1M tokens $0.88 / 1M tokens
Fireworks
Fireworks | qwen/qwen3-vl-235b-a22b-instruct 262K $0.2 / 1M tokens $0.88 / 1M tokens
AtlasCloud
AtlasCloud | qwen/qwen3-vl-235b-a22b-instruct 131K $0.3 / 1M tokens $1.5 / 1M tokens
GMICloud
GMICloud | qwen/qwen3-vl-235b-a22b-instruct 262K $0.2 / 1M tokens $0.88 / 1M tokens
Ionstream
Ionstream | qwen/qwen3-vl-235b-a22b-instruct 131K $0.2 / 1M tokens $0.88 / 1M tokens
Venice
Venice | qwen/qwen3-vl-235b-a22b-instruct 256K $0.25 / 1M tokens $1.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen