Author's Description
Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Qwen3-235B-A22B-Instruct-2507 demonstrates competitive response times, performing among the faster models with a 41st percentile speed ranking. It also offers cost-effective solutions, ranking in the 66th percentile for price. Notably, the model exhibits exceptional reliability, achieving a 99% success rate across benchmarks, indicating minimal technical failures. The model excels in several key areas. It achieved perfect accuracy in Keyword Topic Relevance Classification and Ethics, often being the most accurate model at its price point and speed. Its performance in Mathematics is outstanding, securing the #1 spot in accuracy with 97.0%, even on PhD-level problems. General Knowledge is also a strong suit, with 99.5% accuracy. While strong in Coding (85.9%) and Reasoning (84.0%), its Instruction Following accuracy (39.7%) and Email Classification (94.0%) are areas for potential improvement, ranking in the lower percentiles for those categories. Overall, this model is particularly strong in multilingual understanding, mathematical reasoning, and alignment evaluations.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.08 |
Completion | $0.55 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Parasail
|
Parasail | qwen/qwen3-235b-a22b-07-25 | 262K | $0.08 / 1M tokens | $0.55 / 1M tokens |
DeepInfra
|
DeepInfra | qwen/qwen3-235b-a22b-07-25 | 262K | $0.09 / 1M tokens | $0.6 / 1M tokens |
Targon
|
Targon | qwen/qwen3-235b-a22b-07-25 | 262K | $0.08 / 1M tokens | $0.55 / 1M tokens |
Parasail
|
Parasail | qwen/qwen3-235b-a22b-07-25 | 262K | $0.15 / 1M tokens | $0.85 / 1M tokens |
Fireworks
|
Fireworks | qwen/qwen3-235b-a22b-07-25 | 262K | $0.22 / 1M tokens | $0.88 / 1M tokens |
Targon
|
Targon | qwen/qwen3-235b-a22b-07-25 | 262K | $0.12 / 1M tokens | $0.59 / 1M tokens |
Alibaba
|
Alibaba | qwen/qwen3-235b-a22b-07-25 | 131K | $0.7 / 1M tokens | $2.8 / 1M tokens |
Together
|
Together | qwen/qwen3-235b-a22b-07-25 | 262K | $0.2 / 1M tokens | $0.6 / 1M tokens |
Novita
|
Novita | qwen/qwen3-235b-a22b-07-25 | 262K | $0.08 / 1M tokens | $0.55 / 1M tokens |
GMICloud
|
GMICloud | qwen/qwen3-235b-a22b-07-25 | 131K | $0.08 / 1M tokens | $0.55 / 1M tokens |
Novita
|
Novita | qwen/qwen3-235b-a22b-07-25 | 131K | $0.15 / 1M tokens | $0.8 / 1M tokens |
Cerebras
|
Cerebras | qwen/qwen3-235b-a22b-07-25 | 131K | $0.6 / 1M tokens | $1.2 / 1M tokens |
Chutes
|
Chutes | qwen/qwen3-235b-a22b-07-25 | 262K | $0.08 / 1M tokens | $0.55 / 1M tokens |
Nebius
|
Nebius | qwen/qwen3-235b-a22b-07-25 | 262K | $0.2 / 1M tokens | $0.6 / 1M tokens |
BaseTen
|
BaseTen | qwen/qwen3-235b-a22b-07-25 | 262K | $0.22 / 1M tokens | $0.8 / 1M tokens |
AtlasCloud
|
AtlasCloud | qwen/qwen3-235b-a22b-07-25 | 262K | $0.35 / 1M tokens | $1.2 / 1M tokens |
Google
|
Google | qwen/qwen3-235b-a22b-07-25 | 262K | $0.25 / 1M tokens | $1 / 1M tokens |
Friendli
|
Friendli | qwen/qwen3-235b-a22b-07-25 | 131K | $0.2 / 1M tokens | $0.8 / 1M tokens |
SiliconFlow
|
SiliconFlow | qwen/qwen3-235b-a22b-07-25 | 262K | $0.09 / 1M tokens | $0.6 / 1M tokens |
WandB
|
WandB | qwen/qwen3-235b-a22b-07-25 | 262K | $0.1 / 1M tokens | $0.1 / 1M tokens |
Chutes
|
Chutes | qwen/qwen3-235b-a22b-07-25 | 262K | $0.08 / 1M tokens | $0.55 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by qwen
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Qwen: Qwen3 VL 235B A22B Thinking | Sep 23, 2025 | 235B | 131K |
Text input
Image input
Text output
|
★ | ★ | $$$$$ |
Qwen: Qwen3 VL 235B A22B Instruct | Sep 23, 2025 | 235B | 131K |
Text input
Image input
Text output
|
★★★ | ★★★★★ | $$$ |
Qwen: Qwen3 Max | Sep 23, 2025 | — | 256K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
Qwen: Qwen3 Coder Plus | Sep 23, 2025 | ~480B | 128K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
Qwen: Qwen3 Coder Flash | Sep 17, 2025 | — | 128K |
Text input
Text output
|
★★★★ | ★★★ | $$$ |
Qwen: Qwen3 Next 80B A3B Thinking | Sep 11, 2025 | 80B | 262K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Next 80B A3B Instruct | Sep 11, 2025 | 80B | 262K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
Qwen: Qwen Plus 0728 | Sep 08, 2025 | ~20B | 1M |
Text input
Text output
|
★★★★★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B Thinking 2507 | Aug 28, 2025 | 30B | 262K |
Text input
Text output
|
★★ | ★★★ | $$$$ |
Qwen: Qwen3 Coder 30B A3B Instruct | Jul 31, 2025 | 30B | 262K |
Text input
Text output
|
★★★★ | ★★★ | $$ |
Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Coder 480B A35B | Jul 22, 2025 | 480B | 1M |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$$ |
Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K |
Text input
Text output
|
★★ | ★★★★ | $$$ |
Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K |
Text input
Image input
Text output
|
★ | ★★★ | $$$ |
Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$$ |
Qwen: Qwen VL Max | Feb 01, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$$ |
Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M |
Text input
Text output
|
★★★★★ | ★★★★ | $$ |
Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K |
Text input
Image input
Text output
|
★★★★ | ★★★★ | $$ |
Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
Qwen: Qwen-Max | Feb 01, 2025 | — | 32K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
Qwen: QwQ 32B Preview Unavailable | Nov 27, 2024 | 32B | 32K |
Text input
Text output
|
— | ★ | $$ |
Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★★ | ★★★★★ | $ |
Qwen: Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K |
Text input
Text output
|
★ | ★★ | $ |
Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K |
Text input
Text output
|
★★★ | ★★ | $$ |
Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$ |
Qwen 2 72B Instruct Unavailable | Jun 06, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |