Author's Description
Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and code tasks, and a "non-thinking" mode for general conversational efficiency. The model demonstrates strong reasoning ability, multilingual support (100+ languages and dialects), advanced instruction-following, and agent tool-calling capabilities. It natively handles a 32K token context window and extends up to 131K tokens using YaRN-based scaling.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Qwen3-235B-A22B, a 235B parameter MoE model, demonstrates exceptional reliability, consistently providing usable responses with minimal technical failures, ranking in the 100th percentile. However, its speed performance is a notable area for improvement, with response times tending to be longer, placing it in the 7th percentile across benchmarks. Pricing is moderate, falling within the 36th percentile. The model exhibits outstanding accuracy across several critical benchmarks. It achieved perfect scores in Email Classification, Reasoning, and General Knowledge, often being the most accurate model at its price point and among models of similar speed. Its "thinking" mode appears highly effective for complex tasks. While its Ethics performance is strong at 99.0% accuracy, its Instruction Following accuracy is comparatively lower at 40.4%, suggesting potential limitations in handling highly complex, multi-layered instructions. Its ability to seamlessly switch between "thinking" and "non-thinking" modes, coupled with strong multilingual support and agent tool-calling capabilities, positions it as a versatile model, despite its slower processing times.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.13 |
Completion | $0.6 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra
|
DeepInfra | qwen/qwen3-235b-a22b-04-28 | 40K | $0.13 / 1M tokens | $0.6 / 1M tokens |
Parasail
|
Parasail | qwen/qwen3-235b-a22b-04-28 | 40K | $0.13 / 1M tokens | $0.6 / 1M tokens |
Kluster
|
Kluster | qwen/qwen3-235b-a22b-04-28 | 40K | $0.13 / 1M tokens | $0.6 / 1M tokens |
GMICloud
|
GMICloud | qwen/qwen3-235b-a22b-04-28 | 32K | $0.13 / 1M tokens | $0.6 / 1M tokens |
Together
|
Together | qwen/qwen3-235b-a22b-04-28 | 40K | $0.2 / 1M tokens | $0.6 / 1M tokens |
Nebius
|
Nebius | qwen/qwen3-235b-a22b-04-28 | 40K | $0.2 / 1M tokens | $0.6 / 1M tokens |
Novita
|
Novita | qwen/qwen3-235b-a22b-04-28 | 40K | $0.13 / 1M tokens | $0.6 / 1M tokens |
Fireworks
|
Fireworks | qwen/qwen3-235b-a22b-04-28 | 131K | $0.22 / 1M tokens | $0.88 / 1M tokens |
Friendli
|
Friendli | qwen/qwen3-235b-a22b-04-28 | 131K | $0.13 / 1M tokens | $0.6 / 1M tokens |
Cerebras
|
Cerebras | qwen/qwen3-235b-a22b-04-28 | 40K | $0.13 / 1M tokens | $0.6 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by qwen
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Coder | Jul 22, 2025 | 480B | 1M |
Text input
Text output
|
★★★★ | ★★★ | $$$ |
Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$$ |
Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K |
Text input
Text output
|
★★ | ★★★★★ | $$$ |
Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K |
Text input
Image input
Text output
|
★★ | ★★★ | $$$ |
Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$$ |
Qwen: Qwen VL Max | Feb 01, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$$ |
Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M |
Text input
Text output
|
★★★★★ | ★★★★ | $$ |
Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K |
Text input
Image input
Text output
|
★★★★ | ★★★★ | $$ |
Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K |
Text input
Text output
|
★★★ | ★★★★ | $$$ |
Qwen: Qwen-Max | Feb 01, 2025 | — | 32K |
Text input
Text output
|
★★★ | ★★★★ | $$$$ |
Qwen: QwQ 32B Preview | Nov 27, 2024 | 32B | 32K |
Text input
Text output
|
— | ★ | $$ |
Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★★ | ★★★★★ | $ |
Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K |
Text input
Text output
|
★ | ★★★ | $$ |
Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K |
Text input
Text output
|
★★★ | ★★★ | $$$ |
Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$ |
Qwen 2 72B Instruct | Jun 06, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |