Author's Description
Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique ability to switch seamlessly between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance. Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsense reasoning, creative writing, and interactive dialogue capabilities. The Qwen3-30B-A3B variant includes 30.5 billion parameters (3.3 billion activated), 48 layers, 128 experts (8 activated per task), and supports up to 131K token contexts with YaRN, setting a new standard among open-source models.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Qwen3 30B A3B, a 30.5 billion parameter model from the Qwen3 series, demonstrates strong performance across a diverse range of benchmarks. While its speed ranking places it in the 16th percentile, indicating generally longer response times, it offers competitive pricing, ranking in the 47th percentile. A standout feature is its exceptional reliability, achieving a 100% success rate across all evaluated benchmarks, signifying consistent and stable operation. The model excels in several critical areas. It achieved perfect accuracy in both General Knowledge and Ethics, with the former also being noted as the most accurate model at its price point and among models of comparable speed. Its Coding and Reasoning capabilities are also very strong, scoring 94.0% and 96.0% accuracy respectively, placing it in the 92nd and 88th percentiles. Email Classification is another strength, with 99.0% accuracy. While its Hallucinations score of 96.0% is respectable, it indicates a slight tendency to not always acknowledge uncertainty. Instruction Following, at 60.8% accuracy, represents a relative area for improvement compared to its other high-performing categories. Mathematics performance is solid at 93.0% accuracy. Overall, Qwen3 30B A3B is a highly reliable model with significant strengths in knowledge, ethics, coding, and reasoning, making it a versatile option despite its slower processing speed.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.08 |
Completion | $0.29 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra
|
DeepInfra | qwen/qwen3-30b-a3b-04-28 | 40K | $0.08 / 1M tokens | $0.29 / 1M tokens |
InferenceNet
|
InferenceNet | qwen/qwen3-30b-a3b-04-28 | 16K | $0.06 / 1M tokens | $0.22 / 1M tokens |
Parasail
|
Parasail | qwen/qwen3-30b-a3b-04-28 | 40K | $0.09 / 1M tokens | $0.5 / 1M tokens |
Nebius
|
Nebius | qwen/qwen3-30b-a3b-04-28 | 40K | $0.1 / 1M tokens | $0.3 / 1M tokens |
Novita
|
Novita | qwen/qwen3-30b-a3b-04-28 | 40K | $0.09 / 1M tokens | $0.45 / 1M tokens |
Fireworks
|
Fireworks | qwen/qwen3-30b-a3b-04-28 | 131K | $0.06 / 1M tokens | $0.22 / 1M tokens |
Friendli
|
Friendli | qwen/qwen3-30b-a3b-04-28 | 131K | $0.15 / 1M tokens | $0.6 / 1M tokens |
NextBit
|
NextBit | qwen/qwen3-30b-a3b-04-28 | 32K | $0.06 / 1M tokens | $0.22 / 1M tokens |
Chutes
|
Chutes | qwen/qwen3-30b-a3b-04-28 | 40K | $0.06 / 1M tokens | $0.22 / 1M tokens |
SiliconFlow
|
SiliconFlow | qwen/qwen3-30b-a3b-04-28 | 131K | $0.09 / 1M tokens | $0.45 / 1M tokens |
NCompass
|
NCompass | qwen/qwen3-30b-a3b-04-28 | 131K | $0.08 / 1M tokens | $0.28 / 1M tokens |
Crusoe
|
Crusoe | qwen/qwen3-30b-a3b-04-28 | 40K | $0.1 / 1M tokens | $0.3 / 1M tokens |
NextBit
|
NextBit | qwen/qwen3-30b-a3b-04-28 | 32K | $0.14 / 1M tokens | $0.55 / 1M tokens |
Chutes
|
Chutes | qwen/qwen3-30b-a3b-04-28 | 40K | $0.06 / 1M tokens | $0.22 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by qwen
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Qwen: Qwen3 VL 235B A22B Thinking | Sep 23, 2025 | 235B | 131K |
Text input
Image input
Text output
|
★ | ★ | $$$$$ |
Qwen: Qwen3 VL 235B A22B Instruct | Sep 23, 2025 | 235B | 131K |
Text input
Image input
Text output
|
★★★ | ★★★★★ | $$$ |
Qwen: Qwen3 Max | Sep 23, 2025 | — | 256K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
Qwen: Qwen3 Coder Plus | Sep 23, 2025 | ~480B | 128K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
Qwen: Qwen3 Coder Flash | Sep 17, 2025 | — | 128K |
Text input
Text output
|
★★★★ | ★★★ | $$$ |
Qwen: Qwen3 Next 80B A3B Thinking | Sep 11, 2025 | 80B | 262K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Next 80B A3B Instruct | Sep 11, 2025 | 80B | 262K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
Qwen: Qwen Plus 0728 | Sep 08, 2025 | ~20B | 1M |
Text input
Text output
|
★★★★★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B Thinking 2507 | Aug 28, 2025 | 30B | 262K |
Text input
Text output
|
★★ | ★★★ | $$$$ |
Qwen: Qwen3 Coder 30B A3B Instruct | Jul 31, 2025 | 30B | 262K |
Text input
Text output
|
★★★★ | ★★★ | $$ |
Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Coder 480B A35B | Jul 22, 2025 | 480B | 1M |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K |
Text input
Text output
|
★★ | ★★★ | $$$ |
Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K |
Text input
Text output
|
★★ | ★★★★ | $$$ |
Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K |
Text input
Image input
Text output
|
★ | ★★★ | $$$ |
Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$$ |
Qwen: Qwen VL Max | Feb 01, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$$ |
Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M |
Text input
Text output
|
★★★★★ | ★★★★ | $$ |
Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K |
Text input
Image input
Text output
|
★★★★ | ★★★★ | $$ |
Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
Qwen: Qwen-Max | Feb 01, 2025 | — | 32K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
Qwen: QwQ 32B Preview Unavailable | Nov 27, 2024 | 32B | 32K |
Text input
Text output
|
— | ★ | $$ |
Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★★ | ★★★★★ | $ |
Qwen: Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K |
Text input
Text output
|
★ | ★★ | $ |
Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K |
Text input
Text output
|
★★★ | ★★ | $$ |
Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$ |
Qwen 2 72B Instruct Unavailable | Jun 06, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |