Author's Description
Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its ability to switch between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance. Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsense reasoning, creative writing, and interactive dialogue capabilities. The Qwen3-30B-A3B variant has 30.5 billion parameters (3.3 billion activated), 48 layers, and 128 experts (8 activated per token), and supports contexts of up to 131K tokens with YaRN.
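To make the "128 experts, 8 activated" figure concrete, here is a minimal sketch of top-k expert routing in plain Python. It assumes a generic softmax gate over the selected experts; the function name `route_token` and the random logits are illustrative only, and this is not Qwen3's actual router.

```python
import math
import random

NUM_EXPERTS = 128  # expert count quoted in the description
TOP_K = 8          # experts activated for each token


def route_token(router_logits, top_k=TOP_K):
    """Return (expert_index, weight) pairs for the top_k highest-scoring experts.

    Weights are a softmax over the selected logits only, so they sum to 1.
    Generic top-k gating sketch (assumption), not Qwen3's exact implementation.
    """
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:top_k]
    peak = max(router_logits[i] for i in top)  # subtract for numerical stability
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


# Hypothetical router scores for a single token.
rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
selected = route_token(logits)

print("experts used:", len(selected), "of", NUM_EXPERTS)
print(f"share of parameters active per token: {3.3 / 30.5:.1%}")
```

Only the selected experts run for a given token, which is why the activated parameter count (3.3 billion) is a small fraction of the total (30.5 billion).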
Performance Summary
Qwen3 30B A3B performs strongly across a range of benchmarks and is especially reliable. It ranks in the 100th percentile for reliability, consistently returning usable responses with minimal technical failures. It is comparatively slow, ranking in the 17th percentile for speed, and its pricing is mid-range, at the 49th percentile. The model reaches 100% accuracy in both Ethics and General Knowledge; in General Knowledge it is among the top three models by accuracy and the most accurate among models of comparable speed and price. It also scores well in Reasoning (98% accuracy) and Coding (94% accuracy), placing in the 95th percentile for both, and in Email Classification (99% accuracy). Its weakest area is Instruction Following, at 60.8% accuracy. Overall, Qwen3 30B A3B combines high accuracy on reasoning and knowledge tasks with outstanding reliability, at the cost of longer response times.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.08 |
Completion | $0.29 |
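As a rough illustration of how these per-million-token prices translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `request_cost` and the example token counts are assumptions for illustration; actual billing may differ by provider.

```python
PROMPT_PRICE_PER_M = 0.08      # USD per 1M prompt tokens, from the pricing table
COMPLETION_PRICE_PER_M = 0.29  # USD per 1M completion tokens, from the pricing table


def request_cost(prompt_tokens, completion_tokens,
                 prompt_price=PROMPT_PRICE_PER_M,
                 completion_price=COMPLETION_PRICE_PER_M):
    """Estimated USD cost of one request at per-million-token prices."""
    return (prompt_tokens * prompt_price
            + completion_tokens * completion_price) / 1_000_000


# Hypothetical request: 12,000 prompt tokens and 1,500 completion tokens.
cost = request_cost(prompt_tokens=12_000, completion_tokens=1_500)
print(f"estimated cost: ${cost:.6f}")
```

Completion tokens cost more than three times as much as prompt tokens here, so long outputs (including thinking-mode reasoning, if billed as completion tokens) tend to dominate the total.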
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra | qwen/qwen3-30b-a3b-04-28 | 40K | $0.08 / 1M tokens | $0.29 / 1M tokens |
InferenceNet | qwen/qwen3-30b-a3b-04-28 | 16K | $0.02 / 1M tokens | $0.08 / 1M tokens |
Parasail | qwen/qwen3-30b-a3b-04-28 | 40K | $0.09 / 1M tokens | $0.50 / 1M tokens |
Nebius | qwen/qwen3-30b-a3b-04-28 | 40K | $0.10 / 1M tokens | $0.30 / 1M tokens |
Novita | qwen/qwen3-30b-a3b-04-28 | 40K | $0.10 / 1M tokens | $0.45 / 1M tokens |
Fireworks | qwen/qwen3-30b-a3b-04-28 | 131K | $0.02 / 1M tokens | $0.08 / 1M tokens |
Friendli | qwen/qwen3-30b-a3b-04-28 | 131K | $0.15 / 1M tokens | $0.60 / 1M tokens |
NextBit | qwen/qwen3-30b-a3b-04-28 | 32K | $0.02 / 1M tokens | $0.08 / 1M tokens |
Chutes | qwen/qwen3-30b-a3b-04-28 | 40K | $0.02 / 1M tokens | $0.08 / 1M tokens |
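Because the endpoints differ in both context length and price, choosing one is a small filtering problem. The sketch below, using the figures listed for each provider, picks the cheapest endpoint whose context window fits a request. The `Endpoint` class and `cheapest_endpoint` function are hypothetical helpers, and context lengths are approximated as K = 1,000 tokens, which may not match each provider's exact limit.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Endpoint:
    provider: str
    context_tokens: int   # approximate: "40K" is treated as 40,000 (assumption)
    input_price: float    # USD per 1M input tokens
    output_price: float   # USD per 1M output tokens


# Values copied from the endpoint listing for qwen/qwen3-30b-a3b-04-28.
ENDPOINTS = [
    Endpoint("DeepInfra", 40_000, 0.08, 0.29),
    Endpoint("InferenceNet", 16_000, 0.02, 0.08),
    Endpoint("Parasail", 40_000, 0.09, 0.50),
    Endpoint("Nebius", 40_000, 0.10, 0.30),
    Endpoint("Novita", 40_000, 0.10, 0.45),
    Endpoint("Fireworks", 131_000, 0.02, 0.08),
    Endpoint("Friendli", 131_000, 0.15, 0.60),
    Endpoint("NextBit", 32_000, 0.02, 0.08),
    Endpoint("Chutes", 40_000, 0.02, 0.08),
]


def cheapest_endpoint(prompt_tokens, completion_tokens, endpoints=ENDPOINTS):
    """Return the lowest-cost endpoint whose context fits the request, or None.

    Ties are broken by list order, since min() keeps the first minimum it sees.
    """
    needed = prompt_tokens + completion_tokens
    fitting = [e for e in endpoints if e.context_tokens >= needed]
    if not fitting:
        return None

    def cost(e):
        return (prompt_tokens * e.input_price
                + completion_tokens * e.output_price) / 1_000_000

    return min(fitting, key=cost)


# Hypothetical long-context request: 50,000 prompt + 2,000 completion tokens.
choice = cheapest_endpoint(50_000, 2_000)
print(choice.provider if choice else "no endpoint fits")
```

Price is only one criterion; a fuller comparison would also weigh latency, throughput, and reliability, none of which are listed per endpoint here.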
Other Models by qwen
Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K | Text input, Text output | ★★★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K | Text input, Text output | ★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Coder | Jul 22, 2025 | 480B | 1M | Text input, Text output | ★★★★ | ★★★ | $$$ |
Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K | Text input, Text output | ★ | ★★★ | $$$ |
Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K | Text input, Text output | ★ | ★★★ | $$$ |
Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K | Text input, Text output | ★★ | ★★★★★ | $$$ |
Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K | Text input, Text output | ★ | ★★★★★ | $$$ |
Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K | Text input, Text output | ★ | ★★★★ | $$$$ |
Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K | Text input, Image input, Text output | ★★ | ★★★ | $$$ |
Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K | Text input, Text output | ★ | ★★★ | $$$ |
Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K | Text input, Image input, Text output | ★★★★ | ★★ | $$$ |
Qwen: Qwen VL Max | Feb 01, 2025 | — | 7K | Text input, Image input, Text output | ★★★★★ | ★★★ | $$$$ |
Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M | Text input, Text output | ★★★★★ | ★★★★ | $$ |
Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K | Text input, Image input, Text output | ★★★★ | ★★★★ | $$ |
Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K | Text input, Text output | ★★★ | ★★★★ | $$$ |
Qwen: Qwen-Max | Feb 01, 2025 | — | 32K | Text input, Text output | ★★★ | ★★★★ | $$$$ |
Qwen: QwQ 32B Preview | Nov 27, 2024 | 32B | 32K | Text input, Text output | — | ★ | $$ |
Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | 32B | 32K | Text input, Text output | ★★★★★ | ★★★★★ | $ |
Qwen2.5 7B Instruct | Oct 15, 2024 | 7B | 32K | Text input, Text output | ★ | ★★★ | $$ |
Qwen2.5 72B Instruct | Sep 18, 2024 | 72B | 32K | Text input, Text output | ★★★ | ★★★ | $$$ |
Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | 7B | 32K | Text input, Image input, Text output | ★★★ | ★★★ | $$$ |
Qwen 2 72B Instruct | Jun 06, 2024 | 72B | 32K | Text input, Text output | ★★★★ | ★★ | $$$$ |