Author's Description
Qwen3-32B is a dense 32.8B-parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K-token contexts and can extend to 131K tokens using YaRN-based scaling.
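As a rough illustration of the context extension mentioned above, the sketch below derives the scaling factor implied by going from the native window to the extended one. It assumes "32K" means 32,768 tokens and "131K" means 131,072 tokens. The dictionary key names follow the rope-scaling convention commonly seen in Hugging Face model configs; treat them as an assumption and check your serving framework before relying on them.

```python
# Sketch: derive the YaRN scaling factor implied by the two context sizes.
# Assumption: "32K" = 32,768 tokens and "131K" = 131,072 tokens.
NATIVE_CONTEXT = 32_768
EXTENDED_CONTEXT = 131_072


def yarn_rope_scaling(native: int, target: int) -> dict:
    """Build a rope-scaling config fragment (key names are an assumption)."""
    if target <= native:
        raise ValueError("target context must exceed the native context")
    return {
        "rope_type": "yarn",
        "factor": target / native,
        "original_max_position_embeddings": native,
    }


config = yarn_rope_scaling(NATIVE_CONTEXT, EXTENDED_CONTEXT)
print(config["factor"])  # 4.0
```

Under these assumptions the extension corresponds to a factor of 4 over the native window.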
Performance Summary
Qwen3-32B achieved a 100% success rate across all benchmarks, indicating a stable and dependable model. It is comparatively slow, ranking in the 11th percentile for speed, while its pricing is mid-range, in the 53rd percentile.
The model is strongest in reasoning-heavy categories. It achieves perfect accuracy in General Knowledge, Reasoning, and Ethics. Its Coding and Mathematics scores rank in the 96th and 98th percentiles respectively. Email Classification accuracy is 99.0%, and Instruction Following accuracy is 55.7%.
Its main weakness is hallucination: 90.0% accuracy places it only in the 36th percentile, suggesting it sometimes fails to acknowledge uncertainty. Overall, Qwen3-32B is reliable and accurate, particularly in complex reasoning, coding, and mathematical tasks, which makes it well suited to applications that need precision more than low latency.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.1 |
Completion | $0.28 |
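To make the per-token pricing concrete, here is a minimal sketch that estimates the cost of a single request from the two prices in the table above. The function name and the example token counts are hypothetical; `Decimal` is used only to avoid floating-point rounding in currency arithmetic.

```python
from decimal import Decimal

# Prices from the table above, in USD per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.10")
COMPLETION_PRICE_PER_M = Decimal("0.28")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = (
        prompt_tokens * PROMPT_PRICE_PER_M
        + completion_tokens * COMPLETION_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Example: an 8,000-token prompt with a 2,000-token completion.
cost = request_cost(8_000, 2_000)
print(f"${cost}")
```

For that example the prompt contributes $0.0008 and the completion $0.00056, for a total of $0.00136.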
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra | qwen/qwen3-32b-04-28 | 40K | $0.1 / 1M tokens | $0.28 / 1M tokens |
Nebius | qwen/qwen3-32b-04-28 | 40K | $0.1 / 1M tokens | $0.3 / 1M tokens |
Lambda | qwen/qwen3-32b-04-28 | 40K | $0.03 / 1M tokens | $0.13 / 1M tokens |
Novita | qwen/qwen3-32b-04-28 | 40K | $0.1 / 1M tokens | $0.45 / 1M tokens |
Parasail | qwen/qwen3-32b-04-28 | 40K | $0.03 / 1M tokens | $0.13 / 1M tokens |
GMICloud | qwen/qwen3-32b-04-28 | 32K | $0.1 / 1M tokens | $0.6 / 1M tokens |
Nebius | qwen/qwen3-32b-04-28 | 40K | $0.2 / 1M tokens | $0.6 / 1M tokens |
Cerebras | qwen/qwen3-32b-04-28 | 131K | $0.4 / 1M tokens | $0.8 / 1M tokens |
SambaNova | qwen/qwen3-32b-04-28 | 32K | $0.4 / 1M tokens | $0.8 / 1M tokens |
Groq | qwen/qwen3-32b-04-28 | 131K | $0.29 / 1M tokens | $0.59 / 1M tokens |
Friendli | qwen/qwen3-32b-04-28 | 131K | $0.15 / 1M tokens | $0.5 / 1M tokens |
Chutes | qwen/qwen3-32b-04-28 | 40K | $0.03 / 1M tokens | $0.13 / 1M tokens |
NCompass | qwen/qwen3-32b-04-28 | 40K | $0.1 / 1M tokens | $0.28 / 1M tokens |
SiliconFlow | qwen/qwen3-32b-04-28 | 131K | $0.14 / 1M tokens | $0.57 / 1M tokens |
Chutes | qwen/qwen3-32b-04-28 | 40K | $0.03 / 1M tokens | $0.13 / 1M tokens |
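Because the endpoints differ in both context length and price, the cheapest choice depends on the size of the request. The sketch below picks the lowest-cost endpoint from the table above for a given prompt and completion size. It makes two assumptions: a listed context length of "40K" is treated as 40,000 tokens, and the prompt and completion together must fit inside that window. The `Endpoint` type and `cheapest_endpoint` helper are hypothetical names used for illustration.

```python
from typing import NamedTuple, Optional


class Endpoint(NamedTuple):
    provider: str
    context_k: int       # context length in thousands of tokens, as listed
    input_price: float   # USD per 1M prompt tokens
    output_price: float  # USD per 1M completion tokens


# Rows copied from the endpoints table, in the order listed.
ENDPOINTS = [
    Endpoint("DeepInfra", 40, 0.10, 0.28),
    Endpoint("Nebius", 40, 0.10, 0.30),
    Endpoint("Lambda", 40, 0.03, 0.13),
    Endpoint("Novita", 40, 0.10, 0.45),
    Endpoint("Parasail", 40, 0.03, 0.13),
    Endpoint("GMICloud", 32, 0.10, 0.60),
    Endpoint("Nebius", 40, 0.20, 0.60),
    Endpoint("Cerebras", 131, 0.40, 0.80),
    Endpoint("SambaNova", 32, 0.40, 0.80),
    Endpoint("Groq", 131, 0.29, 0.59),
    Endpoint("Friendli", 131, 0.15, 0.50),
    Endpoint("Chutes", 40, 0.03, 0.13),
    Endpoint("NCompass", 40, 0.10, 0.28),
    Endpoint("SiliconFlow", 131, 0.14, 0.57),
    Endpoint("Chutes", 40, 0.03, 0.13),
]


def cheapest_endpoint(prompt_tokens: int, completion_tokens: int) -> Optional[Endpoint]:
    """Return the lowest-cost endpoint whose context fits the request, or None."""
    needed = prompt_tokens + completion_tokens

    def cost(e: Endpoint) -> float:
        return (prompt_tokens * e.input_price
                + completion_tokens * e.output_price) / 1_000_000

    # Assumption: "40K" means 40,000 tokens, shared by prompt and completion.
    candidates = [e for e in ENDPOINTS if e.context_k * 1_000 >= needed]
    return min(candidates, key=cost, default=None)


best = cheapest_endpoint(100_000, 2_000)
print(best.provider if best else "no endpoint fits")
```

For a 100,000-token prompt only the 131K endpoints qualify, and SiliconFlow comes out cheapest. For short requests the three $0.03 / $0.13 endpoints tie, and the helper returns the first one listed.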
Other Models by qwen
Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Qwen: Qwen3 VL 235B A22B Thinking | Sep 23, 2025 | 235B | 131K | Text input, Image input, Text output | ★ | ★ | $$$$$ |
Qwen: Qwen3 VL 235B A22B Instruct | Sep 23, 2025 | 235B | 131K | Text input, Image input, Text output | ★★★ | ★★★★★ | $$$ |
Qwen: Qwen3 Max | Sep 23, 2025 | — | 256K | Text input, Text output | ★★★★ | ★★★★★ | $$$$ |
Qwen: Qwen3 Coder Plus | Sep 23, 2025 | ~480B | 128K | Text input, Text output | ★★★★ | ★★★★ | $$$$ |
Qwen: Qwen3 Coder Flash | Sep 17, 2025 | — | 128K | Text input, Text output | ★★★★ | ★★★ | $$$ |
Qwen: Qwen3 Next 80B A3B Thinking | Sep 11, 2025 | 80B | 262K | Text input, Text output | ★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Next 80B A3B Instruct | Sep 11, 2025 | 80B | 262K | Text input, Text output | ★★★★ | ★★★★★ | $$$$ |
Qwen: Qwen Plus 0728 | Sep 08, 2025 | ~20B | 1M | Text input, Text output | ★★★★★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B Thinking 2507 | Aug 28, 2025 | 30B | 262K | Text input, Text output | ★★ | ★★★ | $$$$ |
Qwen: Qwen3 Coder 30B A3B Instruct | Jul 31, 2025 | 30B | 262K | Text input, Text output | ★★★★ | ★★★ | $$ |
Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K | Text input, Text output | ★★★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K | Text input, Text output | ★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Coder 480B A35B | Jul 22, 2025 | 480B | 1M | Text input, Text output | ★ | ★★★ | $$$ |
Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K | Text input, Text output | ★★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K | Text input, Text output | ★ | ★★★★★ | $$$$ |
Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K | Text input, Text output | ★ | ★★★ | $$$ |
Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K | Text input, Text output | ★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K | Text input, Text output | ★ | ★★★★ | $$$$ |
Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K | Text input, Image input, Text output | ★ | ★★★ | $$$ |
Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K | Text input, Text output | ★ | ★★★ | $$$ |
Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K | Text input, Image input, Text output | ★★★★ | ★★ | $$$ |
Qwen: Qwen VL Max | Feb 01, 2025 | — | 7K | Text input, Image input, Text output | ★★★ | ★★★ | $$$$ |
Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M | Text input, Text output | ★★★★★ | ★★★★ | $$ |
Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K | Text input, Image input, Text output | ★★★★ | ★★★★ | $$ |
Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K | Text input, Text output | ★★★★ | ★★★★ | $$$ |
Qwen: Qwen-Max | Feb 01, 2025 | — | 32K | Text input, Text output | ★★★★ | ★★★★ | $$$$ |
Qwen: QwQ 32B Preview (unavailable) | Nov 27, 2024 | 32B | 32K | Text input, Text output | — | ★ | $$ |
Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K | Text input, Text output | ★★★★★ | ★★★★★ | $ |
Qwen: Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K | Text input, Text output | ★ | ★★ | $ |
Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K | Text input, Text output | ★★★ | ★★ | $$ |
Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K | Text input, Image input, Text output | ★★★★ | ★★ | $$ |
Qwen 2 72B Instruct (unavailable) | Jun 06, 2024 | ~500B | 32K | Text input, Text output | ★★★★ | ★★ | $$$$ |