Author's Description
QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Qwen: QwQ 32B, a reasoning model from Qwen, demonstrates exceptional overall performance, particularly excelling in speed and reliability. It consistently ranks among the fastest models, achieving an Infinityth percentile in speed across seven benchmarks. Pricing is competitive, placing it in the 54th percentile. Reliability is a significant strength, with a perfect 100% success rate across all benchmarks, indicating minimal technical failures. In terms of specific performance, QwQ 32B shows strong capabilities in reasoning and knowledge-based tasks, achieving 98.0% accuracy in Reasoning (95th percentile) and 99.0% in General Knowledge (71st percentile). Its Ethics performance is perfect at 100% accuracy, notably being the most accurate model at its price point and speed. Coding accuracy is also robust at 91.0% (80th percentile). While its initial Instruction Following (Baseline) result was 0.0%, a subsequent run showed a more representative 53.0% accuracy. The model's primary strength lies in its reasoning capabilities, aligning with its description as a reasoning model. Its main weakness appears to be inconsistency in instruction following, as evidenced by the disparate results.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.075 |
Completion | $0.15 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra
|
DeepInfra | qwen/qwq-32b | 131K | $0.075 / 1M tokens | $0.15 / 1M tokens |
Nebius
|
Nebius | qwen/qwq-32b | 131K | $0.5 / 1M tokens | $1.5 / 1M tokens |
InferenceNet
|
InferenceNet | qwen/qwq-32b | 16K | $0.075 / 1M tokens | $0.15 / 1M tokens |
Groq
|
Groq | qwen/qwq-32b | 131K | $0.075 / 1M tokens | $0.15 / 1M tokens |
Hyperbolic
|
Hyperbolic | qwen/qwq-32b | 131K | $0.4 / 1M tokens | $0.4 / 1M tokens |
SambaNova
|
SambaNova | qwen/qwq-32b | 16K | $0.075 / 1M tokens | $0.15 / 1M tokens |
Cent-ML
|
Cent-ML | qwen/qwq-32b | 40K | $0.075 / 1M tokens | $0.15 / 1M tokens |
Fireworks
|
Fireworks | qwen/qwq-32b | 131K | $0.075 / 1M tokens | $0.15 / 1M tokens |
Together
|
Together | qwen/qwq-32b | 131K | $1.2 / 1M tokens | $1.2 / 1M tokens |
Nineteen
|
Nineteen | qwen/qwq-32b | 40K | $0.075 / 1M tokens | $0.15 / 1M tokens |
NextBit
|
NextBit | qwen/qwq-32b | 32K | $0.15 / 1M tokens | $0.4 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by qwen
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Coder | Jul 22, 2025 | 480B | 1M |
Text input
Text output
|
★★★★ | ★★★ | $$$ |
Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$$ |
Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K |
Text input
Text output
|
★★ | ★★★★★ | $$$ |
Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K |
Text input
Image input
Text output
|
★★ | ★★★ | $$$ |
Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$$ |
Qwen: Qwen VL Max | Feb 01, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$$ |
Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M |
Text input
Text output
|
★★★★★ | ★★★★ | $$ |
Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K |
Text input
Image input
Text output
|
★★★★ | ★★★★ | $$ |
Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K |
Text input
Text output
|
★★★ | ★★★★ | $$$ |
Qwen: Qwen-Max | Feb 01, 2025 | — | 32K |
Text input
Text output
|
★★★ | ★★★★ | $$$$ |
Qwen: QwQ 32B Preview | Nov 27, 2024 | 32B | 32K |
Text input
Text output
|
— | ★ | $$ |
Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★★ | ★★★★★ | $ |
Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K |
Text input
Text output
|
★ | ★★★ | $$ |
Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K |
Text input
Text output
|
★★★ | ★★★ | $$$ |
Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$ |
Qwen 2 72B Instruct | Jun 06, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |