Qwen: QwQ 32B

Text input Text output
Author's Description

QwQ is the reasoning model of the Qwen series. Compared with conventional instruction-tuned models, QwQ, which is capable of thinking and reasoning, can achieve significantly enhanced performance in downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model, which is capable of achieving competitive performance against state-of-the-art reasoning models, e.g., DeepSeek-R1, o1-mini.

Key Specifications
Cost
$$$
Context
131K
Parameters
32B
Released
Mar 05, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Reasoning Include Reasoning Response Format Stop Seed Min P Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Features

This model supports the following features:

Reasoning Response Format
Performance Summary

Qwen: QwQ 32B, a reasoning model from qwen, demonstrates exceptional speed, consistently ranking among the fastest models with an Infinityth percentile across 9 benchmarks. It offers competitive pricing, placing in the 48th percentile across 8 benchmarks, and exhibits outstanding reliability with a 99% success rate. The model excels in Reasoning (98.0% accuracy, 95th percentile) and Mathematics (95.9% accuracy, 97th percentile), showcasing its strength in complex problem-solving. It also achieves perfect accuracy in Ethics (100.0%), making it a standout in this category. While performing well in Coding (91.0% accuracy) and General Knowledge (99.0% accuracy), its performance in Hallucinations (86.0% accuracy, 28th percentile) and Instruction Following (53.0% accuracy, 54th percentile) indicates areas for potential improvement. Despite some variability in accuracy, its high reliability and speed make it a strong contender for tasks requiring robust and efficient processing.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.15
Completion $0.4

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | qwen/qwq-32b 131K $0.15 / 1M tokens $0.4 / 1M tokens
Nebius
Nebius | qwen/qwq-32b 131K $0.5 / 1M tokens $1.5 / 1M tokens
InferenceNet
InferenceNet | qwen/qwq-32b 16K $0.15 / 1M tokens $0.4 / 1M tokens
Groq
Groq | qwen/qwq-32b 131K $0.15 / 1M tokens $0.4 / 1M tokens
Hyperbolic
Hyperbolic | qwen/qwq-32b 131K $0.4 / 1M tokens $0.4 / 1M tokens
SambaNova
SambaNova | qwen/qwq-32b 16K $0.15 / 1M tokens $0.4 / 1M tokens
Cent-ML
Cent-ML | qwen/qwq-32b 40K $0.15 / 1M tokens $0.4 / 1M tokens
Fireworks
Fireworks | qwen/qwq-32b 131K $0.15 / 1M tokens $0.4 / 1M tokens
Together
Together | qwen/qwq-32b 131K $1.2 / 1M tokens $1.2 / 1M tokens
Nineteen
Nineteen | qwen/qwq-32b 40K $0.15 / 1M tokens $0.4 / 1M tokens
NextBit
NextBit | qwen/qwq-32b 32K $0.15 / 1M tokens $0.4 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwq-32b 131K $0.15 / 1M tokens $0.58 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen