Author's Description
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It is designed for hard multi-step problems (math proofs, code synthesis and debugging, logic, and agentic planning) and reports strong results across knowledge, reasoning, coding, alignment, and multilingual evaluations. Compared with prior Qwen3 variants, it emphasizes stability under long chains of thought and efficient scaling during inference, and it is tuned to follow complex instructions while reducing repetitive or off-task behavior. The model is suitable for agent frameworks and tool use (function calling), retrieval-heavy workflows, and standardized benchmarking where step-by-step solutions are required. It supports long, detailed completions and uses throughput-oriented techniques (e.g., multi-token prediction) for faster generation. Note that it operates in thinking-only mode.
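Because the model always emits a thinking trace, client code usually needs to separate the trace from the final answer. The following is a minimal sketch, assuming the trace ends with a `</think>` tag and may or may not start with `<think>`. The description above does not specify the delimiter, and some providers return the reasoning in a separate response field, so treat the tag names and the helper `split_thinking` as hypothetical.

```python
def split_thinking(text: str, close_tag: str = "</think>") -> tuple[str, str]:
    """Split raw model output into (reasoning, answer).

    Assumes the reasoning trace ends with `close_tag` (an assumption,
    not something the listing confirms). If the tag is missing, the
    whole text is treated as the answer.
    """
    open_tag = "<think>"
    head, sep, tail = text.partition(close_tag)
    if not sep:
        # No closing tag found: nothing to separate.
        return "", text.strip()
    # Drop an optional opening tag at the start of the trace.
    reasoning = head.strip().removeprefix(open_tag).strip()
    return reasoning, tail.strip()


if __name__ == "__main__":
    raw = "<think>2 + 2 is 4</think>\nThe answer is 4."
    reasoning, answer = split_thinking(raw)
    print("reasoning:", reasoning)
    print("answer:", answer)
```

Keeping the trace and the answer apart makes it possible to log or discard the reasoning without affecting what is shown to the end user.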
Performance Summary
Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model built for complex multi-step problems, and it emits structured "thinking" traces. It is highly reliable, with a 99% success rate, but it is slow (12th percentile for speed) and priced at the premium end (5th percentile for pricing).

Accuracy is high in Coding (94.9%, 93rd percentile), Reasoning (96.0%, 86th percentile), and General Knowledge (99.5%, 72nd percentile). Ethics accuracy is 100%, which makes it the most accurate model at its price point and among models of similar speed. It scores 98.0% accuracy on hallucination tests. Mathematics performance is moderate at 88.9% (61st percentile).

The main weakness is Instruction Following, where it scores only 14.7% accuracy (25th percentile), which points to difficulty with complex, multi-layered instructions.

The model is best suited to applications that need detailed, step-by-step solutions and high reliability, particularly agent frameworks and tool use, where the "thinking-only" mode can justify the slower responses and higher cost.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.5 |
| Completion | $6 |
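The per-million-token prices above translate into a per-request cost with simple arithmetic. The sketch below assumes that thinking tokens are billed at the completion rate, which is common for reasoning models but is not stated on this page; the function name `estimate_cost_usd` is illustrative.

```python
# Prices from the table above, in USD per 1M tokens.
PROMPT_USD_PER_M = 0.5
COMPLETION_USD_PER_M = 6.0


def estimate_cost_usd(
    prompt_tokens: int,
    completion_tokens: int,
    prompt_rate: float = PROMPT_USD_PER_M,
    completion_rate: float = COMPLETION_USD_PER_M,
) -> float:
    """Estimate the cost of one request in USD.

    `completion_tokens` should include thinking tokens if the provider
    bills them as completion tokens (an assumption, see the lead-in).
    """
    total = prompt_tokens * prompt_rate + completion_tokens * completion_rate
    return total / 1_000_000


if __name__ == "__main__":
    # 2,000 prompt tokens and 8,000 completion tokens (answer plus trace).
    print(f"${estimate_cost_usd(2_000, 8_000):.4f}")
```

Since a thinking-only model tends to produce long completions, the completion rate is likely to dominate the bill for most workloads.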
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
| Alibaba | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.5 / 1M tokens | $6 / 1M tokens |
| Novita | qwen/qwen3-next-80b-a3b-thinking-2509 | 131K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| Chutes | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| DeepInfra | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| Hyperbolic | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.3 / 1M tokens | $0.3 / 1M tokens |
| GMICloud | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| AtlasCloud | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.5 / 1M tokens |
| NCompass | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| Together | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.5 / 1M tokens |
| Parasail | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| Google | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| Parasail | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| Chutes | qwen/qwen3-next-80b-a3b-thinking-2509 | 262K | $0.15 / 1M tokens | $1.2 / 1M tokens |
| Novita | qwen/qwen3-next-80b-a3b-thinking-2509 | 131K | $0.15 / 1M tokens | $1.5 / 1M tokens |
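Because providers differ in both context length and price, the cheapest endpoint depends on the shape of the workload. The sketch below picks the lowest-cost endpoint for a given request size, using a subset of the listed endpoints. The `Endpoint` class and the function names are hypothetical, and the selection ignores factors this table does not cover, such as latency or availability.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Endpoint:
    provider: str
    context_k: int            # context length in thousands of tokens
    input_usd_per_m: float    # USD per 1M prompt tokens
    output_usd_per_m: float   # USD per 1M completion tokens


# A subset of the endpoints listed above, copied from the table.
ENDPOINTS = [
    Endpoint("Alibaba", 262, 0.5, 6.0),
    Endpoint("Novita", 131, 0.15, 1.2),
    Endpoint("DeepInfra", 262, 0.15, 1.2),
    Endpoint("Hyperbolic", 262, 0.3, 0.3),
    Endpoint("AtlasCloud", 262, 0.15, 1.5),
]


def request_cost_usd(ep: Endpoint, prompt_tokens: int, completion_tokens: int) -> float:
    """Cost of one request on `ep`, in USD."""
    total = prompt_tokens * ep.input_usd_per_m + completion_tokens * ep.output_usd_per_m
    return total / 1_000_000


def cheapest_endpoint(
    endpoints: list[Endpoint],
    min_context_k: int,
    prompt_tokens: int,
    completion_tokens: int,
) -> Endpoint:
    """Return the lowest-cost endpoint with at least `min_context_k` context."""
    candidates = [ep for ep in endpoints if ep.context_k >= min_context_k]
    if not candidates:
        raise ValueError("no endpoint offers the requested context length")
    return min(
        candidates,
        key=lambda ep: request_cost_usd(ep, prompt_tokens, completion_tokens),
    )


if __name__ == "__main__":
    balanced = cheapest_endpoint(ENDPOINTS, 200, 10_000, 10_000)
    prompt_heavy = cheapest_endpoint(ENDPOINTS, 200, 100_000, 1_000)
    print(balanced.provider, prompt_heavy.provider)
```

With the listed prices, a flat $0.3 rate for both directions wins when completions are long, while the $0.15 input rate wins when the prompt dominates.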
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|---|---|---|---|---|---|---|---|
Other Models by qwen
| Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| Qwen: Qwen3 VL 32B Instruct | Oct 23, 2025 | 32B | 262K | Image input, Text input, Text output | ★★★ | ★★★★★ | $$ |
| Qwen: Qwen3 VL 8B Thinking | Oct 14, 2025 | 8B | 256K | Image input, Text input, Text output | ★ | ★ | $$$$$ |
| Qwen: Qwen3 VL 8B Instruct | Oct 14, 2025 | 8B | 256K | Image input, Text input, Text output | ★ | ★★ | $$$ |
| Qwen: Qwen3 VL 30B A3B Thinking | Oct 06, 2025 | 30B | 262K | Image input, Text input, Text output | ★ | ★★★ | $$$$ |
| Qwen: Qwen3 VL 30B A3B Instruct | Oct 06, 2025 | 30B | 262K | Image input, Text input, Text output | — | — | $$$ |
| Qwen: Qwen3 VL 235B A22B Thinking | Sep 23, 2025 | 235B | 131K | Image input, Text input, Text output | ★ | ★ | $$$$$ |
| Qwen: Qwen3 VL 235B A22B Instruct | Sep 23, 2025 | 235B | 131K | Image input, Text input, Text output | ★★★ | ★★★★★ | $$$ |
| Qwen: Qwen3 Max | Sep 23, 2025 | — | 256K | Text input, Text output | ★★★★ | ★★★★★ | $$$$ |
| Qwen: Qwen3 Coder Plus | Sep 23, 2025 | ~480B | 128K | Text input, Text output | ★★★★ | ★★★★ | $$$$ |
| Qwen: Qwen3 Coder Flash | Sep 17, 2025 | — | 128K | Text input, Text output | ★★★★ | ★★★ | $$$ |
| Qwen: Qwen3 Next 80B A3B Instruct | Sep 11, 2025 | 80B | 262K | Text input, Text output | ★★★★ | ★★★★★ | $$$$ |
| Qwen: Qwen Plus 0728 | Sep 08, 2025 | ~20B | 1M | Text input, Text output | ★★★★★ | ★★★ | $$$ |
| Qwen: Qwen3 30B A3B Thinking 2507 | Aug 28, 2025 | 30B | 262K | Text input, Text output | ★★ | ★★★ | $$$$ |
| Qwen: Qwen3 Coder 30B A3B Instruct | Jul 31, 2025 | 30B | 262K | Text input, Text output | ★★★★ | ★★★ | $$ |
| Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K | Text input, Text output | ★★★★ | ★★★★ | $$$ |
| Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K | Text input, Text output | ★ | ★★★★ | $$$$$ |
| Qwen: Qwen3 Coder 480B A35B | Jul 22, 2025 | 480B | 1M | Text input, Text output | ★★ | ★★★ | $$$ |
| Qwen: Qwen3 Coder 480B A35B (exacto) | Jul 22, 2025 | 480B | 262K | Text input, Text output | — | — | $$$$ |
| Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K | Text input, Text output | ★★ | ★★★ | $$$ |
| Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K | Text input, Text output | ★ | ★★★★★ | $$$$ |
| Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K | Text input, Text output | ★ | ★★★ | $$$ |
| Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K | Text input, Text output | ★★ | ★★★★ | $$$ |
| Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K | Text input, Text output | ★ | ★★★★★ | $$$ |
| Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K | Text input, Text output | ★ | ★★★★ | $$$$ |
| Qwen: Qwen2.5 Coder 7B Instruct | Apr 15, 2025 | 7B | 32K | Text input, Text output | — | — | $ |
| Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K | Image input, Text input, Text output | ★ | ★★★ | $$$ |
| Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K | Text input, Text output | ★ | ★★★ | $$$ |
| Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K | Image input, Text input, Text output | ★★★★ | ★★ | $$$ |
| Qwen: Qwen VL Max | Feb 01, 2025 | — | 131K | Image input, Text input, Text output | ★★★ | ★★★ | $$$$ |
| Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M | Text input, Text output | ★★★★★ | ★★★★ | $$ |
| Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K | Image input, Text input, Text output | ★★★★ | ★★★★ | $$ |
| Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K | Text input, Text output | ★★★★ | ★★★★ | $$$ |
| Qwen: Qwen-Max | Feb 01, 2025 | — | 32K | Text input, Text output | ★★★★ | ★★★★ | $$$$ |
| Qwen: QwQ 32B Preview (Unavailable) | Nov 27, 2024 | 32B | 32K | Text input, Text output | — | ★ | $$ |
| Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K | Text input, Text output | ★★★★★ | ★★★★★ | $ |
| Qwen: Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K | Text input, Text output | ★ | ★★ | $ |
| Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K | Text input, Text output | ★★★ | ★★ | $$ |
| Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K | Image input, Text input, Text output | ★★★★ | ★★ | $$ |
| Qwen 2 72B Instruct (Unavailable) | Jun 06, 2024 | ~500B | 32K | Text input, Text output | ★★★★ | ★★ | $$$$ |