Qwen: Qwen3 Next 80B A3B Thinking

Text input Text output
Author's Description

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic planning, and reports strong results across knowledge, reasoning, coding, alignment, and multilingual evaluations. Compared with prior Qwen3 variants, it emphasizes stability under long chains of thought and efficient scaling during inference, and it is tuned to follow complex instructions while reducing repetitive or off-task behavior. The model is suitable for agent frameworks and tool use (function calling), retrieval-heavy workflows, and standardized benchmarking where step-by-step solutions are required. It supports long, detailed completions and leverages throughput-oriented techniques (e.g., multi-token prediction) for faster generation. Note that it operates in thinking-only mode.

Key Specifications
Cost
$$$$$
Context
262K
Parameters
80B
Released
Sep 11, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Response Format Presence Penalty Top P Seed Max Tokens Include Reasoning Temperature Tools Reasoning
Features

This model supports the following features:

Reasoning Tools Response Format
Performance Summary

Qwen3-Next-80B-A3B-Thinking, created on September 11, 2025, is a reasoning-first chat model designed for complex multi-step problems. It operates in a thinking-only mode, outputting structured "thinking" traces by default, making it highly suitable for agent frameworks, tool use, and retrieval-heavy workflows. The model exhibits exceptional reliability, achieving a 100% success rate across all benchmarks, indicating consistent and usable responses. However, it tends to have longer response times, ranking in the 8th percentile for speed, and is positioned at premium pricing levels, ranking in the 5th percentile for cost. In terms of performance, Qwen3-Next-80B-A3B-Thinking demonstrates significant strengths in specialized areas. It achieves outstanding accuracy in Coding (94.9%), Reasoning (96.0%), and Ethics (100%), with the Ethics benchmark highlighting it as the most accurate model at its price point and among models of similar speed. Its General Knowledge (99.5%) and Email Classification (99.0%) capabilities are also strong. A notable weakness is its Instruction Following, where it scored only 14.7% accuracy, indicating challenges with highly complex, multi-layered instructions despite its design for structured thinking. This suggests that while it excels at generating detailed thought processes, translating those into precise, multi-faceted instruction adherence remains an area for improvement.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.5
Completion $6

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3-next-80b-a3b-thinking-2509 262K $0.5 / 1M tokens $6 / 1M tokens
Novita
Novita | qwen/qwen3-next-80b-a3b-thinking-2509 65K $0.15 / 1M tokens $1.5 / 1M tokens
Chutes
Chutes | qwen/qwen3-next-80b-a3b-thinking-2509 262K $0.147 / 1M tokens $0.587 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen