Qwen: Qwen3 Next 80B A3B Instruct

Text input Text output
Author's Description

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual use, while remaining robust on alignment and formatting. Compared with prior Qwen3 instruct variants, it focuses on higher throughput and stability on ultra-long inputs and multi-turn dialogues, making it well-suited for RAG, tool use, and agentic workflows that require consistent final answers rather than visible chain-of-thought. The model employs scaling-efficient training and decoding to improve parameter efficiency and inference speed, and has been validated on a broad set of public benchmarks where it reaches or approaches larger Qwen3 systems in several categories while outperforming earlier mid-sized baselines. It is best used as a general assistant, code helper, and long-context task solver in production settings where deterministic, instruction-following outputs are preferred.

Key Specifications
Cost
$$$$
Context
262K
Parameters
80B
Released
Sep 11, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top Logprobs Stop Logprobs Presence Penalty Frequency Penalty Top P Tool Choice Max Tokens Min P Logit Bias Seed Temperature Tools
Features

This model supports the following features:

Tools
Performance Summary

Qwen3-Next-80B-A3B-Instruct demonstrates strong overall performance, particularly excelling in reliability with a perfect 100% success rate across all benchmarks, indicating exceptional stability and consistent response delivery. The model performs among the fastest models, ranking in the 76th percentile for speed, and offers competitive pricing, placing in the 46th percentile. Its key strengths lie in its perfect accuracy on Hallucinations, General Knowledge, and Ethics benchmarks, often achieving this at competitive price points and speeds. It also shows robust performance in Mathematics (93.0% accuracy), Email Classification (99.0% accuracy), and Coding (92.0% accuracy). While its Instruction Following (63.0% accuracy) and Reasoning (88.0% accuracy) scores are respectable, they represent areas where there is some room for improvement compared to its perfect scores in other categories. The model's design for high throughput and stability on long inputs and multi-turn dialogues, coupled with its strong performance across diverse tasks, positions it as a reliable choice for production settings requiring deterministic, instruction-following outputs.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.8

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Hyperbolic
Hyperbolic | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.1 / 1M tokens $0.8 / 1M tokens
Alibaba
Alibaba | qwen/qwen3-next-80b-a3b-instruct-2509 131K $0.5 / 1M tokens $2 / 1M tokens
Novita
Novita | qwen/qwen3-next-80b-a3b-instruct-2509 131K $0.1 / 1M tokens $0.8 / 1M tokens
Chutes
Chutes | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.1 / 1M tokens $0.8 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.14 / 1M tokens $1.1 / 1M tokens
Hyperbolic
Hyperbolic | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.3 / 1M tokens $0.3 / 1M tokens
AtlasCloud
AtlasCloud | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.15 / 1M tokens $1.5 / 1M tokens
GMICloud
GMICloud | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.15 / 1M tokens $1.5 / 1M tokens
Parasail
Parasail | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.1 / 1M tokens $0.8 / 1M tokens
NCompass
NCompass | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.1 / 1M tokens $0.8 / 1M tokens
Google
Google | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.15 / 1M tokens $1.2 / 1M tokens
Parasail
Parasail | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.1 / 1M tokens $1.1 / 1M tokens
Chutes
Chutes | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.1 / 1M tokens $0.8 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.14 / 1M tokens $1.4 / 1M tokens
Novita
Novita | qwen/qwen3-next-80b-a3b-instruct-2509 131K $0.15 / 1M tokens $1.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen