Qwen: Qwen3 Next 80B A3B Instruct

Text input Text output
Author's Description

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual use, while remaining robust on alignment and formatting. Compared with prior Qwen3 instruct variants, it focuses on higher throughput and stability on ultra-long inputs and multi-turn dialogues, making it well-suited for RAG, tool use, and agentic workflows that require consistent final answers rather than visible chain-of-thought. The model employs scaling-efficient training and decoding to improve parameter efficiency and inference speed, and has been validated on a broad set of public benchmarks where it reaches or approaches larger Qwen3 systems in several categories while outperforming earlier mid-sized baselines. It is best used as a general assistant, code helper, and long-context task solver in production settings where deterministic, instruction-following outputs are preferred.

Key Specifications
Cost
$$$$
Context
262K
Parameters
80B
Released
Sep 11, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Stop Frequency Penalty Seed Top Logprobs Min P Top P Presence Penalty Temperature Logit Bias Tool Choice Max Tokens Logprobs Tools
Features

This model supports the following features:

Tools
Performance Summary

Qwen3 Next 80B A3B Instruct demonstrates strong overall performance, particularly excelling in reliability with a perfect 100% success rate across all benchmarks, indicating consistent and stable operation. The model performs among the fastest models, ranking in the 77th percentile for speed, and offers competitive pricing, placing in the 50th percentile. Its key strengths lie in its exceptional accuracy in foundational tasks. It achieved perfect scores in Hallucinations (Baseline), General Knowledge (Baseline), and Ethics (Baseline), showcasing its ability to avoid generating false information, possess a broad understanding of facts, and adhere to ethical principles. It also performed very well in Email Classification (99.0% accuracy) and Coding (92.0% accuracy), indicating proficiency in practical application and programming tasks. While strong in Mathematics (93.0% accuracy) and Reasoning (88.0% accuracy), these areas, along with Instruction Following (63.0% accuracy), represent areas where there is some room for improvement compared to its perfect scores in other categories. The model is well-suited for production environments requiring deterministic, instruction-following outputs, especially in RAG, tool use, and agentic workflows.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.09
Completion $1.1

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Hyperbolic
Hyperbolic | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
Alibaba
Alibaba | qwen/qwen3-next-80b-a3b-instruct-2509 131K $0.0975 / 1M tokens $0.78 / 1M tokens
Novita
Novita | qwen/qwen3-next-80b-a3b-instruct-2509 131K $0.09 / 1M tokens $1.1 / 1M tokens
Chutes
Chutes | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
Hyperbolic
Hyperbolic | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
AtlasCloud
AtlasCloud | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.15 / 1M tokens $1.5 / 1M tokens
GMICloud
GMICloud | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
Parasail
Parasail | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
NCompass
NCompass | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
Google
Google | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.15 / 1M tokens $1.2 / 1M tokens
Parasail
Parasail | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.1 / 1M tokens $1.1 / 1M tokens
Chutes
Chutes | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
Novita
Novita | qwen/qwen3-next-80b-a3b-instruct-2509 131K $0.15 / 1M tokens $1.5 / 1M tokens
GMICloud
GMICloud | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-next-80b-a3b-instruct-2509 262K $0.09 / 1M tokens $1.1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen