Qwen: Qwen3 30B A3B

Text input Text output
Author's Description

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique ability to switch seamlessly between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance. Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsense reasoning, creative writing, and interactive dialogue capabilities. The Qwen3-30B-A3B variant includes 30.5 billion parameters (3.3 billion activated), 48 layers, 128 experts (8 activated per task), and supports up to 131K token contexts with YaRN, setting a new standard among open-source models.

Key Specifications
Cost
$$$$
Context
40K
Parameters
30B
Released
Apr 28, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Reasoning Include Reasoning Response Format Seed Top P Temperature Tools Stop Min P Max Tokens Frequency Penalty Presence Penalty
Features

This model supports the following features:

Tools Reasoning Response Format
Performance Summary

Qwen3 30B A3B, a 30.5 billion parameter model from the Qwen3 series, demonstrates strong performance across a diverse range of benchmarks. While its speed ranking places it in the 16th percentile, indicating generally longer response times, it offers competitive pricing, ranking in the 47th percentile. A standout feature is its exceptional reliability, achieving a 100% success rate across all evaluated benchmarks, signifying consistent and stable operation. The model excels in several critical areas. It achieved perfect accuracy in both General Knowledge and Ethics, with the former also being noted as the most accurate model at its price point and among models of comparable speed. Its Coding and Reasoning capabilities are also very strong, scoring 94.0% and 96.0% accuracy respectively, placing it in the 92nd and 88th percentiles. Email Classification is another strength, with 99.0% accuracy. While its Hallucinations score of 96.0% is respectable, it indicates a slight tendency to not always acknowledge uncertainty. Instruction Following, at 60.8% accuracy, represents a relative area for improvement compared to its other high-performing categories. Mathematics performance is solid at 93.0% accuracy. Overall, Qwen3 30B A3B is a highly reliable model with significant strengths in knowledge, ethics, coding, and reasoning, making it a versatile option despite its slower processing speed.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.08
Completion $0.29

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | qwen/qwen3-30b-a3b-04-28 40K $0.08 / 1M tokens $0.29 / 1M tokens
InferenceNet
InferenceNet | qwen/qwen3-30b-a3b-04-28 16K $0.06 / 1M tokens $0.22 / 1M tokens
Parasail
Parasail | qwen/qwen3-30b-a3b-04-28 40K $0.09 / 1M tokens $0.5 / 1M tokens
Nebius
Nebius | qwen/qwen3-30b-a3b-04-28 40K $0.1 / 1M tokens $0.3 / 1M tokens
Novita
Novita | qwen/qwen3-30b-a3b-04-28 40K $0.09 / 1M tokens $0.45 / 1M tokens
Fireworks
Fireworks | qwen/qwen3-30b-a3b-04-28 131K $0.06 / 1M tokens $0.22 / 1M tokens
Friendli
Friendli | qwen/qwen3-30b-a3b-04-28 131K $0.15 / 1M tokens $0.6 / 1M tokens
NextBit
NextBit | qwen/qwen3-30b-a3b-04-28 32K $0.06 / 1M tokens $0.22 / 1M tokens
Chutes
Chutes | qwen/qwen3-30b-a3b-04-28 40K $0.06 / 1M tokens $0.22 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwen3-30b-a3b-04-28 131K $0.09 / 1M tokens $0.45 / 1M tokens
NCompass
NCompass | qwen/qwen3-30b-a3b-04-28 131K $0.08 / 1M tokens $0.28 / 1M tokens
Crusoe
Crusoe | qwen/qwen3-30b-a3b-04-28 40K $0.1 / 1M tokens $0.3 / 1M tokens
NextBit
NextBit | qwen/qwen3-30b-a3b-04-28 32K $0.14 / 1M tokens $0.55 / 1M tokens
Chutes
Chutes | qwen/qwen3-30b-a3b-04-28 40K $0.06 / 1M tokens $0.22 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen