Qwen: Qwen3 30B A3B Thinking 2507

Text input Text output
Author's Description

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for “thinking mode,” where internal reasoning traces are separated from final answers. Compared to earlier Qwen3-30B releases, this version improves performance across logical reasoning, mathematics, science, coding, and multilingual benchmarks. It also demonstrates stronger instruction following, tool use, and alignment with human preferences. With higher reasoning efficiency and extended output budgets, it is best suited for advanced research, competitive problem solving, and agentic applications requiring structured long-context reasoning.

Key Specifications
Cost
$$$$
Context
262K
Parameters
30B
Released
Aug 28, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Top Logprobs Reasoning Logprobs Include Reasoning Logit Bias Stop Top P Seed Frequency Penalty Tool Choice Max Tokens Tools Presence Penalty Temperature
Features

This model supports the following features:

Tools Reasoning
Performance Summary

Qwen3-30B-A3B-Thinking-2507 demonstrates exceptional overall performance, particularly excelling in speed and cost-efficiency. It consistently ranks among the fastest models and offers highly competitive pricing across all evaluated benchmarks. Reliability is a significant strength, with a perfect 100% success rate indicating robust technical stability. In terms of specific benchmark performance, the model achieved perfect accuracy in Ethics (Baseline), showcasing its strong alignment with ethical principles. It also performed very strongly in Reasoning (94.0% accuracy), Coding (92.9% accuracy), Email Classification (99.0% accuracy), and General Knowledge (99.5% accuracy), indicating broad capabilities across diverse cognitive tasks. A notable weakness is observed in Instruction Following (Baseline), where it scored 0.0% accuracy, suggesting a critical area for improvement in processing complex, multi-layered directives. Despite this, its high accuracy in other complex domains like reasoning and coding, combined with its optimized "thinking mode" for structured long-context reasoning, positions it well for advanced research and agentic applications. Its efficiency and reliability make it a compelling choice for competitive problem-solving.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.1
Completion $0.3

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Nebius
Nebius | qwen/qwen3-30b-a3b-thinking-2507 262K $0.1 / 1M tokens $0.3 / 1M tokens
Chutes
Chutes | qwen/qwen3-30b-a3b-thinking-2507 262K $0.0713 / 1M tokens $0.285 / 1M tokens
Alibaba
Alibaba | qwen/qwen3-30b-a3b-thinking-2507 131K $0.2 / 1M tokens $2.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by qwen