Qwen: Qwen Plus 0728

Text input Text output
Author's Description

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

Key Specifications
Cost
$$$
Context
1M
Parameters
20B (Rumoured)
Released
Sep 08, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tool Choice Structured Outputs Response Format Presence Penalty Top P Seed Max Tokens Temperature Tools
Features

This model supports the following features:

Tools Response Format Structured Outputs
Performance Summary

Qwen Plus 0728, a 1 million context hybrid reasoning model, demonstrates a balanced performance profile. In terms of speed, it exhibits competitive response times, ranking in the 60th percentile across evaluated benchmarks, indicating it performs among the faster models available. Its pricing is also competitive, placing it in the 59th percentile. A standout feature is its exceptional reliability, achieving a 100% success rate across all benchmarks, signifying consistent and dependable operation with minimal technical failures. Across specific benchmarks, the model shows varied performance. In Email Classification, it achieved 94.1% accuracy, placing it in the 33rd percentile, suggesting room for improvement in this specific task despite a high absolute accuracy. Its cost for this task was competitive, and duration was average. For Instruction Following, the model achieved 50.0% accuracy, ranking in the 49th percentile, indicating a moderate capability in handling complex instructions. Its cost for instruction following was competitive, though its duration was in the 71st percentile, suggesting it can be slower on more complex instruction sets. Overall, Qwen Plus 0728's key strengths lie in its high reliability, competitive speed, and cost-effectiveness, while its accuracy in specific classification and instruction following tasks presents areas for potential enhancement.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.4
Completion $1.2

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen-plus-2025-07-28 1M $0.4 / 1M tokens $1.2 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen