Qwen: Qwen-Max

Text input Text output
Author's Description

Qwen-Max, based on Qwen2.5, provides the best inference performance among [Qwen models](/qwen), especially for complex multi-step tasks. It's a large-scale MoE model that has been pretrained on over 20 trillion...

Key Specifications
Cost
$$$$
Context
32K
Released
Feb 01, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Tools Top P Response Format Temperature Presence Penalty Tool Choice Max Tokens
Features

This model supports the following features:

Response Format Tools
Performance Summary

Qwen-Max, based on Qwen2.5, demonstrates competitive response times, performing among the faster models with a 58th percentile speed ranking. Its pricing is moderate, positioned at the 30th percentile, offering a balanced cost-to-performance ratio. A standout feature is its exceptional reliability, boasting a 99% success rate across benchmarks, indicating minimal technical failures and consistent evaluable responses. In terms of specific performance, Qwen-Max achieves perfect accuracy in the Hallucinations (Baseline) test, making it the most accurate model at its price point and among models of similar speed for this category. It also shows strong performance in Instruction Following (71.0% accuracy, 77th percentile) and Reasoning (82.0% accuracy, 64th percentile), aligning with its description as excelling in complex multi-step tasks. General Knowledge (97.0% accuracy) and Ethics (99.0% accuracy) benchmarks also show solid results. Email Classification and Coding benchmarks yielded 98.0% and 86.0% accuracy respectively, indicating robust capabilities across diverse domains. Its primary strength lies in its ability to avoid hallucinations and its high reliability, making it a dependable choice for critical applications. No significant weaknesses are apparent from the provided data.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.04
Completion $4.16
Input Cache Read $0.208

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen-max-2025-01-25 32K $1.04 / 1M tokens $4.16 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen