Qwen: Qwen-Max

Text input Text output
Author's Description

Qwen-Max, based on Qwen2.5, provides the best inference performance among [Qwen models](/qwen), especially for complex multi-step tasks. It's a large-scale MoE model that has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies. The parameter count is unknown.

Key Specifications
Cost
$$$$
Context
32K
Released
Feb 01, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Presence Penalty Tool Choice Top P Temperature Seed Tools Response Format Max Tokens
Features

This model supports the following features:

Tools Response Format
Performance Summary

Qwen-Max, a large-scale MoE model from Qwen, demonstrates competitive response times, ranking in the 54th percentile across six benchmarks. Its pricing is moderate, placing it in the 29th percentile. A standout feature is its exceptional reliability, achieving the 99th percentile with minimal technical failures, ensuring consistent and usable responses. Across benchmark categories, Qwen-Max exhibits strong performance in Instruction Following (88th percentile accuracy) and Reasoning (80th percentile accuracy), aligning with its description as excelling in complex multi-step tasks. It also shows high accuracy in Email Classification (98%) and Ethics (99%), indicating robust understanding and adherence to principles. While its General Knowledge accuracy is high at 97%, its percentile ranking (53rd) suggests a competitive but not leading position in this area. Coding performance is solid at 86% accuracy (71st percentile). The model's primary strength lies in its high accuracy across diverse tasks, particularly those requiring intricate instruction adherence and logical deduction, coupled with its remarkable reliability. No significant weaknesses are apparent in its accuracy or reliability, though its speed and cost are competitive rather than market-leading.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.6
Completion $6.4
Input Cache Read $0.64

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen-max-2025-01-25 32K $1.6 / 1M tokens $6.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by qwen