Qwen: Qwen-Max

Text input Text output
Author's Description

Qwen-Max, based on Qwen2.5, provides the best inference performance among [Qwen models](/qwen), especially for complex multi-step tasks. It's a large-scale MoE model that has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies. The parameter count is unknown.

Key Specifications
Cost
$$$$
Context
32K
Released
Feb 01, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Response Format Seed Top P Max Tokens Temperature Presence Penalty
Features

This model supports the following features:

Tools Response Format
Performance Summary

Qwen-Max, released on February 1, 2025, demonstrates competitive response times, ranking in the 55th percentile across seven benchmarks. It offers moderate pricing, positioned in the 27th percentile, making it a cost-effective option for many applications. A standout feature is its exceptional reliability, boasting a 99% success rate, indicating minimal technical failures and consistent performance. The model exhibits perfect accuracy in Hallucinations (Baseline) tests, effectively acknowledging uncertainty for fictional concepts, and is noted as the most accurate model at its price point and speed. It also performs strongly in Ethics (99% accuracy) and Email Classification (98% accuracy). Key strengths include its robust instruction following capabilities, achieving 71% accuracy and ranking in the 82nd percentile, and solid performance in Reasoning (82% accuracy) and Coding (86% accuracy). While its General Knowledge (97% accuracy) is respectable, it falls within the 48th percentile, suggesting room for improvement compared to top-tier models in this specific area. Overall, Qwen-Max excels in tasks requiring precision, ethical considerations, and complex instruction adherence, making it particularly well-suited for multi-step tasks as described by its provider.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.6
Completion $6.4
Input Cache Read $0.64

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen-max-2025-01-25 32K $1.6 / 1M tokens $6.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen