Qwen: Qwen3 Max

Text input Text output
Author's Description

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It delivers higher accuracy in math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, reduces hallucinations, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 languages with stronger translation and commonsense reasoning, and is optimized for retrieval-augmented generation (RAG) and tool calling, though it does not include a dedicated “thinking” mode.

Key Specifications
Cost
$$$$
Context
256K
Released
Sep 23, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Response Format Max Tokens Tool Choice Top P Seed Tools Temperature Presence Penalty
Features

This model supports the following features:

Tools Response Format
Performance Summary

Qwen3-Max, released in September 2025, demonstrates a strong overall performance profile, particularly excelling in reliability with a perfect 100% success rate across all benchmarks, indicating exceptional stability. The model generally performs in the top tier for speed, ranking in the 63rd percentile, and offers moderate pricing, positioned at the 34th percentile. A key strength of Qwen3-Max is its remarkable accuracy in critical areas. It achieved perfect scores (100%) in Hallucinations, General Knowledge, Ethics, and Email Classification, often being the most accurate model at its price point and speed for these categories. This highlights its robust knowledge base and ability to avoid generating fabricated information. While its Mathematics, Reasoning, and Coding scores are strong (93.7%, 88.0%, and 90.6% respectively), they are not perfect, suggesting areas for potential refinement. A notable weakness appears in Instruction Following, where it scored 66.7%, placing it in the 74th percentile, indicating room for improvement in handling highly complex, multi-layered directives despite its description of following complex instructions more reliably. Its long context length of 256,000 tokens further enhances its utility for demanding applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.2
Completion $6
Input Cache Read $0.24

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3-max 256K $1.2 / 1M tokens $6 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen