Qwen: Qwen3.6 Max Preview

Text input Text output
Author's Description

Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and...

Key Specifications
Cost
$$$$$
Context
262K
Parameters
1T (Rumoured)
Released
Apr 26, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Response Format Top Logprobs Tool Choice Logprobs Include Reasoning Temperature Max Tokens Reasoning Structured Outputs Presence Penalty Top P Seed
Features

This model supports the following features:

Structured Outputs Tools Reasoning Response Format
Performance Summary

Qwen3.6-Max-Preview, a proprietary frontier model from Alibaba Cloud, demonstrates exceptional reliability with a 100% success rate across benchmarks, indicating consistent and usable responses. However, its performance is characterized by longer response times, ranking in the 6th percentile for speed, and it is positioned at premium pricing levels, falling into the 4th percentile for cost competitiveness. In terms of specific benchmarks, the model achieved perfect accuracy (100.0%) in the Hallucinations (Baseline) test, effectively acknowledging uncertainty for fictional concepts. This outstanding performance was noted as the most accurate among models at its price point and speed. For Email Classification (Baseline), Qwen3.6-Max-Preview achieved a strong 99.0% accuracy, placing it in the 71st percentile. While its accuracy is commendable, both benchmarks show significant duration, with the Hallucinations test taking over 1.3 million milliseconds and Email Classification over 1 million milliseconds. Key strengths include its perfect accuracy in identifying fictional concepts and its overall high accuracy in classification tasks, coupled with its robust reliability. The primary weaknesses are its slow response times and premium pricing, which may impact its suitability for latency-sensitive or cost-constrained applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.3
Completion $7.8
Input Cache Write $1.63

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3.6-max-preview-20260420 262K $1.3 / 1M tokens $7.8 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen