Qwen: Qwen3.6 Flash

Video input Text input Image input Text output
Author's Description

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...

Key Specifications
Cost
$$$$$
Context
1M
Released
Apr 26, 2026
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Presence Penalty Seed Top P Include Reasoning Max Tokens Response Format Tool Choice Tools Temperature Structured Outputs Reasoning
Features

This model supports the following features:

Response Format Tools Structured Outputs Reasoning
Performance Summary

Qwen3.6 Flash, released by qwen on April 26, 2026, is a fast and efficient multimodal language model with a substantial 1M token context window. It consistently ranks among the fastest models, achieving an Infinityth percentile across 8 benchmarks, and offers highly competitive pricing, also at an Infinityth percentile across 4 benchmarks. The model demonstrates strong reliability with an 88% success rate across 8 benchmarks, indicating consistent operational stability. In terms of benchmark performance, Qwen3.6 Flash shows a notable strength in "Hallucinations (Baseline)" with 96.0% accuracy, suggesting a good ability to acknowledge uncertainty. It also performs well in "Email Classification (Baseline)" at 99.0% accuracy, indicating strong contextual understanding for categorization tasks. Its "Instruction Following (Baseline)" is respectable at 70.0% accuracy. However, a significant weakness is observed in core cognitive areas such as "Coding," "General Knowledge," "Reasoning," "Ethics," and "Mathematics," where it scored 0.0% accuracy across all these benchmarks. This suggests that while the model excels in speed, cost-efficiency, and certain classification/hallucination avoidance tasks, its capabilities in complex problem-solving, factual recall, and ethical reasoning are currently undeveloped or not adequately captured by these specific tests.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.188
Completion $1.13
Input Cache Write $0.234

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3.6-flash 1M $0.188 / 1M tokens $1.13 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen