Qwen: Qwen3 235B A22B Instruct 2507

Text input Text output
Author's Description

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.

Key Specifications
Cost
$$$
Context
262K
Parameters
235B
Released
Jul 21, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Stop Seed Min P Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Features

This model supports the following features:

Tools
Performance Summary

Qwen3-235B-A22B-Instruct-2507 demonstrates competitive response times, performing among the faster models with a 41st percentile speed ranking. It also offers cost-effective solutions, ranking in the 66th percentile for price. Notably, the model exhibits exceptional reliability, achieving a 99% success rate across benchmarks, indicating minimal technical failures. The model excels in several key areas. It achieved perfect accuracy in Keyword Topic Relevance Classification and Ethics, often being the most accurate model at its price point and speed. Its performance in Mathematics is outstanding, securing the #1 spot in accuracy with 97.0%, even on PhD-level problems. General Knowledge is also a strong suit, with 99.5% accuracy. While strong in Coding (85.9%) and Reasoning (84.0%), its Instruction Following accuracy (39.7%) and Email Classification (94.0%) are areas for potential improvement, ranking in the lower percentiles for those categories. Overall, this model is particularly strong in multilingual understanding, mathematical reasoning, and alignment evaluations.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.08
Completion $0.55

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | qwen/qwen3-235b-a22b-07-25 262K $0.08 / 1M tokens $0.55 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-235b-a22b-07-25 262K $0.09 / 1M tokens $0.6 / 1M tokens
Targon
Targon | qwen/qwen3-235b-a22b-07-25 262K $0.08 / 1M tokens $0.55 / 1M tokens
Parasail
Parasail | qwen/qwen3-235b-a22b-07-25 262K $0.15 / 1M tokens $0.85 / 1M tokens
Fireworks
Fireworks | qwen/qwen3-235b-a22b-07-25 262K $0.22 / 1M tokens $0.88 / 1M tokens
Targon
Targon | qwen/qwen3-235b-a22b-07-25 262K $0.12 / 1M tokens $0.59 / 1M tokens
Alibaba
Alibaba | qwen/qwen3-235b-a22b-07-25 131K $0.7 / 1M tokens $2.8 / 1M tokens
Together
Together | qwen/qwen3-235b-a22b-07-25 262K $0.2 / 1M tokens $0.6 / 1M tokens
Novita
Novita | qwen/qwen3-235b-a22b-07-25 262K $0.08 / 1M tokens $0.55 / 1M tokens
GMICloud
GMICloud | qwen/qwen3-235b-a22b-07-25 131K $0.08 / 1M tokens $0.55 / 1M tokens
Novita
Novita | qwen/qwen3-235b-a22b-07-25 131K $0.15 / 1M tokens $0.8 / 1M tokens
Cerebras
Cerebras | qwen/qwen3-235b-a22b-07-25 131K $0.6 / 1M tokens $1.2 / 1M tokens
Chutes
Chutes | qwen/qwen3-235b-a22b-07-25 262K $0.08 / 1M tokens $0.55 / 1M tokens
Nebius
Nebius | qwen/qwen3-235b-a22b-07-25 262K $0.2 / 1M tokens $0.6 / 1M tokens
BaseTen
BaseTen | qwen/qwen3-235b-a22b-07-25 262K $0.22 / 1M tokens $0.8 / 1M tokens
AtlasCloud
AtlasCloud | qwen/qwen3-235b-a22b-07-25 262K $0.35 / 1M tokens $1.2 / 1M tokens
Google
Google | qwen/qwen3-235b-a22b-07-25 262K $0.25 / 1M tokens $1 / 1M tokens
Friendli
Friendli | qwen/qwen3-235b-a22b-07-25 131K $0.2 / 1M tokens $0.8 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwen3-235b-a22b-07-25 262K $0.09 / 1M tokens $0.6 / 1M tokens
WandB
WandB | qwen/qwen3-235b-a22b-07-25 262K $0.1 / 1M tokens $0.1 / 1M tokens
Chutes
Chutes | qwen/qwen3-235b-a22b-07-25 262K $0.08 / 1M tokens $0.55 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen