Qwen: Qwen3 235B A22B

Text input Text output
Author's Description

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

Key Specifications
Cost
$$$$
Context
40K
Parameters
235B
Released
Apr 28, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Min P Response Format Reasoning Temperature Presence Penalty Include Reasoning Tools Frequency Penalty Top P Stop Tool Choice Max Tokens
Features

This model supports the following features:

Response Format Tools Reasoning
Performance Summary

Qwen3-235B-A22B, a 235B parameter MoE model, exhibits exceptional reliability with a 100% success rate across all benchmarks, consistently providing usable responses. However, its speed performance is a notable area for improvement, tending to have longer response times, ranking in the 10th percentile. Pricing is moderate, falling into the 38th percentile. The model demonstrates significant strengths in several key areas. It achieved perfect accuracy in both General Knowledge and Email Classification, with the latter also being the most accurate and fastest among models at its price point. Its Coding capabilities are outstanding, scoring 97.0% accuracy (97th percentile), and it shows strong Reasoning abilities at 98.0% accuracy (91st percentile). Ethics performance is also solid at 99.0%. The primary weakness lies in Instruction Following, where it achieved only 40.4% accuracy (32nd percentile), indicating challenges with complex, multi-layered instructions. Despite its "thinking" mode for complex tasks, the model's overall speed remains a bottleneck.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.455
Completion $1.82

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Parasail
Parasail | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Kluster
Kluster | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
GMICloud
GMICloud | qwen/qwen3-235b-a22b-04-28 32K $0.455 / 1M tokens $1.82 / 1M tokens
Together
Together | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Nebius
Nebius | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Novita
Novita | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Fireworks
Fireworks | qwen/qwen3-235b-a22b-04-28 131K $0.455 / 1M tokens $1.82 / 1M tokens
Friendli
Friendli | qwen/qwen3-235b-a22b-04-28 131K $0.455 / 1M tokens $1.82 / 1M tokens
Cerebras
Cerebras | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Chutes
Chutes | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Chutes
Chutes | qwen/qwen3-235b-a22b-04-28 40K $0.455 / 1M tokens $1.82 / 1M tokens
Alibaba
Alibaba | qwen/qwen3-235b-a22b-04-28 131K $0.455 / 1M tokens $1.82 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen