Author's Description
Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Qwen3 4B, a 4 billion parameter model from qwen, demonstrates moderate speed performance, ranking in the 20th percentile across benchmarks. It offers competitive pricing, positioned at the 50th percentile. Notably, the model exhibits exceptional reliability with a 98% success rate, indicating minimal technical failures and consistent response generation. In terms of performance across categories, Qwen3 4B shows strong capabilities in Reasoning (96.0% accuracy, 87th percentile) and General Knowledge (99.0% accuracy, 66th percentile), suggesting proficiency in complex problem-solving and broad factual recall. Its Mathematics performance is also solid at 89.0% accuracy (58th percentile). However, the model struggles with Instruction Following (44.9% accuracy, 39th percentile) and Hallucinations (82.0% accuracy, 28th percentile), indicating areas for improvement in adhering to complex directives and acknowledging uncertainty. Email Classification also presents a weakness with 93.0% accuracy (22nd percentile). Its dual-mode architecture aims to balance high-precision reasoning with efficient dialogue, making it suitable for multi-turn chat and agent workflows despite some accuracy limitations in specific tasks.
Model Pricing
Current Pricing
| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.0715 |
| Completion | $0.273 |
Price History
Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
|
Alibaba
|
Alibaba | qwen/qwen3-4b-04-28 | 131K | $0.0715 / 1M tokens | $0.273 / 1M tokens |
Benchmark Results
| Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
|---|
Other Models by qwen
|
|
Released | Params | Context |
|
Speed | Ability | Cost |
|---|---|---|---|---|---|---|---|
| Qwen: Qwen3.5 Plus 2026-02-15 | Feb 16, 2026 | — | 1M |
Image input
Text input
Video input
Text output
|
★★★★ | ★ | $$$ |
| Qwen: Qwen3.5 397B A17B | Feb 15, 2026 | 397B | 262K |
Image input
Text input
Video input
Text output
|
★ | ★★★★★ | $$$$$ |
| Qwen: Qwen3 Max Thinking | Feb 09, 2026 | — | 262K |
Text input
Text output
|
★★ | ★★★★ | $$$$$ |
| Qwen: Qwen3 Coder Next | Feb 03, 2026 | ~80B | 262K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
| Qwen: Qwen3 VL 32B Instruct | Oct 23, 2025 | 32B | 262K |
Image input
Text input
Text output
|
★★★ | ★★★★★ | $$ |
| Qwen: Qwen3 VL 8B Thinking | Oct 14, 2025 | 8B | 131K |
Image input
Text input
Text output
|
★ | ★ | $$$$$ |
| Qwen: Qwen3 VL 8B Instruct | Oct 14, 2025 | 8B | 131K |
Image input
Text input
Text output
|
★ | ★★ | $$$ |
| Qwen: Qwen3 VL 30B A3B Thinking | Oct 06, 2025 | 30B | 262K |
Image input
Text input
Text output
|
★ | ★★★ | $$$$ |
| Qwen: Qwen3 VL 30B A3B Instruct | Oct 06, 2025 | 30B | 131K |
Image input
Text input
Text output
|
— | — | $$$ |
| Qwen: Qwen3 VL 235B A22B Thinking | Sep 23, 2025 | 235B | 131K |
Image input
Text input
Text output
|
★ | ★ | $$$$$ |
| Qwen: Qwen3 VL 235B A22B Instruct | Sep 23, 2025 | 235B | 131K |
Image input
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
| Qwen: Qwen3 Max | Sep 23, 2025 | — | 262K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
| Qwen: Qwen3 Coder Plus | Sep 23, 2025 | ~480B | 1M |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
| Qwen: Qwen3 Coder Flash | Sep 17, 2025 | — | 1M |
Text input
Text output
|
★★★★ | ★★★ | $$$ |
| Qwen: Qwen3 Next 80B A3B Thinking | Sep 11, 2025 | 80B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
| Qwen: Qwen3 Next 80B A3B Instruct | Sep 11, 2025 | 80B | 262K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
| Qwen: Qwen Plus 0728 | Sep 08, 2025 | ~20B | 1M |
Text input
Text output
|
★★★★★ | ★★★ | $$$ |
| Qwen: Qwen3 30B A3B Thinking 2507 | Aug 28, 2025 | 30B | 262K |
Text input
Text output
|
★★ | ★★★ | $$$$ |
| Qwen: Qwen3 Coder 30B A3B Instruct | Jul 31, 2025 | 30B | 262K |
Text input
Text output
|
★★★★ | ★★★ | $$ |
| Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
| Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
| Qwen: Qwen3 Coder 480B A35B | Jul 22, 2025 | 480B | 1M |
Text input
Text output
|
★★ | ★★★ | $$$ |
| Qwen: Qwen3 Coder 480B A35B (exacto) | Jul 22, 2025 | 480B | 262K |
Text input
Text output
|
— | — | $$$$ |
| Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K |
Text input
Text output
|
★★ | ★★★ | $$$ |
| Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K |
Text input
Text output
|
★★ | ★★★★★ | $$$ |
| Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K |
Text input
Text output
|
★ | ★★★ | $$$ |
| Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K |
Text input
Text output
|
★★ | ★★★ | $$$ |
| Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K |
Text input
Text output
|
★ | ★★★★ | $$$ |
| Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
| Qwen: Qwen2.5 Coder 7B Instruct | Apr 15, 2025 | 7B | 32K |
Text input
Text output
|
— | — | $ |
| Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K |
Image input
Text input
Text output
|
★ | ★★★ | $$$ |
| Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★ | $$$ |
| Qwen: Qwen VL Plus | Feb 04, 2025 | — | 131K |
Image input
Text input
Text output
|
★★★★ | ★★ | $$$ |
| Qwen: Qwen VL Max | Feb 01, 2025 | — | 131K |
Image input
Text input
Text output
|
★★★ | ★★★ | $$$$ |
| Qwen: Qwen-Turbo | Feb 01, 2025 | — | 131K |
Text input
Text output
|
★★★★★ | ★★★★ | $$ |
| Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K |
Image input
Text input
Text output
|
★★★★ | ★★★★ | $$ |
| Qwen: Qwen-Plus | Feb 01, 2025 | — | 1M |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
| Qwen: Qwen-Max | Feb 01, 2025 | — | 32K |
Text input
Text output
|
★★★★ | ★★★★ | $$$$ |
| Qwen: QwQ 32B Preview Unavailable | Nov 27, 2024 | 32B | 32K |
Text input
Text output
|
— | ★ | $$ |
| Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★★ | ★★★★★ | $ |
| Qwen: Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K |
Text input
Text output
|
★ | ★★ | $ |
| Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K |
Text input
Text output
|
★★★ | ★★ | $$ |
| Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K |
Image input
Text input
Text output
|
★★★★ | ★★ | $$ |
| Qwen 2 72B Instruct Unavailable | Jun 06, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★ | ★★ | $$$$ |