Qwen: Qwen3 4B

Name: Qwen: Qwen3 4B
Brand: qwen
Availability: OutOfStock
Rating: 3.1 (8 reviews)

Back

Text input Text output Unavailable

Author's Description

Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.

Key Specifications

Cost

$$$

Context

131K

Parameters

Released

Apr 30, 2025

Speed

★★

Ability

★★★

Reliability

★★

Hugging Face

Supported Parameters

This model supports the following parameters:

Include Reasoning Max Tokens Tool Choice Tools Response Format Seed Reasoning Top P Presence Penalty Temperature

Features

This model supports the following features:

Response Format Tools Reasoning

Performance Summary

Qwen3 4B, a 4 billion parameter model from qwen, demonstrates moderate speed performance, ranking in the 20th percentile across benchmarks. It offers competitive pricing, positioned at the 50th percentile. Notably, the model exhibits exceptional reliability with a 98% success rate, indicating minimal technical failures and consistent response generation. In terms of performance across categories, Qwen3 4B shows strong capabilities in Reasoning (96.0% accuracy, 87th percentile) and General Knowledge (99.0% accuracy, 66th percentile), suggesting proficiency in complex problem-solving and broad factual recall. Its Mathematics performance is also solid at 89.0% accuracy (58th percentile). However, the model struggles with Instruction Following (44.9% accuracy, 39th percentile) and Hallucinations (82.0% accuracy, 28th percentile), indicating areas for improvement in adhering to complex directives and acknowledging uncertainty. Email Classification also presents a weakness with 93.0% accuracy (22nd percentile). Its dual-mode architecture aims to balance high-precision reasoning with efficient dialogue, making it suitable for multi-turn chat and agent workflows despite some accuracy limitations in specific tasks.

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.0715
Completion	$0.273

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
Alibaba	Alibaba \| qwen/qwen3-4b-04-28	131K	$0.0715 / 1M tokens	$0.273 / 1M tokens

Benchmark Results

Benchmark	Category	Reasoning	Strategy	Free	Executions	Accuracy	Cost	Duration

Other Models by qwen

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
Qwen: Qwen3.7 Plus Unavailable	Jun 03, 2026	—	1M	Image input Text input Text output	★	★★★★	$$$$$
Qwen: Qwen3.7 Plus	Jun 03, 2026	—	1M	Image input Text input Text output	★	★★★★	$$$$$
Qwen: Qwen3.7 Max	May 21, 2026	—	1M	Text input Text output	★★	★★★★★	$$$$$
Qwen: Qwen3.5 Plus 2026-04-20	Apr 26, 2026	—	1M	Image input Text input Video input Text output	★	★★★★	$$$$$
Qwen: Qwen3.6 Flash	Apr 26, 2026	—	1M	Image input Text input Video input Text output	★★	★	$$$$$
Qwen: Qwen3.6 35B A3B	Apr 26, 2026	35B	262K	Image input Text input Video input Text output	★★	★★★★	$$$$$
Qwen: Qwen3.6 Max Preview	Apr 26, 2026	~1T	262K	Text input Text output	★	★★★★★	$$$$$
Qwen: Qwen3.6 27B	Apr 26, 2026	27B	262K	Image input Text input Video input Text output	★	★★★	$$$$$
Qwen: Qwen3.6 Plus	Apr 02, 2026	—	1M	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-9B	Mar 10, 2026	9B	262K	Image input Text input Video input Text output	★	★★	$$$$
Qwen: Qwen3.5-35B-A3B	Feb 25, 2026	35B	262K	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-27B	Feb 25, 2026	27B	262K	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-122B-A10B	Feb 25, 2026	122B	262K	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-Flash	Feb 25, 2026	—	1M	Image input Text input Video input Text output	★★	★	$$$$
Qwen: Qwen3.5 Plus 2026-02-15	Feb 16, 2026	—	1M	Image input Text input Video input Text output	★★★★	★	$$$
Qwen: Qwen3.5 397B A17B	Feb 15, 2026	397B	262K	Image input Text input Video input Text output	★	★★★★★	$$$$$
Qwen: Qwen3 Max Thinking	Feb 09, 2026	—	262K	Text input Text output	★★	★★★★	$$$$$
Qwen: Qwen3 Coder Next	Feb 03, 2026	~80B	262K	Text input Text output	★	★★★★	$$$$
Qwen: Qwen3 VL 32B Instruct	Oct 23, 2025	32B	262K	Image input Text input Text output	★★★	★★★★★	$$
Qwen: Qwen3 VL 8B Thinking	Oct 14, 2025	8B	131K	Image input Text input Text output	★	★	$$$$$
Qwen: Qwen3 VL 8B Instruct	Oct 14, 2025	8B	131K	Image input Text input Text output	★	★★	$$$
Qwen: Qwen3 VL 30B A3B Thinking	Oct 06, 2025	30B	262K	Image input Text input Text output	★	★★★	$$$$
Qwen: Qwen3 VL 30B A3B Instruct	Oct 06, 2025	30B	131K	Image input Text input Text output	—	—	$$$
Qwen: Qwen3 VL 235B A22B Thinking	Sep 23, 2025	235B	131K	Image input Text input Text output	★	★	$$$$$
Qwen: Qwen3 VL 235B A22B Instruct	Sep 23, 2025	235B	131K	Image input Text input Text output	★★★★	★★★★	$$$
Qwen: Qwen3 Max	Sep 23, 2025	—	262K	Text input Text output	★★★★	★★★★	$$$$
Qwen: Qwen3 Coder Plus	Sep 23, 2025	~480B	1M	Text input Text output	★★★★	★★★★	$$$$
Qwen: Qwen3 Coder Flash	Sep 17, 2025	—	1M	Text input Text output	★★★★	★★★	$$$
Qwen: Qwen3 Next 80B A3B Thinking	Sep 11, 2025	80B	131K	Text input Text output	★	★★★	$$$$$
Qwen: Qwen3 Next 80B A3B Instruct	Sep 11, 2025	80B	262K	Text input Text output	★★★★	★★★★	$$$$
Qwen: Qwen Plus 0728	Sep 08, 2025	~20B	1M	Text input Text output	★★★★★	★★★	$$
Qwen: Qwen3 30B A3B Thinking 2507	Aug 28, 2025	30B	262K	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 Coder 30B A3B Instruct	Jul 31, 2025	30B	262K	Text input Text output	★★★★	★★	$$
Qwen: Qwen3 30B A3B Instruct 2507	Jul 29, 2025	30B	131K	Text input Text output	★★★★	★★★	$$$
Qwen: Qwen3 235B A22B Thinking 2507	Jul 25, 2025	235B	131K	Text input Text output	★	★★★★	$$$$$
Qwen: Qwen3 Coder 480B A35B	Jul 22, 2025	480B	1M	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 Coder 480B A35B (exacto) Unavailable	Jul 22, 2025	480B	262K	Text input Text output	—	—	$$$$
Qwen: Qwen3 235B A22B Instruct 2507	Jul 21, 2025	235B	262K	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 30B A3B	Apr 28, 2025	30B	40K	Text input Text output	★★	★★★★	$$$
Qwen: Qwen3 8B	Apr 28, 2025	8B	128K	Text input Text output	★	★★	$$$
Qwen: Qwen3 14B	Apr 28, 2025	14B	40K	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 32B	Apr 28, 2025	32B	40K	Text input Text output	★	★★★★	$$$
Qwen: Qwen3 235B A22B	Apr 28, 2025	235B	40K	Text input Text output	★	★★★★	$$$$
Qwen: Qwen2.5 Coder 7B Instruct Unavailable	Apr 15, 2025	7B	32K	Text input Text output	—	—	$
Qwen: Qwen2.5 VL 32B Instruct Unavailable	Mar 24, 2025	32B	128K	Image input Text input Text output	★	★★★	$$$
Qwen: QwQ 32B Unavailable	Mar 05, 2025	32B	131K	Text input Text output	★	★★	$$$
Qwen: Qwen VL Plus Unavailable	Feb 04, 2025	—	131K	Image input Text input Text output	★★★★	★★	$$$
Qwen: Qwen VL Max Unavailable	Feb 01, 2025	—	131K	Image input Text input Text output	★★★★	★★	$$$$
Qwen: Qwen-Turbo Unavailable	Feb 01, 2025	—	131K	Text input Text output	★★★★★	★★★★	$$
Qwen: Qwen2.5 VL 72B Instruct	Feb 01, 2025	72B	32K	Image input Text input Text output	★★★★	★★★★	$$
Qwen: Qwen-Plus	Feb 01, 2025	—	1M	Text input Text output	★★★★	★★★★	$$$
Qwen: Qwen-Max Unavailable	Feb 01, 2025	—	32K	Text input Text output	★★★★	★★★★	$$$$
Qwen: QwQ 32B Preview Unavailable	Nov 27, 2024	32B	32K	Text input Text output	—	★	$$
Qwen2.5 Coder 32B Instruct	Nov 11, 2024	~500B	32K	Text input Text output	★★★★★	★★★★★	$
Qwen: Qwen2.5 7B Instruct	Oct 15, 2024	~500B	32K	Text input Text output	★	★★	$
Qwen2.5 72B Instruct	Sep 18, 2024	~500B	32K	Text input Text output	★★★	★★	$$
Qwen: Qwen2.5-VL 7B Instruct Unavailable	Aug 27, 2024	~500B	32K	Image input Text input Text output	★★★★	★★	$$
Qwen 2 72B Instruct Unavailable	Jun 06, 2024	~500B	32K	Text input Text output	★★★★	★★	$$$$