Qwen: Qwen3 8B

Input: text. Output: text.
Author's Description

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
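For concreteness, the sketch below shows one way to toggle the thinking mode with Hugging Face Transformers, following the publicly documented Qwen3 chat-template flag; the model ID, prompt, and generation length are illustrative assumptions rather than tuned recommendations.

```python
# Minimal sketch: switching Qwen3-8B between thinking and non-thinking mode
# via the chat template's enable_thinking flag (per the public Qwen3 usage docs).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen3-8B"  # assumed Hugging Face model ID
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "How many prime numbers are there below 30?"}]

# enable_thinking=True asks the model to emit a <think>...</think> block before
# its answer; set it to False for plain conversational output.
prompt = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=True,
)

inputs = tokenizer([prompt], return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=1024)
print(tokenizer.decode(output_ids[0][inputs.input_ids.shape[-1]:], skip_special_tokens=True))
```

Extending the context beyond the native 32K window toward 131K tokens is a separate step, done through YaRN rope scaling in the model configuration, and is not shown here.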

Key Specifications

Cost: $$$
Context: 128K
Parameters: 8B
Released: Apr 28, 2025
Speed / Ability / Reliability: graphical ratings (detailed in the Performance Summary below)
Supported Parameters

This model supports the following parameters:

Include Reasoning, Stop, Presence Penalty, Logit Bias, Top P, Temperature, Seed, Min P, Reasoning, Frequency Penalty, Max Tokens
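As a rough illustration of how these map onto an OpenAI-compatible request (for example through an aggregator such as OpenRouter), here is a minimal sketch; the base URL, model slug, and especially the field names for the reasoning toggle and Min P are assumptions that can differ between providers.

```python
# Minimal sketch: passing the listed sampling parameters to an OpenAI-compatible
# chat-completions endpoint. Field names for min_p and the reasoning toggle are
# assumptions; check the provider's parameter documentation.
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",  # assumed OpenAI-compatible gateway
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="qwen/qwen3-8b",  # assumed model slug
    messages=[{"role": "user", "content": "Classify this email as spam or not spam."}],
    temperature=0.7,         # Temperature
    top_p=0.8,               # Top P
    max_tokens=512,          # Max Tokens
    frequency_penalty=0.0,   # Frequency Penalty
    presence_penalty=0.0,    # Presence Penalty
    seed=42,                 # Seed
    stop=["\n\n"],           # Stop
    logit_bias={},           # Logit Bias
    extra_body={
        "min_p": 0.0,                     # Min P (non-standard, passed through)
        "reasoning": {"enabled": True},   # Reasoning / Include Reasoning (assumed field name)
    },
)
print(response.choices[0].message.content)
```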
Features

This model supports the following features:

Reasoning
Performance Summary

Qwen3-8B, a dense 8.2B parameter model, is highly reliable, consistently returning evaluable responses with few technical issues (87th percentile). It is also cost-competitive (68th percentile for pricing), but comparatively slow, ranking in the 14th percentile for speed across benchmarks. On individual benchmarks, Qwen3-8B excels at Ethics (99.0% accuracy) and Email Classification (98.0% accuracy), indicating strong ethical judgment and classification ability, and its General Knowledge is also robust at 97.5% accuracy. However, it shows a notable weakness in Coding, at only 34.0% accuracy (23rd percentile), while Instruction Following (60.0%) and Reasoning (50.0%) are moderate. Its longer response times appear across most benchmarks, particularly Instruction Following and Coding, where durations fall in the 4th percentile. Overall, Qwen3-8B is a reliable and cost-efficient option for tasks that demand strong ethical understanding and classification, but users should plan for its slower processing and its limitations on complex coding and reasoning tasks.

Model Pricing

Current Pricing

Prompt: $0.035 per 1M tokens
Completion: $0.138 per 1M tokens
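
To make the arithmetic concrete, here is a small sketch of per-request cost estimation at the listed rates; the token counts are made-up examples.

```python
# Minimal sketch: estimating per-request cost from the listed per-1M-token prices.
PROMPT_PRICE_PER_M = 0.035      # USD per 1M prompt tokens
COMPLETION_PRICE_PER_M = 0.138  # USD per 1M completion tokens

def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimated USD cost of one request at the listed rates."""
    return (prompt_tokens * PROMPT_PRICE_PER_M
            + completion_tokens * COMPLETION_PRICE_PER_M) / 1_000_000

# Example: a 10,000-token prompt with a 2,000-token completion
# costs about $0.00035 + $0.000276 = $0.000626.
print(f"${estimate_cost(10_000, 2_000):.6f}")
```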


Available Endpoints
Provider: Novita
Endpoint Name: Novita | qwen/qwen3-8b-04-28
Context Length: 128K
Pricing (Input): $0.035 / 1M tokens
Pricing (Output): $0.138 / 1M tokens