Qwen 2 72B Instruct

Input: text · Output: text · Status: unavailable
Author's Description

Qwen2 72B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV bias, and group query attention. It was pretrained on extensive data and post-trained with supervised finetuning and direct preference optimization. For more details, see this [blog post](https://qwenlm.github.io/blog/qwen2/) and [GitHub repo](https://github.com/QwenLM/Qwen2). Usage of this model is subject to the [Tongyi Qianwen LICENSE AGREEMENT](https://huggingface.co/Qwen/Qwen1.5-110B-Chat/blob/main/LICENSE).
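The SwiGLU feed-forward block mentioned above can be sketched in a few lines. This is a minimal NumPy illustration of the general technique, not Qwen2's actual implementation; the dimensions and weight names are toy values chosen for the example.

```python
import numpy as np

def silu(x):
    # SiLU ("swish") activation: x * sigmoid(x)
    return x / (1.0 + np.exp(-x))

def swiglu_ffn(x, w_gate, w_up, w_down):
    # SwiGLU feed-forward: a SiLU-gated projection multiplied
    # elementwise with a linear "up" projection, then projected
    # back down to the model dimension.
    return (silu(x @ w_gate) * (x @ w_up)) @ w_down

# Toy sizes for illustration only (Qwen2's real dims are much larger).
rng = np.random.default_rng(0)
d_model, d_ff = 8, 32
x = rng.standard_normal((1, d_model))
w_gate = rng.standard_normal((d_model, d_ff))
w_up = rng.standard_normal((d_model, d_ff))
w_down = rng.standard_normal((d_ff, d_model))

y = swiglu_ffn(x, w_gate, w_up, w_down)
print(y.shape)  # (1, 8)
```

Compared with a plain two-layer MLP, the gated variant adds a third weight matrix but tends to improve quality at equal parameter count, which is why many recent transformers adopt it.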

Key Specifications
Cost: $$$$
Context: 32K
Parameters: 72B
Released: Jun 06, 2024
Supported Parameters

This model supports the following parameters:

Logit Bias, Stop, Min P, Top P, Max Tokens, Frequency Penalty, Temperature, Presence Penalty
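The parameters above map onto an OpenAI-compatible chat-completion payload, which is the request shape most providers expose for this model. Everything below is illustrative: the field values, the stop sequence, and the token id in `logit_bias` are placeholder assumptions, not recommended settings.

```python
import json

# Example payload exercising the supported sampling parameters.
# Values are placeholders for illustration, not recommendations.
payload = {
    "model": "qwen/qwen-2-72b-instruct",
    "messages": [
        {"role": "user", "content": "Explain group query attention briefly."}
    ],
    "temperature": 0.7,        # sampling temperature
    "top_p": 0.9,              # nucleus sampling cutoff
    "min_p": 0.05,             # minimum token-probability cutoff
    "max_tokens": 256,         # cap on completion length
    "frequency_penalty": 0.1,  # discourage verbatim repetition
    "presence_penalty": 0.0,   # discourage reusing topics
    "stop": ["\n\n"],          # stop sequence(s)
    "logit_bias": {"1234": -100},  # hypothetical token id to suppress
}

print(json.dumps(payload, indent=2))
```

Sending this body to a provider's `/chat/completions` endpoint (with your API key in the `Authorization` header) would apply all eight supported parameters in one request.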
Performance Summary

Qwen 2 72B Instruct, released on June 6, 2024, shows a strong overall profile. It ranks in the 61st percentile for speed across five benchmarks, places in the 41st percentile for price, and is highly reliable, consistently returning usable responses.

Its standout strength is Ethics, where it achieves perfect 100% accuracy, making it the most accurate model at its price point and among models of comparable speed. Email Classification is also very strong at 99.0% accuracy, indicating solid contextual understanding, and Instruction Following sits in the middle tier at 53.0%. Its notable weaknesses are General Knowledge, at only 20.0% accuracy, and Coding, at 59.0%, suggesting limits in obscure factual recall and complex programming tasks. Architectural features such as SwiGLU activation and group query attention contribute to its efficiency in its areas of strength.

Model Pricing

Current Pricing

| Feature | Price (per 1M tokens) |
|---|---|
| Prompt | $0.90 |
| Completion | $0.90 |
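Since prompt and completion tokens are billed at the same listed rate of $0.90 per 1M tokens, estimating a request's cost is simple arithmetic. The token counts below are illustrative.

```python
# Per-token prices derived from the listed $0.90 / 1M tokens.
PROMPT_PRICE = 0.90 / 1_000_000      # dollars per prompt token
COMPLETION_PRICE = 0.90 / 1_000_000  # dollars per completion token

def request_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimated dollar cost of one request at the listed rates."""
    return prompt_tokens * PROMPT_PRICE + completion_tokens * COMPLETION_PRICE

# e.g. a 10K-token prompt with a 2K-token reply:
print(f"${request_cost(10_000, 2_000):.4f}")  # $0.0108
```

At these rates, even a full 32K-token context costs under three cents per request.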

Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
|---|---|---|---|---|
| Together | qwen/qwen-2-72b-instruct | 32K | $0.90 / 1M tokens | $0.90 / 1M tokens |
Benchmark Results
| Benchmark | Accuracy |
|---|---|
| Ethics | 100.0% |
| Email Classification | 99.0% |
| Coding | 59.0% |
| Instruction Following | 53.0% |
| General Knowledge | 20.0% |