Qwen2.5 72B Instruct

Text input · Text output · Free option
Author's Description

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2:

- Significantly more knowledge and greatly improved capabilities in coding and mathematics, thanks to specialized expert models in these domains.
- Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g., tables), and generating structured outputs, especially JSON. More resilient to diverse system prompts, enhancing role-play implementation and condition-setting for chatbots.
- Long-context support up to 128K tokens, with generation of up to 8K tokens.
- Multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Usage of this model is subject to the [Tongyi Qianwen LICENSE AGREEMENT](https://huggingface.co/Qwen/Qwen1.5-110B-Chat/blob/main/LICENSE).
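
These capabilities are typically exercised through an OpenAI-compatible chat completions API. The sketch below is illustrative only: the base URL and API key are placeholders, and the qwen/qwen-2.5-72b-instruct slug follows the endpoint names listed under Available Endpoints further down.

```python
from openai import OpenAI

# Hypothetical OpenAI-compatible gateway; substitute the provider you actually use.
client = OpenAI(base_url="https://example-gateway/v1", api_key="YOUR_API_KEY")

response = client.chat.completions.create(
    model="qwen/qwen-2.5-72b-instruct",
    messages=[
        {"role": "system", "content": "You are a concise technical assistant."},
        {"role": "user", "content": "Summarize the differences between Qwen2 and Qwen2.5 in three bullet points."},
    ],
    max_tokens=512,
)
print(response.choices[0].message.content)
```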

Key Specifications
Cost: $$$
Context: 32K tokens
Parameters: 72B
Released: Sep 18, 2024
Supported Parameters

This model supports the following parameters:

Stop, Presence Penalty, Tool Choice, Top P, Temperature, Seed, Min P, Tools, Response Format, Frequency Penalty, Max Tokens
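
As a rough illustration, the request below passes most of these parameters through the OpenAI Python client. min_p is not a first-class argument of that client, so it is forwarded via extra_body; whether a particular provider honours it is an assumption, and the base URL is a placeholder.

```python
from openai import OpenAI

client = OpenAI(base_url="https://example-gateway/v1", api_key="YOUR_API_KEY")

response = client.chat.completions.create(
    model="qwen/qwen-2.5-72b-instruct",
    messages=[{"role": "user", "content": "Write a haiku about long-context models."}],
    temperature=0.7,             # sampling temperature
    top_p=0.9,                   # nucleus sampling
    frequency_penalty=0.2,       # discourage token repetition
    presence_penalty=0.1,        # encourage new topics
    max_tokens=128,              # cap completion length
    seed=42,                     # best-effort reproducibility
    stop=["\n\n"],               # stop sequence
    extra_body={"min_p": 0.05},  # non-standard parameter, forwarded as-is
)
print(response.choices[0].message.content)
```
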
Features

This model supports the following features:

Tools, Response Format
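
A minimal sketch of both features with the OpenAI Python client is shown below. The get_weather tool is hypothetical, the base URL is a placeholder, and error handling is omitted; response_format is shown commented out as the alternative structured-output path.

```python
import json
from openai import OpenAI

client = OpenAI(base_url="https://example-gateway/v1", api_key="YOUR_API_KEY")

# Hypothetical tool definition for illustration.
tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Return the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="qwen/qwen-2.5-72b-instruct",
    messages=[{"role": "user", "content": "What's the weather in Lisbon?"}],
    tools=tools,
    tool_choice="auto",
    # Alternatively, request structured output without tools:
    # response_format={"type": "json_object"},
)

message = response.choices[0].message
if message.tool_calls:
    call = message.tool_calls[0]
    print(call.function.name, json.loads(call.function.arguments))
else:
    print(message.content)
```
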
Performance Summary

Qwen2.5 72B Instruct, released on September 18, 2024, demonstrates strong overall performance. It consistently ranks among the fastest models across seven benchmarks, and it typically provides cost-effective solutions, ranking in the 65th percentile for cost across six benchmarks. The model exhibits exceptional reliability, with a 99% success rate across seven benchmarks, indicating minimal technical failures.

Analysis of benchmark results reveals several strengths. Qwen2.5 72B achieved perfect accuracy in the Ethics (Baseline) benchmark, also standing out as the most accurate model at its price point and among models of comparable speed. It demonstrated very high accuracy in General Knowledge (99.5%) and strong performance in Email Classification (98.0%). The model shows improved capabilities in coding (85.0% accuracy) and mathematics, aligning with its description of specialized expert models in these domains.

While one Instruction Following benchmark showed 0.0% accuracy, another achieved a respectable 67.0%, suggesting variability or a specific challenge in the former. Its performance in Reasoning (67.3%) indicates solid analytical capabilities. The model's described improvements in instruction following, long text generation, and structured data understanding are partially reflected in its benchmark results, though the 0% instruction following score warrants further investigation.

Model Pricing

Current Pricing

Feature | Price (per 1M tokens)
Prompt | $0.12
Completion | $0.39
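
For a back-of-the-envelope estimate at these list prices, a request's cost is its prompt and completion token counts scaled by the per-million rates. The token counts in the snippet below are made-up examples.

```python
# List prices above, expressed per token.
PROMPT_PRICE = 0.12 / 1_000_000      # $ per prompt token
COMPLETION_PRICE = 0.39 / 1_000_000  # $ per completion token

def request_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimated cost in USD for one request at the listed rates."""
    return prompt_tokens * PROMPT_PRICE + completion_tokens * COMPLETION_PRICE

# Example: a 4,000-token prompt with a 1,000-token completion.
print(f"${request_cost(4_000, 1_000):.6f}")  # ≈ $0.000870
```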

Price History

Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output)
DeepInfra | qwen/qwen-2.5-72b-instruct | 32K | $0.12 / 1M tokens | $0.39 / 1M tokens
Nebius | qwen/qwen-2.5-72b-instruct | 131K | $0.13 / 1M tokens | $0.40 / 1M tokens
Novita | qwen/qwen-2.5-72b-instruct | 32K | $0.38 / 1M tokens | $0.40 / 1M tokens
Hyperbolic | qwen/qwen-2.5-72b-instruct | 131K | $0.40 / 1M tokens | $0.40 / 1M tokens
Fireworks | qwen/qwen-2.5-72b-instruct | 32K | $0.0666 / 1M tokens | $0.267 / 1M tokens
Together | qwen/qwen-2.5-72b-instruct | 131K | $1.20 / 1M tokens | $1.20 / 1M tokens
Chutes | qwen/qwen-2.5-72b-instruct | 32K | $0.0666 / 1M tokens | $0.267 / 1M tokens
NextBit | qwen/qwen-2.5-72b-instruct | 65K | $0.0666 / 1M tokens | $0.267 / 1M tokens
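
Because input and output prices differ per endpoint, the cheapest choice depends on the prompt/completion mix of a workload. The snippet below ranks the listed endpoints for a hypothetical mix; prices are copied from the table above and the token volumes are assumptions.

```python
# Input/output prices in $ per 1M tokens, from the endpoint table above.
ENDPOINTS = {
    "DeepInfra":  (0.12,   0.39),
    "Nebius":     (0.13,   0.40),
    "Novita":     (0.38,   0.40),
    "Hyperbolic": (0.40,   0.40),
    "Fireworks":  (0.0666, 0.267),
    "Together":   (1.20,   1.20),
    "Chutes":     (0.0666, 0.267),
    "NextBit":    (0.0666, 0.267),
}

def workload_cost(prompt_m: float, completion_m: float) -> list[tuple[str, float]]:
    """Cost in USD for prompt_m / completion_m million tokens, cheapest endpoint first."""
    costs = {
        name: in_price * prompt_m + out_price * completion_m
        for name, (in_price, out_price) in ENDPOINTS.items()
    }
    return sorted(costs.items(), key=lambda kv: kv[1])

# Example workload: 50M prompt tokens, 10M completion tokens.
for name, cost in workload_cost(50, 10):
    print(f"{name:10s} ${cost:,.2f}")
```
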
Benchmark Results