Qwen: Qwen2.5 7B Instruct

Text input → Text output
Author's Description

Qwen2.5 7B Instruct is part of Qwen2.5, the latest series of Qwen large language models. Qwen2.5 brings the following improvements over Qwen2:

- Significantly more knowledge and greatly improved capabilities in coding and mathematics, thanks to specialized expert models in these domains.
- Significant improvements in instruction following, generating long texts (over 8K tokens), understanding structured data (e.g., tables), and generating structured outputs, especially JSON. More resilient to diverse system prompts, enhancing role-play implementation and condition-setting for chatbots.
- Long-context support up to 128K tokens, with generation of up to 8K tokens.
- Multilingual support for over 29 languages, including Chinese, English, French, Spanish, Portuguese, German, Italian, Russian, Japanese, Korean, Vietnamese, Thai, Arabic, and more.

Usage of this model is subject to the [Tongyi Qianwen LICENSE AGREEMENT](https://huggingface.co/Qwen/Qwen1.5-110B-Chat/blob/main/LICENSE).
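The description names two hard limits: a 128K-token context window and an 8K-token generation cap. A minimal sketch of checking a request against those limits before sending it (the helper and token counts below are illustrative assumptions, not part of any official SDK):

```python
# Limits taken from the model description above.
CONTEXT_LIMIT = 128_000   # max tokens the model can attend to
GENERATION_LIMIT = 8_000  # max tokens the model can generate

def validate_request(prompt_tokens: int, max_tokens: int) -> None:
    """Raise if a hypothetical request would exceed the model's limits."""
    if max_tokens > GENERATION_LIMIT:
        raise ValueError(
            f"max_tokens {max_tokens} exceeds generation cap {GENERATION_LIMIT}")
    if prompt_tokens + max_tokens > CONTEXT_LIMIT:
        raise ValueError(
            f"prompt + completion ({prompt_tokens + max_tokens}) "
            f"exceeds context window {CONTEXT_LIMIT}")

validate_request(120_000, 4_000)  # fits within both limits
```

Note that several endpoints below advertise a 32K context, so the effective window may be smaller than the model's maximum depending on the provider.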

Key Specifications
Cost
$
Context
32K
Parameters
7B
Released
Oct 15, 2024
Supported Parameters

This model supports the following parameters:

- Structured Outputs
- Response Format
- Stop
- Top P
- Max Tokens
- Frequency Penalty
- Temperature
- Presence Penalty
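These parameters map directly onto an OpenAI-compatible chat-completions request body. A minimal sketch of assembling such a payload (the field names follow the common OpenAI-style schema, which is an assumption here; the endpoint itself is not shown):

```python
# Chat-completions payload using the parameters this model supports.
# Field names follow the widely used OpenAI-style schema (an assumption).
payload = {
    "model": "qwen/qwen-2.5-7b-instruct",
    "messages": [{"role": "user",
                  "content": "Summarize Qwen2.5 in one sentence."}],
    "temperature": 0.7,        # sampling randomness
    "top_p": 0.9,              # nucleus sampling cutoff
    "max_tokens": 512,         # completion length cap
    "stop": ["\n\n"],          # stop sequences
    "frequency_penalty": 0.2,  # discourage token repetition
    "presence_penalty": 0.1,   # encourage new topics
}
print(payload["model"])  # qwen/qwen-2.5-7b-instruct
```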
Features

This model supports the following features:

- Response Format
- Structured Outputs
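Response Format and Structured Outputs typically mean the request can constrain the completion to valid JSON. A hedged sketch, assuming the widely used `response_format` field from the OpenAI-style schema (not verified against any specific provider listed here); the model reply shown is a made-up example:

```python
import json

# Request JSON-only output via response_format (OpenAI-style; an assumption).
payload = {
    "model": "qwen/qwen-2.5-7b-instruct",
    "messages": [{"role": "user",
                  "content": "List three Qwen2.5 features as JSON."}],
    "response_format": {"type": "json_object"},
}

# A well-formed reply can then be parsed directly (example text, not a real response):
reply = '{"features": ["coding", "mathematics", "long context"]}'
data = json.loads(reply)
print(len(data["features"]))  # 3
```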
Performance Summary

Qwen2.5 7B Instruct demonstrates moderate speed performance, ranking in the 28th percentile across benchmarks. It consistently offers among the most competitive pricing, placing in the 93rd percentile. The model exhibits strong reliability with a 91% success rate, indicating few technical issues.

Key strengths include exceptional performance in hallucination avoidance (98.0% accuracy), where it is the most accurate model at its price point. It also shows strong capabilities in coding (83.0% accuracy) and mathematics (80.5% accuracy), with the latter also being the most accurate model at its price point. These results align with the provider's claims of enhanced coding and mathematics capabilities due to specialized expert models. The model also performs well in email classification, achieving 94.0% accuracy.

However, Qwen2.5 7B Instruct shows notable weaknesses in General Knowledge (76.5% accuracy) and Ethics (61.0% accuracy), where it ranks in the lower percentiles. Instruction Following (45.5% accuracy) and Reasoning (56.8% accuracy) also present areas for improvement. Despite its long-context support, some benchmarks, particularly General Knowledge, Ethics, and Mathematics, show very long durations, suggesting potential inefficiencies in processing certain types of tasks.

Model Pricing

Current Pricing

| Feature | Price (per 1M tokens) |
| --- | --- |
| Prompt | $0.04 |
| Completion | $0.10 |
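Per-token pricing makes request cost a simple linear function of token counts. A quick sketch of the arithmetic (rates copied from the table above; the request sizes are made up for illustration):

```python
PROMPT_RATE = 0.04 / 1_000_000      # dollars per prompt token
COMPLETION_RATE = 0.10 / 1_000_000  # dollars per completion token

def request_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Dollar cost of one request at the listed rates."""
    return prompt_tokens * PROMPT_RATE + completion_tokens * COMPLETION_RATE

# e.g. 10,000 prompt tokens and 2,000 completion tokens:
print(f"${request_cost(10_000, 2_000):.6f}")  # $0.000600
```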

Available Endpoints
| Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
| --- | --- | --- | --- | --- |
| NextBit | qwen/qwen-2.5-7b-instruct | 32K | $0.04 / 1M tokens | $0.10 / 1M tokens |
| DeepInfra | qwen/qwen-2.5-7b-instruct | 32K | $0.04 / 1M tokens | $0.10 / 1M tokens |
| Phala | qwen/qwen-2.5-7b-instruct | 32K | $0.04 / 1M tokens | $0.10 / 1M tokens |
| Together | qwen/qwen-2.5-7b-instruct | 32K | $0.30 / 1M tokens | $0.30 / 1M tokens |
| Novita | qwen/qwen-2.5-7b-instruct | 32K | $0.07 / 1M tokens | $0.07 / 1M tokens |
| NextBit | qwen/qwen-2.5-7b-instruct | 65K | $0.04 / 1M tokens | $0.10 / 1M tokens |
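With several endpoints at different prices, choosing a provider reduces to minimizing a blended per-token rate. A sketch using the rates from the table above (the 50/50 prompt/completion traffic mix is an arbitrary assumption, and real routing would also weigh context length, speed, and reliability):

```python
# (provider, context, $/1M input, $/1M output) from the endpoints table
ENDPOINTS = [
    ("NextBit", "32K", 0.04, 0.10),
    ("DeepInfra", "32K", 0.04, 0.10),
    ("Phala", "32K", 0.04, 0.10),
    ("Together", "32K", 0.30, 0.30),
    ("Novita", "32K", 0.07, 0.07),
    ("NextBit", "65K", 0.04, 0.10),
]

def blended_rate(in_rate: float, out_rate: float,
                 out_share: float = 0.5) -> float:
    """Average $/1M tokens for a given completion share of traffic."""
    return (1 - out_share) * in_rate + out_share * out_rate

cheapest = min(ENDPOINTS, key=lambda e: blended_rate(e[2], e[3]))
print(cheapest[0])  # NextBit (ties go to the first listed endpoint)
```

At a 50/50 mix the $0.04/$0.10 providers and Novita's flat $0.07 tie exactly; a prompt-heavy workload would favor the $0.04-input endpoints.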
Benchmark Results