Qwen: Qwen2.5-VL 7B Instruct

Name: Qwen: Qwen2.5-VL 7B Instruct
Brand: qwen
Availability: OutOfStock
Rating: 2.0 (8 reviews)

Back

Image input Text input Text output Unavailable

Author's Description

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: - SoTA understanding of images of various resolution & ratio: Qwen2.5-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. - Understanding videos of 20min+: Qwen2.5-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc. - Agent that can operate your mobiles, robots, etc.: with the abilities of complex reasoning and decision making, Qwen2.5-VL can be integrated with devices like mobile phones, robots, etc., for automatic operation based on visual environment and text instructions. - Multilingual Support: to serve global users, besides English and Chinese, Qwen2.5-VL now supports the understanding of texts in different languages inside images, including most European languages, Japanese, Korean, Arabic, Vietnamese, etc. For more details, see this [blog post](https://qwenlm.github.io/blog/qwen2-vl/) and [GitHub repo](https://github.com/QwenLM/Qwen2-VL). Usage of this model is subject to [Tongyi Qianwen LICENSE AGREEMENT](https://huggingface.co/Qwen/Qwen1.5-110B-Chat/blob/main/LICENSE).

Key Specifications

Cost

Context

32K

Parameters

500B (Rumoured)

Released

Aug 27, 2024

Speed

★★★★

Ability

★★

Reliability

★

Hugging Face

Supported Parameters

This model supports the following parameters:

Stop Max Tokens Logit Bias Seed Top P Min P Frequency Penalty Presence Penalty Temperature

Performance Summary

Qwen2.5-VL 7B Instruct demonstrates exceptional speed, consistently ranking among the fastest models across various benchmarks. It also offers competitive pricing, typically providing cost-effective solutions. The model exhibits strong reliability with an 86% success rate, indicating consistent delivery of usable responses. In terms of performance across categories, Qwen2.5-VL 7B Instruct achieves perfect accuracy in Ethics, highlighting its robust moral reasoning capabilities. It also performs well in General Knowledge (91.8% accuracy) and Email Classification (92.0% accuracy). A significant strength lies in its multimodal capabilities, as described, including state-of-the-art image understanding across resolutions and ratios, and the ability to comprehend videos over 20 minutes. Its multilingual support for text within images is also a notable advantage for global applications. However, the model shows significant weaknesses in Mathematics, scoring 0.0% accuracy, suggesting a current limitation in complex mathematical problem-solving. Its performance in Reasoning (42.0% accuracy) and Instruction Following (51.5% accuracy) is moderate, indicating areas for potential improvement. While its Hallucinations accuracy is 86.0%, this places it in the 29th percentile, suggesting room for improvement in acknowledging uncertainty for fictional concepts.

Model Pricing

Current Pricing

Feature	Price (per 1M tokens)
Prompt	$0.2
Completion	$0.2

Price History

Available Endpoints

Provider	Endpoint Name	Context Length	Pricing (Input)	Pricing (Output)
Hyperbolic	Hyperbolic \| qwen/qwen-2-vl-7b-instruct	32K	$0.2 / 1M tokens	$0.2 / 1M tokens
InferenceNet	InferenceNet \| qwen/qwen-2-vl-7b-instruct	128K	$0.2 / 1M tokens	$0.2 / 1M tokens
Kluster	Kluster \| qwen/qwen-2-vl-7b-instruct	32K	$0.2 / 1M tokens	$0.2 / 1M tokens

Benchmark Results

Benchmark	Category	Reasoning	Strategy	Free	Executions	Accuracy	Cost	Duration

Other Models by qwen

	Released	Params	Context	Filter by Modalities All Modalities	Speed	Ability	Cost
Qwen: Qwen3.7 Plus Unavailable	Jun 03, 2026	—	1M	Image input Text input Text output	★	★★★★	$$$$$
Qwen: Qwen3.7 Plus	Jun 03, 2026	—	1M	Image input Text input Text output	★	★★★★	$$$$$
Qwen: Qwen3.7 Max	May 21, 2026	—	1M	Text input Text output	★★	★★★★★	$$$$$
Qwen: Qwen3.5 Plus 2026-04-20	Apr 26, 2026	—	1M	Image input Text input Video input Text output	★	★★★★	$$$$$
Qwen: Qwen3.6 Flash	Apr 26, 2026	—	1M	Image input Text input Video input Text output	★★	★	$$$$$
Qwen: Qwen3.6 35B A3B	Apr 26, 2026	35B	262K	Image input Text input Video input Text output	★★	★★★★	$$$$$
Qwen: Qwen3.6 Max Preview	Apr 26, 2026	~1T	262K	Text input Text output	★	★★★★★	$$$$$
Qwen: Qwen3.6 27B	Apr 26, 2026	27B	262K	Image input Text input Video input Text output	★	★★★	$$$$$
Qwen: Qwen3.6 Plus	Apr 02, 2026	—	1M	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-9B	Mar 10, 2026	9B	262K	Image input Text input Video input Text output	★	★★	$$$$
Qwen: Qwen3.5-35B-A3B	Feb 25, 2026	35B	262K	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-27B	Feb 25, 2026	27B	262K	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-122B-A10B	Feb 25, 2026	122B	262K	Image input Text input Video input Text output	★	★	$$$$$
Qwen: Qwen3.5-Flash	Feb 25, 2026	—	1M	Image input Text input Video input Text output	★★	★	$$$$
Qwen: Qwen3.5 Plus 2026-02-15	Feb 16, 2026	—	1M	Image input Text input Video input Text output	★★★★	★	$$$
Qwen: Qwen3.5 397B A17B	Feb 15, 2026	397B	262K	Image input Text input Video input Text output	★	★★★★★	$$$$$
Qwen: Qwen3 Max Thinking	Feb 09, 2026	—	262K	Text input Text output	★★	★★★★	$$$$$
Qwen: Qwen3 Coder Next	Feb 03, 2026	~80B	262K	Text input Text output	★	★★★★	$$$$
Qwen: Qwen3 VL 32B Instruct	Oct 23, 2025	32B	262K	Image input Text input Text output	★★★	★★★★★	$$
Qwen: Qwen3 VL 8B Thinking	Oct 14, 2025	8B	131K	Image input Text input Text output	★	★	$$$$$
Qwen: Qwen3 VL 8B Instruct	Oct 14, 2025	8B	131K	Image input Text input Text output	★	★★	$$$
Qwen: Qwen3 VL 30B A3B Thinking	Oct 06, 2025	30B	262K	Image input Text input Text output	★	★★★	$$$$
Qwen: Qwen3 VL 30B A3B Instruct	Oct 06, 2025	30B	131K	Image input Text input Text output	—	—	$$$
Qwen: Qwen3 VL 235B A22B Thinking	Sep 23, 2025	235B	131K	Image input Text input Text output	★	★	$$$$$
Qwen: Qwen3 VL 235B A22B Instruct	Sep 23, 2025	235B	131K	Image input Text input Text output	★★★★	★★★★	$$$
Qwen: Qwen3 Max	Sep 23, 2025	—	262K	Text input Text output	★★★★	★★★★	$$$$
Qwen: Qwen3 Coder Plus	Sep 23, 2025	~480B	1M	Text input Text output	★★★★	★★★★	$$$$
Qwen: Qwen3 Coder Flash	Sep 17, 2025	—	1M	Text input Text output	★★★★	★★★	$$$
Qwen: Qwen3 Next 80B A3B Thinking	Sep 11, 2025	80B	131K	Text input Text output	★	★★★	$$$$$
Qwen: Qwen3 Next 80B A3B Instruct	Sep 11, 2025	80B	262K	Text input Text output	★★★★	★★★★	$$$$
Qwen: Qwen Plus 0728	Sep 08, 2025	~20B	1M	Text input Text output	★★★★★	★★★	$$
Qwen: Qwen3 30B A3B Thinking 2507	Aug 28, 2025	30B	262K	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 Coder 30B A3B Instruct	Jul 31, 2025	30B	262K	Text input Text output	★★★★	★★	$$
Qwen: Qwen3 30B A3B Instruct 2507	Jul 29, 2025	30B	131K	Text input Text output	★★★★	★★★	$$$
Qwen: Qwen3 235B A22B Thinking 2507	Jul 25, 2025	235B	131K	Text input Text output	★	★★★★	$$$$$
Qwen: Qwen3 Coder 480B A35B	Jul 22, 2025	480B	1M	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 Coder 480B A35B (exacto) Unavailable	Jul 22, 2025	480B	262K	Text input Text output	—	—	$$$$
Qwen: Qwen3 235B A22B Instruct 2507	Jul 21, 2025	235B	262K	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 4B Unavailable	Apr 30, 2025	4B	131K	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 30B A3B	Apr 28, 2025	30B	40K	Text input Text output	★★	★★★★	$$$
Qwen: Qwen3 8B	Apr 28, 2025	8B	128K	Text input Text output	★	★★	$$$
Qwen: Qwen3 14B	Apr 28, 2025	14B	40K	Text input Text output	★★	★★★	$$$
Qwen: Qwen3 32B	Apr 28, 2025	32B	40K	Text input Text output	★	★★★★	$$$
Qwen: Qwen3 235B A22B	Apr 28, 2025	235B	40K	Text input Text output	★	★★★★	$$$$
Qwen: Qwen2.5 Coder 7B Instruct Unavailable	Apr 15, 2025	7B	32K	Text input Text output	—	—	$
Qwen: Qwen2.5 VL 32B Instruct Unavailable	Mar 24, 2025	32B	128K	Image input Text input Text output	★	★★★	$$$
Qwen: QwQ 32B Unavailable	Mar 05, 2025	32B	131K	Text input Text output	★	★★	$$$
Qwen: Qwen VL Plus Unavailable	Feb 04, 2025	—	131K	Image input Text input Text output	★★★★	★★	$$$
Qwen: Qwen VL Max Unavailable	Feb 01, 2025	—	131K	Image input Text input Text output	★★★★	★★	$$$$
Qwen: Qwen-Turbo Unavailable	Feb 01, 2025	—	131K	Text input Text output	★★★★★	★★★★	$$
Qwen: Qwen2.5 VL 72B Instruct	Feb 01, 2025	72B	32K	Image input Text input Text output	★★★★	★★★★	$$
Qwen: Qwen-Plus	Feb 01, 2025	—	1M	Text input Text output	★★★★	★★★★	$$$
Qwen: Qwen-Max Unavailable	Feb 01, 2025	—	32K	Text input Text output	★★★★	★★★★	$$$$
Qwen: QwQ 32B Preview Unavailable	Nov 27, 2024	32B	32K	Text input Text output	—	★	$$
Qwen2.5 Coder 32B Instruct	Nov 11, 2024	~500B	32K	Text input Text output	★★★★★	★★★★★	$
Qwen: Qwen2.5 7B Instruct	Oct 15, 2024	~500B	32K	Text input Text output	★	★★	$
Qwen2.5 72B Instruct	Sep 18, 2024	~500B	32K	Text input Text output	★★★	★★	$$
Qwen 2 72B Instruct Unavailable	Jun 06, 2024	~500B	32K	Text input Text output	★★★★	★★	$$$$