Qwen: Qwen VL Plus

Text input Image input Text output
Author's Description

Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for...

Key Specifications
Cost
$$$
Context
131K
Released
Feb 04, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Top P Response Format Temperature Presence Penalty Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Qwen VL Plus, an enhanced large visual language model from Qwen, demonstrates a balanced performance profile with notable strengths in certain areas. The model performs among the fastest models, ranking in the top tier for speed (69th percentile), and offers competitive pricing, typically providing cost-effective solutions (67th percentile). Its reliability is strong, with an 87% success rate across benchmarks, indicating consistent and usable responses. In terms of specific benchmarks, Qwen VL Plus shows a significant strength in Ethics, achieving 97.0% accuracy, despite a 31st percentile ranking which suggests other models might also perform well in this area. Its ability to handle complex visual inputs, as described, likely contributes to its overall performance. However, the model exhibits weaknesses in core cognitive areas such as Mathematics (44.4% accuracy, 17th percentile), General Knowledge (70.2% accuracy, 18th percentile), and Reasoning (44.0% accuracy, 23rd percentile). Hallucinations are also a concern, with an 84.0% accuracy in acknowledging uncertainty, placing it in the 28th percentile. While its Instruction Following (52.0% accuracy, 46th percentile) and Coding (68.0% accuracy, 26th percentile) capabilities are moderate, there is room for improvement. The model's cost-effectiveness and speed are consistent across most benchmarks, making it an efficient option for tasks where its accuracy aligns with requirements.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.137
Completion $0.409
Input Cache Read $0.0273

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen-vl-plus 131K $0.137 / 1M tokens $0.409 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen