Qwen: Qwen VL Plus

Text input Image input Text output
Author's Description

Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.

Key Specifications
Cost
$$$
Context
7K
Released
Feb 04, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Presence Penalty Top P Temperature Seed Response Format Max Tokens
Features

This model supports the following features:

Response Format
Performance Summary

Qwen VL Plus, an enhanced large visual language model from Qwen, demonstrates a strong overall performance profile, particularly excelling in reliability. Created on February 4, 2025, this model is designed for detailed recognition and text recognition, supporting ultra-high pixel resolutions and extreme aspect ratios. It consistently provides usable responses, ranking in the 91st percentile for reliability across six benchmarks, indicating very few technical issues. In terms of speed, Qwen VL Plus performs among the fastest models, typically in the top tier (73rd percentile). It also offers cost-effective solutions, ranking in the 66th percentile for price competitiveness. While strong in reliability, speed, and cost, the model exhibits varied performance across specific benchmarks. It shows notable accuracy in Ethics (97.0%) and Email Classification (93.0%), though its percentile ranks in these categories (37th and 27th respectively) suggest a competitive landscape. Its Instruction Following (52.0% accuracy, 56th percentile) and Reasoning (56.0% accuracy, 45th percentile) capabilities are moderate. A key area for improvement is General Knowledge, where it achieved 70.2% accuracy but ranked in the 22nd percentile, and Coding, with 68.0% accuracy and a 33rd percentile ranking. The model's duration for these tasks can also be lengthy, particularly for General Knowledge. Overall, Qwen VL Plus is a highly reliable and cost-efficient model with strong visual language capabilities, but its general knowledge and coding performance could be enhanced.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.21
Completion $0.63

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen-vl-plus 7K $0.21 / 1M tokens $0.63 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by qwen