Qwen: Qwen3 VL 8B Thinking

Text input Image input Text output
Author's Description

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

Key Specifications
Cost
$$$$$
Context
131K
Parameters
8B
Released
Oct 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Temperature Include Reasoning Reasoning Presence Penalty Max Tokens Seed Structured Outputs Response Format Top P
Features

This model supports the following features:

Structured Outputs Reasoning Response Format Tools
Performance Summary

Qwen3-VL-8B-Thinking, a reasoning-optimized multimodal model, exhibits a tendency towards longer response times, ranking in the 13th percentile for speed. Its pricing is moderate, placing it in the 34th percentile. The model demonstrates a notable strength in Email Classification, achieving 98.0% accuracy, indicating robust understanding of context and purpose in this domain. However, its performance across other benchmarks is generally low. It struggles significantly with General Knowledge (3.0% accuracy), Ethics (13.0% accuracy), and Mathematics (18.0% accuracy), despite its design for advanced reasoning. While it shows a 74.0% accuracy in avoiding hallucinations, its Instruction Following (60.0%) and Reasoning (20.4%) capabilities, which are core to its "Thinking" variant, are below average. Coding performance is also weak at 27.1%. The model's extended context length and multimodal fusion are promising, but current benchmark results suggest substantial room for improvement in complex reasoning and knowledge-based tasks.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.117
Completion $1.36

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3-vl-8b-thinking 131K $0.117 / 1M tokens $1.36 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen