Qwen: Qwen3 VL 8B Instruct

Text input Image input Text output
Author's Description

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

Key Specifications
Cost
$$$
Context
131K
Parameters
8B
Released
Oct 14, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Tool Choice Temperature Presence Penalty Max Tokens Seed Structured Outputs Response Format Top P
Features

This model supports the following features:

Structured Outputs Response Format Tools
Performance Summary

Qwen3-VL-8B-Instruct demonstrates moderate speed performance, ranking in the 20th percentile across benchmarks, and offers cost-effective solutions, placing it in the 67th percentile for pricing. Notably, its reliability is exceptional, achieving a 98% success rate, indicating consistent and stable operation. The model exhibits strong performance in Ethics (99.0% accuracy) and General Knowledge (96.0% accuracy), suggesting robust understanding in these domains. Its Coding capabilities are also commendable at 87.1% accuracy. However, a significant weakness is observed in Mathematics, with a low 42.5% accuracy, and in Hallucinations, where it only achieved 58.0% accuracy, indicating a tendency to generate incorrect information rather than acknowledging uncertainty. Reasoning and Instruction Following show moderate performance at 65.1% and 56.3% accuracy, respectively. The model's extended context window and multimodal fusion capabilities are key features, though its performance on specific benchmarks highlights areas for improvement, particularly in mathematical precision and managing uncertainty.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.117
Completion $0.455

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Alibaba
Alibaba | qwen/qwen3-vl-8b-instruct 131K $0.117 / 1M tokens $0.455 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-vl-8b-instruct 256K $0.08 / 1M tokens $0.5 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-vl-8b-instruct 262K $0.08 / 1M tokens $0.5 / 1M tokens
Novita
Novita | qwen/qwen3-vl-8b-instruct 131K $0.08 / 1M tokens $0.5 / 1M tokens
Parasail
Parasail | qwen/qwen3-vl-8b-instruct 262K $0.25 / 1M tokens $0.75 / 1M tokens
NextBit
NextBit | qwen/qwen3-vl-8b-instruct 131K $0.08 / 1M tokens $0.5 / 1M tokens
Together
Together | qwen/qwen3-vl-8b-instruct 262K $0.08 / 1M tokens $0.5 / 1M tokens
AtlasCloud
AtlasCloud | qwen/qwen3-vl-8b-instruct 128K $0.08 / 1M tokens $0.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen