Qwen: Qwen3 VL 30B A3B Instruct

Image input Text input Text output
Author's Description

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Key Specifications
Cost
$$$
Context
131K
Parameters
30B
Released
Oct 06, 2025
Supported Parameters

This model supports the following parameters:

Response Format Top P Seed Temperature Stop Max Tokens Structured Outputs Tool Choice Tools Logit Bias Min P Frequency Penalty Presence Penalty
Features

This model supports the following features:

Structured Outputs Tools Response Format
Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.13
Completion $0.52

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Parasail
Parasail | qwen/qwen3-vl-30b-a3b-instruct 131K $0.13 / 1M tokens $0.52 / 1M tokens
SiliconFlow
SiliconFlow | qwen/qwen3-vl-30b-a3b-instruct 262K $0.29 / 1M tokens $1 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-vl-30b-a3b-instruct 0 $0.13 / 1M tokens $0.52 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen3-vl-30b-a3b-instruct 262K $0.15 / 1M tokens $0.6 / 1M tokens
Novita
Novita | qwen/qwen3-vl-30b-a3b-instruct 131K $0.2 / 1M tokens $0.7 / 1M tokens
Fireworks
Fireworks | qwen/qwen3-vl-30b-a3b-instruct 262K $0.13 / 1M tokens $0.52 / 1M tokens
Phala
Phala | qwen/qwen3-vl-30b-a3b-instruct 128K $0.2 / 1M tokens $0.7 / 1M tokens
Alibaba
Alibaba | qwen/qwen3-vl-30b-a3b-instruct 131K $0.13 / 1M tokens $0.52 / 1M tokens
AtlasCloud
AtlasCloud | qwen/qwen3-vl-30b-a3b-instruct 128K $0.15 / 1M tokens $0.6 / 1M tokens
Venice
Venice | qwen/qwen3-vl-30b-a3b-instruct 128K $0.25 / 1M tokens $0.9 / 1M tokens
Other Models by qwen