Qwen: Qwen3 8B

Text input Text output
Author's Description

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.

Key Specifications
Cost
$$$
Context
128K
Parameters
8B
Released
Apr 28, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Logit Bias Reasoning Include Reasoning Stop Seed Min P Top P Max Tokens Frequency Penalty Temperature Presence Penalty
Features

This model supports the following features:

Reasoning
Performance Summary

Qwen3-8B, a dense 8.2B parameter model, generally exhibits longer response times, ranking in the 15th percentile for speed across benchmarks. Conversely, it offers cost-effective solutions, placing in the 67th percentile for price. The model demonstrates strong reliability with an 89% success rate, indicating consistent and usable responses. In terms of performance across categories, Qwen3-8B shows notable strengths in General Knowledge (97.5% accuracy) and Ethics (99.0% accuracy), performing well within the 55th and 60th percentiles respectively. It also achieves a respectable 98.0% accuracy in Email Classification. Its instruction-following capabilities are moderate at 60.0% accuracy. However, the model struggles significantly with Coding, achieving only 34.0% accuracy (19th percentile), and shows average performance in Reasoning (56.0% accuracy) and Mathematics (85.0% accuracy). Its extended context window of 131K tokens and multilingual support are key features.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.035
Completion $0.138

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Novita
Novita | qwen/qwen3-8b-04-28 128K $0.035 / 1M tokens $0.138 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen