Author's Description
Qwen2 72B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV bias, and group query attention. It is pretrained on extensive data with supervised finetuning and direct preference optimization. For more details, see this [blog post](https://qwenlm.github.io/blog/qwen2/) and [GitHub repo](https://github.com/QwenLM/Qwen2). Usage of this model is subject to [Tongyi Qianwen LICENSE AGREEMENT](https://huggingface.co/Qwen/Qwen1.5-110B-Chat/blob/main/LICENSE).
Key Specifications
Supported Parameters
This model supports the following parameters:
Performance Summary
Qwen 2 72B Instruct demonstrates a balanced performance profile, excelling in certain areas while showing room for improvement in others. In terms of speed, the model performs among the faster options, ranking in the 62nd percentile across various benchmarks. Its pricing is competitive, positioned at the 42nd percentile, making it a cost-effective choice for many applications. Notably, its reliability is a significant strength, with an 83rd percentile ranking, indicating consistent and usable responses with minimal technical failures. Analyzing benchmark results, Qwen 2 72B Instruct shows exceptional performance in Ethics, achieving perfect accuracy and being the most accurate model at its price point and speed. It also performs very well in Email Classification (99% accuracy), highlighting its strong capabilities in structured classification tasks. Instruction Following and Reasoning show moderate accuracy (53% and 64% respectively), suggesting solid foundational understanding. However, the model exhibits weaknesses in Coding (28% accuracy) and General Knowledge (20% accuracy), indicating these areas may require further refinement. Overall, its strengths lie in ethical reasoning, classification, and reliability, while coding and broad general knowledge are areas for development.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.9 |
Completion | $0.9 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Together
|
Together | qwen/qwen-2-72b-instruct | 32K | $0.9 / 1M tokens | $0.9 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by qwen
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Qwen: Qwen3 30B A3B Instruct 2507 | Jul 29, 2025 | 30B | 131K |
Text input
Text output
|
★★★★ | ★★★★ | $$$ |
Qwen: Qwen3 235B A22B Thinking 2507 | Jul 25, 2025 | 235B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$$$ |
Qwen: Qwen3 Coder | Jul 22, 2025 | 480B | 1M |
Text input
Text output
|
★★★★ | ★★★ | $$$ |
Qwen: Qwen3 235B A22B Instruct 2507 | Jul 21, 2025 | 235B | 262K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 30B A3B | Apr 28, 2025 | 30B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$$ |
Qwen: Qwen3 8B | Apr 28, 2025 | 8B | 128K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen3 14B | Apr 28, 2025 | 14B | 40K |
Text input
Text output
|
★★ | ★★★★★ | $$$ |
Qwen: Qwen3 32B | Apr 28, 2025 | 32B | 40K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
Qwen: Qwen3 235B A22B | Apr 28, 2025 | 235B | 40K |
Text input
Text output
|
★ | ★★★★ | $$$$ |
Qwen: Qwen2.5 VL 32B Instruct | Mar 24, 2025 | 32B | 128K |
Text input
Image input
Text output
|
★★ | ★★★ | $$$ |
Qwen: QwQ 32B | Mar 05, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★★ | $$$ |
Qwen: Qwen VL Plus | Feb 04, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★ | ★★ | $$$ |
Qwen: Qwen VL Max | Feb 01, 2025 | — | 7K |
Text input
Image input
Text output
|
★★★★★ | ★★★ | $$$$ |
Qwen: Qwen-Turbo | Feb 01, 2025 | — | 1M |
Text input
Text output
|
★★★★★ | ★★★★ | $$ |
Qwen: Qwen2.5 VL 72B Instruct | Feb 01, 2025 | 72B | 32K |
Text input
Image input
Text output
|
★★★★ | ★★★★ | $$ |
Qwen: Qwen-Plus | Feb 01, 2025 | — | 131K |
Text input
Text output
|
★★★ | ★★★★ | $$$ |
Qwen: Qwen-Max | Feb 01, 2025 | — | 32K |
Text input
Text output
|
★★★ | ★★★★ | $$$$ |
Qwen: QwQ 32B Preview | Nov 27, 2024 | 32B | 32K |
Text input
Text output
|
— | ★ | $$ |
Qwen2.5 Coder 32B Instruct | Nov 11, 2024 | ~500B | 32K |
Text input
Text output
|
★★★★★ | ★★★★★ | $ |
Qwen2.5 7B Instruct | Oct 15, 2024 | ~500B | 32K |
Text input
Text output
|
★ | ★★★ | $$ |
Qwen2.5 72B Instruct | Sep 18, 2024 | ~500B | 32K |
Text input
Text output
|
★★★ | ★★★ | $$$ |
Qwen: Qwen2.5-VL 7B Instruct | Aug 27, 2024 | ~500B | 32K |
Text input
Image input
Text output
|
★★★ | ★★★ | $$$ |