Qwen2.5 72B Instruct

Text input Text output
Author's Description

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

Key Specifications
Cost
$$
Context
32K
Parameters
500B (Rumoured)
Released
Sep 18, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Seed Tools Frequency Penalty Top P Min P Response Format Temperature Stop Presence Penalty Tool Choice Max Tokens
Features

This model supports the following features:

Response Format Tools
Performance Summary

Qwen2.5 72B Instruct, created on September 18, 2024, demonstrates strong overall performance, consistently ranking among the fastest models across nine benchmarks. It also offers competitive pricing, typically falling within the 73rd percentile for cost-effectiveness. The model exhibits exceptional reliability with a 99% success rate, indicating minimal technical failures. In terms of specific capabilities, Qwen2.5 72B excels in several areas. It achieved perfect accuracy in Hallucinations (Baseline) and Ethics (Baseline) tests, showcasing its ability to acknowledge uncertainty and adhere to ethical principles. Its General Knowledge is also very strong at 99.5% accuracy. The model shows significant improvements in instruction following, achieving 67.0% accuracy in a complex instruction following benchmark, and performs well in Coding (85.0% accuracy). Its description highlights enhanced capabilities in generating long texts, understanding structured data, and producing structured outputs like JSON, alongside robust multilingual support for over 29 languages. While strong in many areas, the model's Mathematics performance, at 83.8% accuracy, is moderate compared to other benchmarks. Notably, one instance of the Instruction Following benchmark recorded 0.0% accuracy, which warrants further investigation as it contrasts sharply with another instruction following result. Overall, Qwen2.5 72B Instruct is a highly reliable and fast model with excellent knowledge retention, ethical reasoning, and improved instruction following, making it a versatile choice for various applications.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.12
Completion $0.39

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | qwen/qwen-2.5-72b-instruct 32K $0.12 / 1M tokens $0.39 / 1M tokens
Nebius
Nebius | qwen/qwen-2.5-72b-instruct 131K $0.12 / 1M tokens $0.39 / 1M tokens
Novita
Novita | qwen/qwen-2.5-72b-instruct 32K $0.12 / 1M tokens $0.39 / 1M tokens
Hyperbolic
Hyperbolic | qwen/qwen-2.5-72b-instruct 131K $0.12 / 1M tokens $0.39 / 1M tokens
Fireworks
Fireworks | qwen/qwen-2.5-72b-instruct 32K $0.12 / 1M tokens $0.39 / 1M tokens
Together
Together | qwen/qwen-2.5-72b-instruct 131K $0.12 / 1M tokens $0.39 / 1M tokens
Chutes
Chutes | qwen/qwen-2.5-72b-instruct 32K $0.12 / 1M tokens $0.39 / 1M tokens
NextBit
NextBit | qwen/qwen-2.5-72b-instruct 65K $0.12 / 1M tokens $0.39 / 1M tokens
Chutes
Chutes | qwen/qwen-2.5-72b-instruct 32K $0.12 / 1M tokens $0.39 / 1M tokens
Novita
Novita | qwen/qwen-2.5-72b-instruct 32K $0.38 / 1M tokens $0.4 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen