Qwen: Qwen2.5 7B Instruct

Text input Text output
Author's Description

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

Key Specifications
Cost
$
Context
32K
Parameters
500B (Rumoured)
Released
Oct 15, 2024
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Frequency Penalty Structured Outputs Top P Response Format Temperature Stop Presence Penalty Max Tokens
Features

This model supports the following features:

Structured Outputs Response Format
Performance Summary

Qwen2.5 7B Instruct demonstrates moderate speed performance, ranking in the 30th percentile across benchmarks. A significant strength lies in its pricing, consistently offering among the most competitive rates, placing it in the 91st percentile. The model also exhibits strong reliability with a 91% success rate, indicating consistent and usable responses. In terms of specific capabilities, Qwen2.5 7B shows excellent performance in hallucination avoidance with 98.0% accuracy, suggesting a strong ability to acknowledge uncertainty. It also performs well in Coding (83.0% accuracy) and Mathematics (80.5% accuracy), with the latter being the most accurate model at its price point. However, the model struggles with General Knowledge (76.5% accuracy), Ethics (61.0% accuracy), and Instruction Following (45.5% accuracy), where it ranks in the lower percentiles. Its Reasoning capabilities are also moderate at 56.8% accuracy. While it shows strong improvements in generating long texts and understanding structured data as per its description, these specific aspects are not directly reflected in the provided benchmark results. The model's long duration for General Knowledge, Ethics, and Reasoning benchmarks suggests potential inefficiencies in processing complex queries in these areas.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.04
Completion $0.1

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
NextBit
NextBit | qwen/qwen-2.5-7b-instruct 32K $0.04 / 1M tokens $0.1 / 1M tokens
DeepInfra
DeepInfra | qwen/qwen-2.5-7b-instruct 32K $0.04 / 1M tokens $0.1 / 1M tokens
Phala
Phala | qwen/qwen-2.5-7b-instruct 32K $0.04 / 1M tokens $0.1 / 1M tokens
Together
Together | qwen/qwen-2.5-7b-instruct 32K $0.3 / 1M tokens $0.3 / 1M tokens
Novita
Novita | qwen/qwen-2.5-7b-instruct 32K $0.04 / 1M tokens $0.1 / 1M tokens
NextBit
NextBit | qwen/qwen-2.5-7b-instruct 65K $0.04 / 1M tokens $0.1 / 1M tokens
Novita
Novita | qwen/qwen-2.5-7b-instruct 32K $0.04 / 1M tokens $0.1 / 1M tokens
AtlasCloud
AtlasCloud | qwen/qwen-2.5-7b-instruct 32K $0.04 / 1M tokens $0.1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by qwen