Author's Description
DeepSeek R1 Distill Qwen 14B is a distilled large language model based on [Qwen 2.5 14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new state-of-the-art results for dense models. Other benchmark results include: - AIME 2024 pass@1: 69.7 - MATH-500 pass@1: 93.9 - CodeForces Rating: 1481 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
DeepSeek R1 Distill Qwen 14B, a distilled model leveraging DeepSeek R1 outputs, demonstrates strong performance across several key metrics. It consistently ranks among the fastest models, achieving an Infinityth percentile in speed across nine benchmarks. Its pricing is competitive, placing it in the 58th percentile across eight benchmarks. Furthermore, the model exhibits high reliability with a 91% success rate, indicating consistent operational stability. In terms of specific benchmark performance, the model shows notable strength in Coding, achieving 93.0% accuracy (88th percentile), and demonstrates solid capabilities in Mathematics (78.0% accuracy, 41st percentile) and Reasoning (66.0% accuracy, 53rd percentile). Its AIME 2024 pass@1 of 69.7 and MATH-500 pass@1 of 93.9 further underscore its mathematical and problem-solving prowess. However, the model shows weaknesses in Hallucinations (78.0% accuracy, 22nd percentile), General Knowledge (77.5% accuracy, 23rd percentile), and Ethics (87.5% accuracy, 22nd percentile), where its accuracy falls into lower percentiles. Instruction Following also presents a mixed picture, with one test showing 44.0% accuracy (41st percentile) and another indicating 0.0% accuracy, suggesting potential inconsistencies or specific challenges in complex instruction sets.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.15 |
Completion | $0.15 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Novita
|
Novita | deepseek/deepseek-r1-distill-qwen-14b | 32K | $0.15 / 1M tokens | $0.15 / 1M tokens |
GMICloud
|
GMICloud | deepseek/deepseek-r1-distill-qwen-14b | 131K | $0.15 / 1M tokens | $0.15 / 1M tokens |
Together
|
Together | deepseek/deepseek-r1-distill-qwen-14b | 131K | $1.6 / 1M tokens | $1.6 / 1M tokens |
Novita
|
Novita | deepseek/deepseek-r1-distill-qwen-14b | 32K | $0.15 / 1M tokens | $0.15 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by deepseek
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
DeepSeek: DeepSeek V3.2 Exp | Sep 29, 2025 | — | 131K |
Text input
Text output
|
★★★ | ★★★★★ | $$$ |
DeepSeek: DeepSeek V3.1 Terminus | Sep 22, 2025 | ~671B | 131K |
Text input
Text output
|
★★★★ | ★★★★★ | $$$$ |
DeepSeek: DeepSeek V3.1 | Aug 21, 2025 | ~671B | 131K |
Text input
Text output
|
★★ | ★★★★ | $$$ |
DeepSeek: DeepSeek V3.1 Base Unavailable | Aug 20, 2025 | ~671B | 163K |
Text input
Text output
|
★ | ★ | $$ |
DeepSeek: R1 Distill Qwen 7B Unavailable | May 30, 2025 | 7B | 131K |
Text input
Text output
|
★ | ★ | $$$$ |
DeepSeek: DeepSeek R1 0528 Qwen3 8B | May 29, 2025 | 8B | 131K |
Text input
Text output
|
★★★ | ★★★ | $$ |
DeepSeek: R1 0528 | May 28, 2025 | ~671B | 128K |
Text input
Text output
|
★★★ | ★★★ | $$$ |
DeepSeek: DeepSeek Prover V2 | Apr 30, 2025 | ~671B | 131K |
Text input
Text output
|
★★ | ★★★★ | $$$$ |
DeepSeek: DeepSeek V3 Base Unavailable | Mar 29, 2025 | ~671B | 163K |
Text input
Text output
|
★ | ★ | $$$ |
DeepSeek: DeepSeek V3 0324 | Mar 24, 2025 | ~685B | 163K |
Text input
Text output
|
★★★★ | ★★★★★ | $$ |
DeepSeek: R1 Distill Llama 8B Unavailable | Feb 07, 2025 | 8B | 32K |
Text input
Text output
|
★ | ★★ | $$ |
DeepSeek: R1 Distill Qwen 1.5B Unavailable | Jan 31, 2025 | 5B | 131K |
Text input
Text output
|
★★★ | ★ | $$$ |
DeepSeek: R1 Distill Qwen 32B | Jan 29, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★★★ | $$$ |
DeepSeek: R1 Distill Llama 70B | Jan 23, 2025 | 70B | 131K |
Text input
Text output
|
★★★ | ★★★★★ | $$ |
DeepSeek: R1 | Jan 20, 2025 | ~671B | 128K |
Text input
Text output
|
★★★ | ★★★★ | $$$ |
DeepSeek: DeepSeek V3 | Dec 26, 2024 | — | 163K |
Text input
Text output
|
★★★ | ★★★★ | $$$ |