DeepSeek: R1 Distill Qwen 32B

Text input Text output
Author's Description

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

Key Specifications
Cost
$$$
Context
131K
Parameters
32B
Released
Jan 29, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Temperature Include Reasoning Reasoning Presence Penalty Max Tokens Seed Min P Response Format Frequency Penalty Top P Stop
Features

This model supports the following features:

Reasoning Response Format
Performance Summary

DeepSeek R1 Distill Qwen 32B demonstrates a strong overall performance profile, particularly excelling in reliability with a 99% success rate, indicating consistent and usable responses. While its speed ranking places it among models with longer response times (16th percentile), it offers competitive pricing, ranking in the 63rd percentile for cost-effectiveness. The model exhibits notable strengths in specialized areas. It achieves high accuracy in Coding (93.0%, 83rd percentile), Reasoning (94.0%, 80th percentile), and Mathematics (93.0%, 73rd percentile), suggesting robust capabilities in complex problem-solving and logical inference. Its General Knowledge is also impressive at 98.5% accuracy. However, a significant weakness is observed in its ability to acknowledge uncertainty, with a Hallucinations (Baseline) accuracy of only 60.0% (13th percentile), indicating a tendency to provide answers rather than admitting a lack of knowledge on fictional concepts. Instruction Following (51.5% accuracy) and Ethics (97.5% accuracy) also fall into lower percentile rankings compared to its other strengths. This model is a strong contender for tasks requiring high accuracy in coding and mathematical reasoning, but users should be mindful of its propensity to hallucinate.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.29
Completion $0.29

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepInfra
DeepInfra | deepseek/deepseek-r1-distill-qwen-32b 131K $0.29 / 1M tokens $0.29 / 1M tokens
Novita
Novita | deepseek/deepseek-r1-distill-qwen-32b 64K $0.29 / 1M tokens $0.29 / 1M tokens
GMICloud
GMICloud | deepseek/deepseek-r1-distill-qwen-32b 131K $0.29 / 1M tokens $0.29 / 1M tokens
Cloudflare
Cloudflare | deepseek/deepseek-r1-distill-qwen-32b 80K $0.29 / 1M tokens $0.29 / 1M tokens
Nineteen
Nineteen | deepseek/deepseek-r1-distill-qwen-32b 16K $0.29 / 1M tokens $0.29 / 1M tokens
NextBit
NextBit | deepseek/deepseek-r1-distill-qwen-32b 32K $0.29 / 1M tokens $0.29 / 1M tokens
Novita
Novita | deepseek/deepseek-r1-distill-qwen-32b 64K $0.29 / 1M tokens $0.29 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by deepseek