DeepSeek: R1

Text input Text output
Author's Description

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

Key Specifications
Cost
$$$
Context
128K
Parameters
671B (Rumoured)
Released
Jan 20, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Top Logprobs Max Tokens Seed Min P Structured Outputs Frequency Penalty Stop Temperature Reasoning Presence Penalty Response Format Logit Bias Top P
Features

This model supports the following features:

Structured Outputs Reasoning Response Format
Performance Summary

DeepSeek R1, a 671B parameter open-source model with 37B active in inference, demonstrates strong performance across several key areas. It consistently ranks among the fastest models and offers highly competitive pricing, making it an attractive option for various applications. In terms of accuracy, DeepSeek R1 excels in Email Classification, achieving perfect 100% accuracy and standing out as the most accurate model at its price point and among models of similar speed. It also shows strong capabilities in Instruction Following, with one benchmark reaching 100% accuracy and another at 80%, placing it in the 89th percentile. Coding performance is robust at 93% accuracy. However, the model exhibits significant weaknesses in complex Reasoning, scoring 0% accuracy, and in Mathematics, with only 2.4% accuracy. Its performance on Hallucinations is moderate at 94%, while General Knowledge and Ethics are respectable at 96.5% and 96% respectively. The model's open-source nature and MIT license for free commercialization are significant advantages.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.7
Completion $2.5

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
InferenceNet
InferenceNet | deepseek/deepseek-r1 128K $0.7 / 1M tokens $2.5 / 1M tokens
DeepInfra
DeepInfra | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Lambda
Lambda | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Novita
Novita | deepseek/deepseek-r1 64K $0.7 / 1M tokens $2.5 / 1M tokens
Nebius
Nebius | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
DeepInfra
DeepInfra | deepseek/deepseek-r1 40K $0.7 / 1M tokens $2.5 / 1M tokens
Kluster
Kluster | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Cent-ML
Cent-ML | deepseek/deepseek-r1 131K $0.7 / 1M tokens $2.5 / 1M tokens
Nebius
Nebius | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Friendli
Friendli | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Fireworks
Fireworks | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Minimax
Minimax | deepseek/deepseek-r1 64K $0.7 / 1M tokens $2.5 / 1M tokens
Azure
Azure | deepseek/deepseek-r1 163K $1.49 / 1M tokens $5.94 / 1M tokens
Targon
Targon | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Chutes
Chutes | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Novita
Novita | deepseek/deepseek-r1 64K $0.7 / 1M tokens $2.5 / 1M tokens
Chutes
Chutes | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Hyperbolic
Hyperbolic | deepseek/deepseek-r1 163K $0.7 / 1M tokens $2.5 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by deepseek