Author's Description
DeepSeek-V3 is the latest model from the DeepSeek team, building on the instruction-following and coding abilities of previous versions. It was pre-trained on nearly 15 trillion tokens, and the reported evaluations show that it outperforms other open-source models and rivals leading closed-source models. For model details, see [the DeepSeek-V3 repo](https://github.com/deepseek-ai/DeepSeek-V3) or the [launch announcement](https://api-docs.deepseek.com/news/news1226).
Performance Summary
DeepSeek-V3 shows a strong overall performance profile, with accuracy as its main strength. It has a context length of 163,840 tokens and was pre-trained on nearly 15 trillion tokens.

Speed and cost are both mid-range. Response times rank in the 50th percentile across five benchmarks, which is comparable to many models in its class, and pricing ranks in the 58th percentile.

On individual benchmarks, DeepSeek-V3 scores a perfect 100% on "Ethics (Baseline)", making it the most accurate model at its price point and speed. It reaches 99.5% on "General Knowledge (Baseline)" and 84.0% on "Reasoning (Baseline)", both in the 85th percentile, and 89.0% on "Coding (Baseline)" (80th percentile). "Email Classification (Baseline)" accuracy is 98.0%.

The main weakness is latency on "Coding (Baseline)": a duration of 526,045 ms (about 8.8 minutes, 37th percentile) suggests the model may be slower than some peers on complex coding tasks despite its high accuracy there. Overall, DeepSeek-V3 stands out for high accuracy across diverse tasks, particularly ethics and general knowledge, at a moderate cost.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.3 |
Completion | $0.85 |
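As a rough illustration of how the per-token prices above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the example token counts are hypothetical; only the two prices come from the table.

```python
# Prices from the "Current Pricing" table, in USD per 1M tokens.
PROMPT_PRICE_PER_M = 0.30
COMPLETION_PRICE_PER_M = 0.85


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the estimated cost in USD for one request.

    Hypothetical helper: assumes billing is strictly linear in token
    count, with no minimum charge, caching discount, or rounding.
    """
    total = (
        prompt_tokens * PROMPT_PRICE_PER_M
        + completion_tokens * COMPLETION_PRICE_PER_M
    )
    return total / 1_000_000


# Example: a 12,000-token prompt with a 1,500-token completion.
print(f"${estimate_cost(12_000, 1_500):.6f}")  # prints "$0.004875"
```

Actual invoices may differ if a provider applies its own rounding or discounts, so treat the result as an estimate.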
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
DeepInfra | deepseek/deepseek-chat-v3 | 163K | $0.3 / 1M tokens | $0.85 / 1M tokens |
Novita | deepseek/deepseek-chat-v3 | 64K | $0.3 / 1M tokens | $0.85 / 1M tokens |
Nebius | deepseek/deepseek-chat-v3 | 163K | $0.3 / 1M tokens | $0.85 / 1M tokens |
Fireworks | deepseek/deepseek-chat-v3 | 131K | $0.3 / 1M tokens | $0.85 / 1M tokens |
DeepInfra | deepseek/deepseek-chat-v3 | 163K | $0.3 / 1M tokens | $0.85 / 1M tokens |
Targon | deepseek/deepseek-chat-v3 | 163K | $0.3 / 1M tokens | $0.85 / 1M tokens |
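Since the endpoints listed above share the same price and differ only in context length, one way to choose among them is to filter by the context a workload needs. The sketch below hard-codes the provider names and context labels from the listing (with the repeated DeepInfra entry collapsed); the helper names and the conversion of a label such as "163K" to 163,000 tokens are assumptions made for illustration.

```python
# (provider, context label) pairs copied from the endpoint listing.
ENDPOINTS = [
    ("DeepInfra", "163K"),
    ("Novita", "64K"),
    ("Nebius", "163K"),
    ("Fireworks", "131K"),
    ("Targon", "163K"),
]


def parse_context(label: str) -> int:
    """Convert a label such as '163K' to an approximate token count.

    Assumption: 'K' is treated as 1,000, so the result is a rounded
    figure rather than the exact limit a provider enforces.
    """
    return int(label.rstrip("K")) * 1000


def providers_with_context(min_tokens: int) -> list[str]:
    """Return providers whose listed context is at least min_tokens."""
    return [
        name
        for name, label in ENDPOINTS
        if parse_context(label) >= min_tokens
    ]


# Example: providers that can accept roughly 100K tokens of context.
print(providers_with_context(100_000))
```

For a workload near a provider's listed limit, check the exact figure with that provider, because the "K" labels are rounded.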
Other Models by DeepSeek
Model | Released | Params | Context | Modalities | Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
DeepSeek: R1 Distill Qwen 7B | May 30, 2025 | 7B | 131K | Text input, text output | ★ | ★ | $$$$ |
DeepSeek: Deepseek R1 0528 Qwen3 8B | May 29, 2025 | 8B | 131K | Text input, text output | ★ | ★★★★★ | $$$ |
DeepSeek: R1 0528 | May 28, 2025 | ~671B | 128K | Text input, text output | ★ | ★★★★★ | $$$$$ |
DeepSeek: DeepSeek Prover V2 | Apr 30, 2025 | ~671B | 131K | Text input, text output | ★★★★ | ★★★★★ | $$$$ |
DeepSeek: DeepSeek V3 0324 | Mar 24, 2025 | ~685B | 163K | Text input, text output | ★★★ | ★★★★★ | $$$ |
DeepSeek: R1 Distill Llama 8B | Feb 07, 2025 | 8B | 32K | Text input, text output | ★ | ★★★ | $$ |
DeepSeek: R1 Distill Qwen 1.5B | Jan 31, 2025 | 1.5B | 131K | Text input, text output | ★★★ | ★ | $$$ |
DeepSeek: R1 Distill Qwen 32B | Jan 29, 2025 | 32B | 131K | Text input, text output | ★ | ★★★★★ | $$$ |
DeepSeek: R1 Distill Qwen 14B | Jan 29, 2025 | 14B | 64K | Text input, text output | ★ | ★★★ | $$$ |
DeepSeek: R1 Distill Llama 70B | Jan 23, 2025 | 70B | 131K | Text input, text output | ★ | ★★★★★ | $$$$ |
DeepSeek: R1 | Jan 20, 2025 | ~671B | 128K | Text input, text output | ★★ | ★★★★ | $$$$ |