Author's Description
DeepSeek Prover V2 is a 671B parameter model, speculated to be geared towards logic and mathematics. Likely an upgrade from [DeepSeek-Prover-V1.5](https://huggingface.co/deepseek-ai/DeepSeek-Prover-V1.5-RL) Not much is known about the model yet, as DeepSeek released it on Hugging Face without an announcement or description.
Key Specifications
Supported Parameters
This model supports the following parameters:
Performance Summary
DeepSeek Prover V2, a 671B parameter model from DeepSeek, demonstrates competitive performance across various benchmarks. With a context length of 131072, it exhibits competitive response times, ranking in the 54th percentile for speed, and offers competitive pricing, placing in the 40th percentile. The model shows particular strength in tasks requiring complex reasoning and coding. It achieved 91.0% accuracy in the Coding (Baseline) benchmark and 84.0% in Reasoning (Baseline), both ranking in the 84th percentile for accuracy. This aligns with its speculated focus on logic and mathematics. Furthermore, it performed exceptionally well in Ethics (Baseline) and General Knowledge (Baseline), scoring 99.0% accuracy in both. A notable weakness appears in the Email Classification (Baseline) benchmark, where its 93.0% accuracy places it in the 25th percentile, suggesting room for improvement in specific classification tasks. While its cost performance is generally competitive, its duration for certain tasks, like Coding and Ethics, is on the higher side. Overall, DeepSeek Prover V2 presents a robust offering, particularly for tasks demanding strong logical and coding capabilities, with competitive pricing and moderate speed.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.5 |
Completion | $2.18 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
GMICloud
|
GMICloud | deepseek/deepseek-prover-v2 | 131K | $0.5 / 1M tokens | $2.18 / 1M tokens |
DeepInfra
|
DeepInfra | deepseek/deepseek-prover-v2 | 163K | $0.5 / 1M tokens | $2.18 / 1M tokens |
Novita
|
Novita | deepseek/deepseek-prover-v2 | 160K | $0.5 / 1M tokens | $2.18 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by deepseek
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
DeepSeek: R1 Distill Qwen 7B | May 30, 2025 | 7B | 131K |
Text input
Text output
|
★ | ★ | $$$$ |
DeepSeek: Deepseek R1 0528 Qwen3 8B | May 29, 2025 | 8B | 131K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
DeepSeek: R1 0528 | May 28, 2025 | ~671B | 128K |
Text input
Text output
|
★ | ★★★★★ | $$$$$ |
DeepSeek: DeepSeek V3 0324 | Mar 24, 2025 | ~685B | 163K |
Text input
Text output
|
★★★ | ★★★★★ | $$$ |
DeepSeek: R1 Distill Llama 8B | Feb 07, 2025 | 8B | 32K |
Text input
Text output
|
★ | ★★★ | $$ |
DeepSeek: R1 Distill Qwen 1.5B | Jan 31, 2025 | 5B | 131K |
Text input
Text output
|
★★★ | ★ | $$$ |
DeepSeek: R1 Distill Qwen 32B | Jan 29, 2025 | 32B | 131K |
Text input
Text output
|
★ | ★★★★★ | $$$ |
DeepSeek: R1 Distill Qwen 14B | Jan 29, 2025 | 14B | 64K |
Text input
Text output
|
★ | ★★★ | $$$ |
DeepSeek: R1 Distill Llama 70B | Jan 23, 2025 | 70B | 131K |
Text input
Text output
|
★ | ★★★★★ | $$$$ |
DeepSeek: R1 | Jan 20, 2025 | ~671B | 128K |
Text input
Text output
|
★★ | ★★★★ | $$$$ |
DeepSeek: DeepSeek V3 | Dec 26, 2024 | — | 163K |
Text input
Text output
|
★★★ | ★★★★ | $$$ |