Deep Cogito: Cogito V2 Preview Deepseek 671B

Text input Text output
Author's Description

Cogito v2 is a multilingual, instruction-tuned Mixture of Experts (MoE) large language model with 671 billion parameters. It supports both standard and reasoning-based generation modes. The model introduces hybrid reasoning via Iterated Distillation and Amplification (IDA)—an iterative self-improvement strategy designed to scale alignment with general intelligence. Cogito v2 has been optimized for STEM, programming, instruction following, and tool use. It supports 128k context length and offers strong performance in both multilingual and code-heavy environments. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)

Key Specifications
Cost
$$$$
Context
163K
Parameters
671B
Released
Sep 02, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Include Reasoning Stop Max Tokens Top P Frequency Penalty Reasoning Logit Bias Min P Temperature Presence Penalty
Features

This model supports the following features:

Reasoning
Performance Summary

Deep Cogito's Cogito V2 Preview Deepseek 671B, a multilingual MoE model, demonstrates exceptional speed, consistently ranking among the fastest models across seven benchmarks. Its pricing is moderate, positioned at the 36th percentile. The model's core strengths lie in its advanced reasoning and mathematical capabilities. It achieved an impressive 88.0% accuracy in Reasoning, placing it in the 78th percentile, and an outstanding 94.0% accuracy in Mathematics, ranking in the 89th percentile. These results highlight its proficiency in complex problem-solving and quantitative tasks, aligning with its optimization for STEM fields. However, the model exhibits significant weaknesses in other critical areas. It scored 0.0% accuracy across Instruction Following, Coding, General Knowledge, Email Classification, and Ethics benchmarks. This indicates a substantial gap in its ability to follow instructions, perform coding tasks, recall general information, classify emails accurately, and navigate ethical scenarios. While optimized for STEM and programming, the benchmark results for coding do not reflect this optimization. The model's hybrid reasoning via Iterated Distillation and Amplification (IDA) appears to be highly effective for specific analytical tasks but does not translate to broader instruction adherence or factual recall in its current preview state.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $1.25
Completion $1.25

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Together
Together | deepcogito/cogito-v2-preview-deepseek-671b 163K $1.25 / 1M tokens $1.25 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by deepcogito