Author's Description
Cogito v2 is a multilingual, instruction-tuned Mixture of Experts (MoE) large language model with 671 billion parameters. It supports both standard and reasoning-based generation modes. The model introduces hybrid reasoning via Iterated Distillation and Amplification (IDA)—an iterative self-improvement strategy designed to scale alignment with general intelligence. Cogito v2 has been optimized for STEM, programming, instruction following, and tool use. It supports 128k context length and offers strong performance in both multilingual and code-heavy environments. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Deep Cogito's Cogito V2 Preview Deepseek 671B, a multilingual MoE model, demonstrates exceptional speed, consistently ranking among the fastest models across seven benchmarks. Its pricing is moderate, positioned at the 36th percentile. The model's core strengths lie in its advanced reasoning and mathematical capabilities. It achieved an impressive 88.0% accuracy in Reasoning, placing it in the 78th percentile, and an outstanding 94.0% accuracy in Mathematics, ranking in the 89th percentile. These results highlight its proficiency in complex problem-solving and quantitative tasks, aligning with its optimization for STEM fields. However, the model exhibits significant weaknesses in other critical areas. It scored 0.0% accuracy across Instruction Following, Coding, General Knowledge, Email Classification, and Ethics benchmarks. This indicates a substantial gap in its ability to follow instructions, perform coding tasks, recall general information, classify emails accurately, and navigate ethical scenarios. While optimized for STEM and programming, the benchmark results for coding do not reflect this optimization. The model's hybrid reasoning via Iterated Distillation and Amplification (IDA) appears to be highly effective for specific analytical tasks but does not translate to broader instruction adherence or factual recall in its current preview state.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $1.25 |
Completion | $1.25 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Together
|
Together | deepcogito/cogito-v2-preview-deepseek-671b | 163K | $1.25 / 1M tokens | $1.25 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by deepcogito
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Deep Cogito: Cogito V2 Preview Llama 405B Unavailable | Oct 17, 2025 | 405B | 32K |
Text input
Text output
|
— | — | — |
Deep Cogito: Cogito V2 Preview Llama 70B | Sep 02, 2025 | 70B | 32K |
Text input
Text output
|
— | — | — |
Cogito V2 Preview Llama 109B | Sep 02, 2025 | 109B | 32K |
Image input
Text input
Text output
|
★★★★★ | ★ | $$ |