DeepSeek: DeepSeek V3.2 Exp

Text input Text output
Author's Description

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism designed to improve training and inference efficiency in long-context scenarios while maintaining output quality. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config) The model was trained under conditions aligned with V3.1-Terminus to enable direct comparison. Benchmarking shows performance roughly on par with V3.1 across reasoning, coding, and agentic tool-use tasks, with minor tradeoffs and gains depending on the domain. This release focuses on validating architectural optimizations for extended context lengths rather than advancing raw task accuracy, making it primarily a research-oriented model for exploring efficient transformer designs.

Key Specifications
Context
131K
Released
Sep 29, 2025
Supported Parameters

This model supports the following parameters:

Frequency Penalty Include Reasoning Temperature Top Logprobs Max Tokens Presence Penalty Reasoning Top P Stop Logprobs
Features

This model supports the following features:

Reasoning
Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.28
Completion $0.42
Input Cache Read $0.028

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
DeepSeek
DeepSeek | deepseek/deepseek-v3.2-exp 131K $0.28 / 1M tokens $0.42 / 1M tokens
Novita
Novita | deepseek/deepseek-v3.2-exp 163K $0.27 / 1M tokens $0.41 / 1M tokens
Other Models by deepseek