Inception: Mercury Coder

Text input Text output
Author's Description

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku and GPT-4o Mini while matching their performance. Mercury Coder's speed means that developers can stay in the flow while coding, enjoying rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the [blog post here](https://www.inceptionlabs.ai/introducing-mercury).

Key Specifications
Cost
$$$
Context
128K
Released
Apr 30, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Presence Penalty Stop Top P Tool Choice Temperature Tools Structured Outputs Response Format Frequency Penalty Max Tokens
Features

This model supports the following features:

Tools Structured Outputs Response Format
Performance Summary

Inception: Mercury Coder, launched on April 30, 2025, is a groundbreaking diffusion large language model (dLLM) that consistently performs among the fastest models, ranking in the 99th percentile for speed across various benchmarks. It also offers competitive pricing, typically providing cost-effective solutions (62nd percentile), and demonstrates exceptional reliability with minimal technical failures (98th percentile). Across benchmark categories, Mercury Coder showcases remarkable speed, securing top 3 positions in Coding, Email Classification, and General Knowledge, and achieving #1 in Reasoning and Ethics, where it is noted as a "Speed champion" with near-perfect accuracy. While its speed is a clear strength, its accuracy varies. It achieved 80.0% in Coding (50th percentile) and 58.0% in Reasoning (49th percentile), indicating average performance in these areas. Its 92.0% accuracy in Email Classification, however, places it at the 21st percentile, suggesting a relative weakness in this specific classification task despite its speed. Conversely, it excels in Ethics (98.0% accuracy) and General Knowledge (93.5% accuracy), demonstrating strong capabilities in these domains. Its breakthrough discrete diffusion approach enables 5-10x faster operation than speed-optimized competitors, making it highly appealing for rapid iteration and responsive code completion.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.25
Completion $1

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Inception
Inception | inception/mercury-coder-small-beta 128K $0.25 / 1M tokens $1 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Free Executions Accuracy Cost Duration
Other Models by inception