Inception: Mercury Coder

Text input Text output
Author's Description

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku...

Key Specifications
Cost
$$$
Context
128K
Released
Apr 30, 2025
Speed
Ability
Reliability
Supported Parameters

This model supports the following parameters:

Tools Structured Outputs Response Format Stop Temperature Tool Choice Max Tokens
Features

This model supports the following features:

Structured Outputs Response Format Tools
Performance Summary

Inception's Mercury Coder, a novel diffusion large language model, demonstrates exceptional speed and competitive pricing, consistently ranking in the Infinityth percentile across multiple benchmarks for both metrics. Its reliability is strong, with an 84% success rate across seven benchmarks, indicating consistent operational performance. Mercury Coder excels in speed across various tasks, notably achieving top-tier performance in Hallucinations (32793ms), General Knowledge (89846ms), Ethics (47743ms), and Coding (60827ms), even securing the #1 spot and "Speed Champion" designation in Ethics with near-perfect accuracy. This speed is a core strength, enabling rapid iteration and responsive interactions. While its speed and cost-efficiency are standout features, the model exhibits varied accuracy across categories. It performs well in Hallucinations (96.0%) and Ethics (98.0%), but shows moderate performance in General Knowledge (93.5%), Instruction Following (54.0%), and Coding (80.0%). A significant weakness is observed in Mathematics, where it scored 0.0% accuracy, suggesting a critical area for improvement. Email Classification also shows room for growth at 92.0% accuracy. Overall, Mercury Coder's breakthrough speed and cost-effectiveness make it a compelling option, particularly for applications where rapid response times are paramount, though its mathematical capabilities require further development.

Model Pricing

Current Pricing

Feature Price (per 1M tokens)
Prompt $0.25
Completion $0.75
Input Cache Read $0.025

Price History

Available Endpoints
Provider Endpoint Name Context Length Pricing (Input) Pricing (Output)
Inception
Inception | inception/mercury-coder-small-beta 128K $0.25 / 1M tokens $0.75 / 1M tokens
Benchmark Results
Benchmark Category Reasoning Strategy Free Executions Accuracy Cost Duration
Other Models by inception