Author's Description
Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the blog post here.
Key Specifications
Supported Parameters
This model supports the following parameters:
Features
This model supports the following features:
Performance Summary
Inception: Mercury, the first diffusion large language model (dLLM), demonstrates exceptional speed and competitive pricing. It consistently ranks among the fastest models, achieving Infinityth percentile across 8 benchmarks, and offers among the most competitive pricing, also at the Infinityth percentile across 8 benchmarks. Mercury exhibits strong performance in several key areas. It achieves near-perfect accuracy in Ethics (100%), ranking in the top 3 for speed and being the most accurate model at its price point. In Hallucinations (Baseline), it shows high accuracy (98.0%), placing in the top 3 for speed and being the most accurate among models of comparable speed. General Knowledge (Baseline) also highlights its speed, achieving 97.5% accuracy and ranking #1 in speed. Coding (Baseline) is another strength, with 82.0% accuracy and a #1 speed ranking. However, the model shows significant weaknesses in Mathematics (Baseline) and Reasoning (Baseline), where it scored 0.0% accuracy in both. Email Classification (Baseline) and Instruction Following (Baseline) show moderate performance at 96.0% and 47.0% accuracy respectively, with the former being noted for its speed. Overall, Mercury excels in speed-sensitive applications and ethical considerations, but its current capabilities in complex mathematical and reasoning tasks are limited.
Model Pricing
Current Pricing
Feature | Price (per 1M tokens) |
---|---|
Prompt | $0.25 |
Completion | $1 |
Price History
Available Endpoints
Provider | Endpoint Name | Context Length | Pricing (Input) | Pricing (Output) |
---|---|---|---|---|
Inception
|
Inception | inception/mercury | 128K | $0.25 / 1M tokens | $1 / 1M tokens |
Benchmark Results
Benchmark | Category | Reasoning | Strategy | Free | Executions | Accuracy | Cost | Duration |
---|
Other Models by inception
|
Released | Params | Context |
|
Speed | Ability | Cost |
---|---|---|---|---|---|---|---|
Inception: Mercury Coder | Apr 30, 2025 | — | 128K |
Text input
Text output
|
★★★★★ | ★★ | $$$ |