Loading...
Loading...
Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku and GPT-4o Mini while matching their performance. Mercury Coder's speed means that developers can stay in the flow while coding, enjoying rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the [blog post here](https://www.inceptionlabs.ai/blog/introducing-mercury).
Price/1M
$0.38
357th cheapest
21% above median
Top 53%
Context Window
128K
225th largest
Top 75%
Input
$0.25
per 1M tokens
Output
$0.75
per 1M tokens
Blended
$0.38
per 1M tokens
Cheaper than 47% of models. Median price is $0.31/1M tokens.
Daily
$0.38
Monthly
$11.25
Context Window
128K
tokens
Larger than 25% of models
Max Output
32K
tokens
25% of context
Context Window Comparison