Loading...
Loading...
Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the [blog post] (https://www.inceptionlabs.ai/blog/introducing-mercury) here.
Preço/1M
$0.38
357th mais barato
21% acima da mediana
Top 53%
Janela de Contexto
128K
225th maior
Top 75%
Entrada
$0.25
por 1M tokens
Saída
$0.75
por 1M tokens
Combinado
$0.38
por 1M tokens
Mais barato que 47% dos modelos. Preço mediano é $0.31/1M tokens.
Diário
$0.38
Mensal
$11.25
Janela de Contexto
128K
tokens
Maior que 25% dos modelos
Saída Máxima
32K
tokens
25% do contexto
Comparação de Janela de Contexto