inception

Inception: Mercury

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including with voice agents, search interfaces, and chatbots. Read more in the [blog post] (https://www.inceptionlabs.ai/blog/introducing-mercury) here.

Input Cost
$0.25
per 1M tokens
Output Cost
$1.00
per 1M tokens
Context Window
128,000
tokens
Compare vs GPT-4o
Developer ID: inception/mercury

Related Models

inception
$0.25/1M

Inception: Mercury Coder

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough ...

📝 128,000 ctx Compare →
writer
$0.60/1M

Writer: Palmyra X5

Palmyra X5 is Writer's most advanced model, purpose-built for building and scaling AI agen...

📝 1,040,000 ctx Compare →
openai
$2.50/1M

OpenAI: GPT Audio

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot fe...

📝 128,000 ctx Compare →