Mercury Coder Check detailed information and pricing for AI models

Context Length 128,000 tokens, inception from provided

128,000

Context Tokens

$0.25

Prompt Price

$1.00

Output Price

6/16

Feature Support

Model Overview

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku and GPT-4o Mini while matching their performance. Mercury Coder's speed means that developers can stay in the flow while coding, enjoying rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the [blog post here](https://www.inceptionlabs.ai/introducing-mercury).

Basic Information

Developer

inception

Model Series

Other

Release Date

2025-04-30

Context Length

128,000 tokens

Max Completion Tokens

16,384 tokens

Variant

standard

Pricing Information

Prompt Tokens

$0.25 / 1M tokens

Completion Tokens

$1.00 / 1M tokens

Data Policy

Supported Features

Supported (6)

Top K

Frequency Penalty

Presence Penalty

Response Format

Tool Usage

Structured Outputs

Unsupported (10)

Image Input

Seed

Repetition Penalty

Min P

Logit Bias

Logprobs

Top Logprobs

Reasoning

Web Search Options

Top A

Actual Usage Statistics

#224

Out of 353 total models

339.57M

Total Tokens Last 30 Days

11.32M

Daily Average Usage

58%

Weekly Usage Change

Usage Trend for the Last 30 Days

View Details