Mercury Coder Small Beta
Context Tokens
32,000
Prompt Price
$0.25 / 1M tokens
Output Price
$1.00 / 1M tokens
Feature Support
2 of 16 features supported
Model Overview
Mercury Coder Small is the first diffusion large language model (dLLM). Using a discrete diffusion approach, the model runs 5-10x faster than speed-optimized models such as Claude 3.5 Haiku and GPT-4o Mini while matching their performance. That speed lets developers stay in the flow while coding, with rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the [blog post](https://www.inceptionlabs.ai/introducing-mercury).
Basic Information
Developer
inception
Model Series
Other
Release Date
2025-04-30
Context Length
32,000 tokens
Variant
standard
Pricing Information
Prompt Tokens
$0.25 / 1M tokens
Completion Tokens
$1.00 / 1M tokens
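As a worked example of the rates above, the following sketch estimates the cost of a single request. The helper name `estimate_cost_usd` and the token counts are hypothetical and chosen only for illustration; the two rates are the ones listed on this page.

```python
from decimal import Decimal

# Listed rates, in USD per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.25")
COMPLETION_PRICE_PER_M = Decimal("1.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the cost of one request from its token counts (hypothetical helper)."""
    if prompt_tokens < 0 or completion_tokens < 0:
        raise ValueError("token counts must be non-negative")
    prompt_cost = PROMPT_PRICE_PER_M * prompt_tokens / TOKENS_PER_M
    completion_cost = COMPLETION_PRICE_PER_M * completion_tokens / TOKENS_PER_M
    return prompt_cost + completion_cost


# Illustrative request: 20,000 prompt tokens and 2,000 completion tokens.
print(estimate_cost_usd(20_000, 2_000))
```

With these illustrative counts the prompt side contributes $0.005 and the completion side $0.002. `Decimal` is used so the small per-request amounts are not distorted by binary floating-point rounding.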
Supported Features
Supported (2)
Frequency Penalty
Presence Penalty
Unsupported (14)
Image Input
Top K
Seed
Repetition Penalty
Response Format
Min P
Logit Bias
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Reasoning
Web Search Options
Top A
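One way to act on the list above is to drop unsupported options from a request before sending it. The sketch below is a minimal illustration under stated assumptions: the snake_case keys (for example `top_k` or `logit_bias`) are assumed spellings of the feature names listed here, not names confirmed by this page, and `split_request_params` is a hypothetical helper. Image Input is left out because it describes message content rather than a request option.

```python
# Assumed snake_case request keys for the features listed as unsupported.
UNSUPPORTED_PARAMS = {
    "top_k", "seed", "repetition_penalty", "response_format", "min_p",
    "logit_bias", "tools", "logprobs", "top_logprobs", "structured_outputs",
    "reasoning", "web_search_options", "top_a",
}


def split_request_params(params: dict) -> tuple[dict, list[str]]:
    """Return (params to keep, sorted names of dropped params).

    Keys that this page does not list at all (for example a temperature
    setting) are passed through unchanged, since their support is unknown.
    """
    kept = {k: v for k, v in params.items() if k not in UNSUPPORTED_PARAMS}
    dropped = sorted(k for k in params if k in UNSUPPORTED_PARAMS)
    return kept, dropped


kept, dropped = split_request_params(
    {"frequency_penalty": 0.2, "presence_penalty": 0.1, "seed": 42, "top_k": 40}
)
print(kept)     # the two penalty settings remain
print(dropped)  # names of the options that were removed
```

Returning the dropped names, rather than discarding them silently, lets the caller log or warn about options that would have had no effect.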
Actual Usage Statistics
#240
Out of 345 total models
127.93M
Total Tokens Last 30 Days
4.26M
Daily Average Usage
54%
Weekly Usage Change
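The daily average appears to be the 30-day total divided by 30. The page does not state this, so the short check below treats it as an assumption; `daily_average_millions` is a hypothetical helper name.

```python
def daily_average_millions(total_millions: float, days: int) -> float:
    """Average tokens per day, in millions, over a window of `days` days."""
    if days <= 0:
        raise ValueError("days must be positive")
    return total_millions / days


# 127.93M tokens over 30 days, rounded to two decimals as on the page.
print(round(daily_average_millions(127.93, 30), 2))
```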