ERNIE 4.5 21B A3B Check detailed information and pricing for AI models

Context Length 120,000 tokens, baidu from provided

120,000
Context Tokens
$0.07
Prompt Price
$0.28
Output Price
7/16
Feature Support

Model Overview

A sophisticated text-based Mixture-of-Experts (MoE) model featuring 21B total parameters with 3B activated per token, delivering exceptional multimodal understanding and generation through heterogeneous MoE structures and modality-isolated routing. Supporting an extensive 131K token context length, the model achieves efficient inference via multi-expert parallel collaboration and quantization, while advanced post-training techniques including SFT, DPO, and UPO ensure optimized performance across diverse applications with specialized routing and balancing losses for superior task handling.

Basic Information

Developer
baidu
Model Series
Other
Release Date
2025-08-12
Context Length
120,000 tokens
Max Completion Tokens
8,000 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.07 / 1M tokens
Completion Tokens
$0.28 / 1M tokens

Supported Features

Supported (7)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias

Unsupported (9)

Image Input
Response Format
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Reasoning
Web Search Options
Top A

Actual Usage Statistics

#269
Out of 353 total models
126.83M
Total Tokens Last 30 Days
25.37M
Daily Average Usage
-
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (baidu)

ERNIE 4.5 VL 28B A3B
30,000 tokens
$0.14 / $0.56
ERNIE 4.5 VL 424B A47B
123,000 tokens
$0.42 / $1.25
ERNIE 4.5 300B A47B
123,000 tokens
$0.28 / $1.10

Similar Price Range Models

Devstral Small 1.1
mistralai
128,000 tokens
$0.07 / $0.28
gpt-oss-120b
openai
131,000 tokens
$0.07 / $0.28
Qwen3 30B A3B Thinking 2507
qwen
262,144 tokens
$0.07 / $0.29
Gemma 3 27B
google
96,000 tokens
$0.07 / $0.27
Gemini 1.5 Flash
google
1,000,000 tokens
$0.08 / $0.30