Deepseek R1 0528 Qwen3 8B Check detailed information and pricing for AI models

Context Length 131,072 tokens, deepseek from provided

131,072
Context Tokens
$0.05
Prompt Price
$0.10
Output Price
5/16
Feature Support
legal #3
Reasoning #14

Model Overview

DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning and inference to the brink of flagship models like O3 and Gemini 2.5 Pro. It now tops math, programming, and logic leaderboards, showcasing a step-change in depth-of-thought. The distilled variant, DeepSeek-R1-0528-Qwen3-8B, transfers this chain-of-thought into an 8 B-parameter form, beating standard Qwen3 8B by +10 pp and tying the 235 B “thinking” giant on AIME 2024.

Basic Information

Developer
deepseek
Model Series
Qwen
Release Date
2025-05-29
Context Length
131,072 tokens
Max Completion Tokens
131,072 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.05 / 1M tokens
Completion Tokens
$0.10 / 1M tokens

Supported Features

Supported (5)

Top K
Frequency Penalty
Presence Penalty
Repetition Penalty
Reasoning

Unsupported (11)

Image Input
Seed
Response Format
Min P
Logit Bias
Tool Usage
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A

Other Variants

Actual Usage Statistics

#96
Out of 346 total models
4.4B
Total Tokens Last 30 Days
220.39M
Daily Average Usage
192%
Weekly Usage Change

Usage Trend for the Last 30 Days

Models by Same Author (deepseek)

R1 Distill Qwen 7B
131,072 tokens
$0.10 / $0.20
R1 0528 (free)
163,840 tokens
Free
R1 0528
128,000 tokens
$0.50 / $2.15
DeepSeek Prover V2 (free)
163,840 tokens
Free
DeepSeek Prover V2
131,072 tokens
$0.50 / $2.18

Similar Price Range Models

Gemma 3 12B
google
131,072 tokens
$0.05 / $0.10
Phi 4 Multimodal Instruct
microsoft
131,072 tokens
$0.05 / $0.10
Mistral Small 3
mistralai
32,768 tokens
$0.05 / $0.09
Qwen2.5 7B Instruct
qwen
32,768 tokens
$0.04 / $0.10
Devstral Small
mistralai
128,000 tokens
$0.06 / $0.12