DeepSeek V3.1 Base

Context length: 163,840 tokens, provided by deepseek

Context Tokens
163,840
Prompt Price
$0.20 / 1M tokens
Output Price
$0.80 / 1M tokens
Feature Support
9/16

Model Overview

This is a base model, trained only for raw next-token prediction. Unlike instruct or chat models, it has not been fine-tuned to follow user instructions, so prompts should read like training text or worked examples rather than bare requests (e.g., “Translate the following sentence…” instead of just “Translate this”).

DeepSeek-V3.1 Base is a 671B-parameter open Mixture-of-Experts (MoE) language model with 37B parameters active per forward pass and a stated context length of 128K tokens (this listing reports 163,840 tokens). It was trained on 14.8T tokens using FP8 mixed precision, which the description credits for high training efficiency and stability, and it performs strongly across language, reasoning, math, and coding tasks.
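As a rough illustration of the prompting style described above, the sketch below formats a few translation pairs as a document that the model can continue, instead of issuing an instruction. The function name `build_few_shot_prompt`, the `English:`/`French:` labels, and the example sentences are all hypothetical choices made for this example; they do not come from the listing.

```python
def build_few_shot_prompt(examples, query, src="English", dst="French"):
    """Format translation pairs as a document for a base model to continue.

    All labels and names here are illustrative assumptions.
    """
    lines = []
    for source_text, target_text in examples:
        lines.append(f"{src}: {source_text}")
        lines.append(f"{dst}: {target_text}")
        lines.append("")  # blank line separates one example from the next
    # End on the open target label so the natural continuation is the translation.
    lines.append(f"{src}: {query}")
    lines.append(f"{dst}:")
    return "\n".join(lines)


examples = [
    ("Good morning.", "Bonjour."),
    ("Thank you very much.", "Merci beaucoup."),
]
prompt = build_few_shot_prompt(examples, "Where is the station?")
print(prompt)
```

Because a base model simply keeps predicting text, it will likely go on to invent further example pairs after the answer. Capping the output length, or stopping at the first newline if the serving API offers a stop condition, is one way to keep only the first completed line.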

Basic Information

Developer
deepseek
Model Series
DeepSeek
Release Date
2025-08-20
Context Length
163,840 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.20 / 1M tokens
Completion Tokens
$0.80 / 1M tokens
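To make the per-token pricing concrete, here is a minimal cost estimate using the prices and context length shown on this page. The function name `estimate_cost` is hypothetical, and the check that prompt and completion tokens share one context window is an assumption about how the limit is applied, not something the listing states.

```python
PROMPT_PRICE_PER_M = 0.20      # USD per 1M prompt tokens (from the listing)
COMPLETION_PRICE_PER_M = 0.80  # USD per 1M completion tokens (from the listing)
CONTEXT_LIMIT = 163_840        # tokens (from the listing)


def estimate_cost(prompt_tokens, completion_tokens):
    """Return the estimated cost in USD for a single request."""
    # Assumption: prompt and completion tokens count against the same window.
    if prompt_tokens + completion_tokens > CONTEXT_LIMIT:
        raise ValueError("request exceeds the listed context length")
    total = (prompt_tokens * PROMPT_PRICE_PER_M
             + completion_tokens * COMPLETION_PRICE_PER_M)
    return total / 1_000_000


cost = estimate_cost(prompt_tokens=100_000, completion_tokens=2_000)
print(f"${cost:.4f}")  # $0.0216
```

In this example the 100,000 prompt tokens cost $0.0200 and the 2,000 completion tokens cost $0.0016, so long prompts dominate the bill even though completion tokens are four times as expensive per token.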

Data Policy

Terms of Service

Training Policy

1

Supported Features

Supported (9)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Logprobs
Top Logprobs

Unsupported (7)

Image Input
Response Format
Tool Usage
Structured Outputs
Reasoning
Web Search Options
Top A
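One way to act on the feature lists above is to reject unsupported options before a request is sent. The sketch below is only an illustration: the snake_case parameter names are assumed renderings of the feature names on this page, the model identifier is hypothetical, and the payload shape is a generic completion-style request, not a documented API. Image Input is left out because it describes an input type, not a request option.

```python
# Assumed snake_case names for the features listed as unsupported on this page.
UNSUPPORTED = {
    "response_format", "tools", "structured_outputs",
    "reasoning", "web_search_options", "top_a",
}

MODEL_ID = "deepseek/deepseek-v3.1-base"  # hypothetical identifier


def build_request(prompt, **params):
    """Build a completion-style payload, rejecting unsupported options."""
    rejected = sorted(set(params) & UNSUPPORTED)
    if rejected:
        raise ValueError("not supported by this listing: " + ", ".join(rejected))
    # Options the page does not mention at all are passed through unchanged.
    return {"model": MODEL_ID, "prompt": prompt, **params}


payload = build_request(
    "English: Good morning.\nFrench:",
    top_k=40,
    min_p=0.05,
    seed=7,
    repetition_penalty=1.1,
    logprobs=True,
    top_logprobs=5,
)
print(sorted(payload))
```

Failing early in client code keeps the error close to its cause, whereas a provider may silently ignore an option it does not support.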

Actual Usage Statistics

No recent usage data available.

Models by Same Author (deepseek)

Model | Context Length | Price (prompt / completion)
DeepSeek V3.1 | 163,840 tokens | $0.20 / $0.80
DeepSeek V3.1 (free) | 64,000 tokens | Free
DeepSeek V3.1 (thinking) | 131,072 tokens | $0.55 / $2.19
R1 Distill Qwen 7B | 131,072 tokens | $0.00 / $0.00
DeepSeek R1 0528 Qwen3 8B (free) | 131,072 tokens | Free

Similar Price Range Models

Model | Developer | Context Length | Price (prompt / completion)
DeepSeek V3.1 | deepseek | 163,840 tokens | $0.20 / $0.80
R1 0528 | deepseek | 163,840 tokens | $0.20 / $0.80
Qwen3 Coder 480B A35B | qwen | 262,144 tokens | $0.20 / $0.80
DeepSeek V3 0324 | deepseek | 163,840 tokens | $0.20 / $0.80
MAI DS R1 | microsoft | 163,840 tokens | $0.20 / $0.80