DeepSeek V3.1 Base Check detailed information and pricing for AI models
Context Length 163,840 tokens, deepseek from provided
163,840
Context Tokens
$0.20
Prompt Price
$0.80
Output Price
9/16
Feature Support
Model Overview
This is a base model, trained only for raw next-token prediction. Unlike instruct/chat models, it has not been fine-tuned to follow user instructions. Prompts need to be written more like training text or examples rather than simple requests (e.g., “Translate the following sentence…” instead of just “Translate this”). DeepSeek-V3.1 Base is a 671B parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.
Basic Information
Developer
deepseek
Model Series
DeepSeek
Release Date
2025-08-20
Context Length
163,840 tokens
Variant
standard
Pricing Information
Prompt Tokens
$0.20 / 1M tokens
Completion Tokens
$0.80 / 1M tokens
Data Policy
Terms of Service
학습 정책
1
Supported Features
Supported (9)
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Logprobs
Top Logprobs
Unsupported (7)
Image Input
Response Format
Tool Usage
Structured Outputs
Reasoning
Web Search Options
Top A