DeepSeek V3.1 Base Check detailed information and pricing for AI models

Context Length 163,840 tokens, deepseek from provided

163,840

Context Tokens

$0.25

Prompt Price

$1.00

Output Price

9/16

Feature Support

Model Overview

This is a base model, trained only for raw next-token prediction. Unlike instruct/chat models, it has not been fine-tuned to follow user instructions. Prompts need to be written more like training text or examples rather than simple requests (e.g., “Translate the following sentence…” instead of just “Translate this”). DeepSeek-V3.1 Base is a 671B parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.

Basic Information

Developer

deepseek

Model Series

DeepSeek

Release Date

2025-08-20

Context Length

163,840 tokens

Variant

standard

Pricing Information

Prompt Tokens

$0.25 / 1M tokens

Completion Tokens

$1.00 / 1M tokens

Data Policy

학습 정책

Supported Features

Supported (9)

Top K

Seed

Frequency Penalty

Presence Penalty

Repetition Penalty

Min P

Logit Bias

Logprobs

Top Logprobs

Unsupported (7)

Image Input

Response Format

Tool Usage

Structured Outputs

Reasoning

Web Search Options

Top A

Actual Usage Statistics

No recent usage data available.

Models by Same Author (deepseek)

DeepSeek V3.1

163,840 tokens

$0.25 / $1.00

View Details

DeepSeek V3.1 (free)

32,768 tokens

Free

View Details

DeepSeek V3.1 (thinking)

131,072 tokens

$0.55 / $2.19

View Details

R1 Distill Qwen 7B

131,072 tokens

$0.00 / $0.00

View Details