DeepSeek V3.1 Base: view detailed information and pricing for AI models

Context length of 163,840 tokens, provided by deepseek

163,840

Context Tokens

$0.25

Prompt Price

$1.00

Output Price

9/16

Feature Support

Model Introduction

This is a base model, trained only for raw next-token prediction. Unlike instruct/chat models, it has not been fine-tuned to follow user instructions. Prompts therefore need to read like training text or worked examples that the model can continue, rather than like direct requests (for example, a passage that begins “Translate the following sentence…” and sets up the text to be continued, rather than a bare “Translate this”).

DeepSeek-V3.1 Base is a 671B-parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.
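As a rough illustration of prompting a base model, the sketch below formats a translation task as a short document that ends on an open line, so the most natural continuation is the answer. The function name `build_few_shot_prompt`, the `English:`/`French:` labels, and the sample sentences are all hypothetical choices made for this example; nothing here is prescribed by the model or its provider.

```python
def build_few_shot_prompt(examples, query, src="English", tgt="French"):
    """Format translation pairs as a document a base model can continue.

    Hypothetical helper: the layout is one reasonable convention, not a
    format required by DeepSeek V3.1 Base.
    """
    lines = [f"{src}-to-{tgt} translations", ""]
    for source_text, target_text in examples:
        lines.append(f"{src}: {source_text}")
        lines.append(f"{tgt}: {target_text}")
        lines.append("")
    # End on an open target line so the likely continuation is the translation.
    lines.append(f"{src}: {query}")
    lines.append(f"{tgt}:")
    return "\n".join(lines)


prompt = build_few_shot_prompt(
    [("Good morning.", "Bonjour."), ("Thank you very much.", "Merci beaucoup.")],
    "Where is the train station?",
)
print(prompt)
```

When sending such a prompt, it usually helps to cap the output length or stop at the first newline, since a base model will otherwise keep extending the pattern with further invented pairs.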

Basic Information

Developer

deepseek

Model Series

DeepSeek

Release Date

2025-08-20

Context Length

163,840 tokens

Variant

standard

Pricing Information

Prompt Tokens

$0.25 / 1M tokens

Completion Tokens

$1.00 / 1M tokens
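The per-million-token prices listed here translate into a request cost with simple arithmetic. The following is a minimal sketch using the two listed rates; the function name `estimate_cost` and the example token counts are assumptions made for illustration, and actual billing may round or meter tokens differently.

```python
from decimal import Decimal

# Rates taken from the listing: USD per 1M tokens.
PROMPT_PRICE_PER_M = Decimal("0.25")
COMPLETION_PRICE_PER_M = Decimal("1.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    prompt_cost = Decimal(prompt_tokens) * PROMPT_PRICE_PER_M
    completion_cost = Decimal(completion_tokens) * COMPLETION_PRICE_PER_M
    return (prompt_cost + completion_cost) / TOKENS_PER_M


# Example: a 120,000-token prompt with a 2,000-token completion.
cost = estimate_cost(120_000, 2_000)
print(f"Estimated cost: ${cost}")
```

For the example request, the prompt accounts for $0.030 and the completion for $0.002, or $0.032 in total.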

Data Policy

Terms of Service

Training Policy

Supported Features

Supported (9)

Top K

Seed

Frequency Penalty

Presence Penalty

Repetition Penalty

Min P

Logit Bias

Logprobs

Top Logprobs

Not supported (7)

Image Input

Response Format

Tool Use

Structured Outputs

Reasoning

Web Search Options

Top A
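One way to use the support list is to filter request parameters before sending them, so unsupported options are caught early. The sketch below assumes snake_case parameter keys in the style of common OpenAI-compatible APIs (`top_k`, `min_p`, `logit_bias`, and so on); the listing itself gives only display names, so the key names and the helper `split_params` are assumptions rather than a documented interface.

```python
# Assumed request keys for the nine features listed as supported.
SUPPORTED_PARAMS = {
    "top_k",
    "seed",
    "frequency_penalty",
    "presence_penalty",
    "repetition_penalty",
    "min_p",
    "logit_bias",
    "logprobs",
    "top_logprobs",
}


def split_params(requested: dict) -> tuple:
    """Split requested options into (accepted dict, sorted rejected keys)."""
    accepted = {k: v for k, v in requested.items() if k in SUPPORTED_PARAMS}
    rejected = sorted(k for k in requested if k not in SUPPORTED_PARAMS)
    return accepted, rejected


accepted, rejected = split_params(
    {"seed": 42, "min_p": 0.05, "top_a": 0.2, "tools": []}
)
print("send:", accepted)
print("drop:", rejected)
```

Here `seed` and `min_p` pass through, while `top_a` and `tools` are flagged because Top A and tool use are listed as not supported.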

Real-World Usage Statistics

No recent usage data available.

Models from the Same Author (deepseek)

DeepSeek V3.1

163,840 tokens

$0.25 / $1.00

DeepSeek V3.1 (free)

32,768 tokens

Free

DeepSeek V3.1 (thinking)

131,072 tokens

$0.55 / $2.19

R1 Distill Qwen 7B

131,072 tokens

$0.00 / $0.00
