R1 Distill Llama 70B AI 모델의 상세 정보와 가격을 확인하세요

컨텍스트 길이 131,072 토큰, deepseek 에서 제공

131,072

컨텍스트 토큰

$0.03

프롬프트 가격

$0.13

출력 가격

10/16

기능 지원

추론 7위

모델 소개

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including: - AIME 2024 pass@1: 70.0 - MATH-500 pass@1: 94.5 - CodeForces Rating: 1633 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

기본 정보

개발사

deepseek

모델 시리즈

Llama3

출시일

2025-01-23

컨텍스트 길이

131,072 토큰

변형

standard

가격 정보

프롬프트 토큰

$0.03 / 1M 토큰

완료 토큰

$0.13 / 1M 토큰

데이터 정책

이용약관

학습 정책

지원 기능

지원됨 (10)

Top K

Seed

Frequency Penalty

Presence Penalty

Repetition Penalty

Min P

Logit Bias

Logprobs

Top Logprobs

추론

미지원 (6)

이미지 입력

Response Format

도구 사용

구조화된 출력

Web Search Options

Top A

다른 변형

R1 Distill Llama 70B (free)

free

무료

실제 사용량 통계

#70

전체 353개 모델 중

19.7B

최근 30일 총 토큰

658.17M

일평균 사용량

44%

주간 사용량 변화

최근 30일 사용량 추이

동일 제작사 모델 (deepseek)

DeepSeek V3.1

163,840 토큰

$0.25 / $1.00

상세보기

DeepSeek V3.1 (free)

32,768 토큰

무료

상세보기

DeepSeek V3.1 (thinking)

131,072 토큰

$0.55 / $2.19

상세보기

DeepSeek V3.1 Base

163,840 토큰

$0.25 / $1.00

상세보기

R1 Distill Qwen 7B

131,072 토큰

$0.00 / $0.00

상세보기

유사 가격대 모델

InternVL3 78B

opengvlab

32,768 토큰

$0.03 / $0.13

상세보기

Qwen3 32B

qwen

40,960 토큰

$0.03 / $0.13

상세보기

Dolphin3.0 Mistral 24B

cognitivecomputations

32,768 토큰

$0.03 / $0.11

상세보기