Llama 3.1 Nemotron Ultra 253B v1 AI 모델의 상세 정보와 가격을 확인하세요

컨텍스트 길이 131,072 토큰, nvidia 에서 제공

131,072

컨텍스트 토큰

$0.60

프롬프트 가격

$1.80

출력 가격

8/16

기능 지원

모델 소개

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural Architecture Search (NAS), resulting in enhanced efficiency, reduced memory usage, and improved inference latency. The model supports a context length of up to 128K tokens and can operate efficiently on an 8x NVIDIA H100 node. Note: you must include `detailed thinking on` in the system prompt to enable reasoning. Please see [Usage Recommendations](https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-v1#quick-start-and-usage-recommendations) for more.

기본 정보

개발사

nvidia

모델 시리즈

Llama3

출시일

2025-04-08

컨텍스트 길이

131,072 토큰

변형

standard

가격 정보

프롬프트 토큰

$0.60 / 1M 토큰

완료 토큰

$1.80 / 1M 토큰

데이터 정책

이용약관 개인정보 처리방침

지원 기능

지원됨 (8)

Top K

Seed

Frequency Penalty

Presence Penalty

Logit Bias

Logprobs

Top Logprobs

추론

미지원 (8)

이미지 입력

Repetition Penalty

Response Format

Min P

도구 사용

구조화된 출력

Web Search Options

Top A

다른 변형

Llama 3.1 Nemotron Ultra 253B v1 (free)

free

무료

실제 사용량 통계

#243

전체 353개 모델 중

228.90M

최근 30일 총 토큰

7.63M

일평균 사용량

45%

주간 사용량 변화

최근 30일 사용량 추이

동일 제작사 모델 (nvidia)

Nemotron Nano 9B V2 (free)

128,000 토큰

무료

상세보기

Nemotron Nano 9B V2

131,072 토큰

$0.04 / $0.16

상세보기

Llama 3.1 Nemotron Nano 8B v1

131,072 토큰

$0.00 / $0.00

상세보기

Llama 3.3 Nemotron Super 49B v1 (free)

131,072 토큰

무료

상세보기

Llama 3.3 Nemotron Super 49B v1

131,072 토큰

$0.00 / $0.00

상세보기

유사 가격대 모델

GLM 4.5V

z-ai

65,536 토큰

$0.50 / $1.80

상세보기

Command R

cohere

128,000 토큰

$0.50 / $1.50

상세보기

Magistral Small 2506

mistralai

40,000 토큰

$0.50 / $1.50

상세보기

Command R (03-2024)

cohere

128,000 토큰

$0.50 / $1.50

상세보기

GPT-3.5 Turbo

openai

16,385 토큰

$0.50 / $1.50

상세보기