Nemotron-4 340B Instruct: AI model details and pricing

Context length: 4,096 tokens. Provided by nvidia.

4,096

Context tokens

$0.00

Prompt price

$0.00

Output price

0/16

Features supported

Model introduction

Nemotron-4-340B-Instruct is an English-language chat model optimized for synthetic data generation. This large language model (LLM) is a fine-tuned version of Nemotron-4-340B-Base, designed for single- and multi-turn chat use cases with a 4,096-token context length. The base model was pre-trained on 9 trillion tokens drawn from diverse English texts, 50+ natural languages, and 40+ coding languages.

The instruct model underwent additional alignment steps:

1. Supervised Fine-tuning (SFT)
2. Direct Preference Optimization (DPO)
3. Reward-aware Preference Optimization (RPO)

The alignment process used approximately 20K human-annotated samples, while 98% of the fine-tuning data was synthetically generated. Detailed information about the synthetic data generation pipeline is available in the [technical report](https://arxiv.org/html/2406.11704v1).

Basic information

Developer

nvidia

Model family

Other

Release date

2024-06-23

Context length

4,096 tokens

Variant

standard
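The 4,096-token context length listed above has to cover both the prompt and the generated completion. A minimal sketch of that budget arithmetic follows; the function name and the idea of passing in an already-computed prompt token count are illustrative assumptions, not part of any provider API.

```python
CONTEXT_LENGTH = 4096  # context length from the listing above


def max_completion_tokens(prompt_tokens: int,
                          context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens remain for the completion.

    `prompt_tokens` is assumed to be counted with the model's own
    tokenizer; this helper (a hypothetical name) only does the subtraction.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # A prompt at or beyond the limit leaves no room for output.
    return max(context_length - prompt_tokens, 0)


if __name__ == "__main__":
    print(max_completion_tokens(1000))
```

A prompt that already fills or exceeds the window yields 0, which a caller can treat as a signal to shorten the prompt before sending the request.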

Pricing information

Prompt tokens

$0.00 / 1M tokens

Completion tokens

$0.00 / 1M tokens
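Prices here are quoted per one million tokens, so the cost of a single request is the token counts scaled by those rates. The sketch below shows the arithmetic with the listed $0.00 rates as defaults; the function and constant names are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens (both $0.00 on this page).
PROMPT_PRICE_PER_M = Decimal("0.00")
COMPLETION_PRICE_PER_M = Decimal("0.00")


def estimate_cost(prompt_tokens: int,
                  completion_tokens: int,
                  prompt_price: Decimal = PROMPT_PRICE_PER_M,
                  completion_price: Decimal = COMPLETION_PRICE_PER_M) -> Decimal:
    """Estimate the USD cost of one request from per-1M-token prices."""
    per_million = Decimal(1_000_000)
    total = (Decimal(prompt_tokens) * prompt_price
             + Decimal(completion_tokens) * completion_price)
    return total / per_million


if __name__ == "__main__":
    print(estimate_cost(1000, 500))
```

`Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding error when other rates are substituted.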

Supported features

Not supported (16)

Image input

Top K

Seed

Frequency penalty

Presence penalty

Repetition penalty

Response format

Min P

Logit bias

Tool use

Logprobs

Top Logprobs

Structured outputs

Reasoning

Web search options

Top A
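Since every feature above is listed as not supported, a client may want to drop the matching request parameters before calling the model. The following is a minimal sketch of that filtering step. The parameter key names are assumptions modeled on common chat-completion request fields and are not confirmed by this page; image input is omitted because it is a message content type rather than a request parameter.

```python
# Hypothetical request keys corresponding to the unsupported features above.
UNSUPPORTED_PARAMS = frozenset({
    "top_k", "seed", "frequency_penalty", "presence_penalty",
    "repetition_penalty", "response_format", "min_p", "logit_bias",
    "tools", "logprobs", "top_logprobs", "structured_outputs",
    "reasoning", "web_search_options", "top_a",
})


def strip_unsupported(params: dict) -> tuple[dict, list]:
    """Split request params into (kept, dropped_keys).

    Keys in UNSUPPORTED_PARAMS are removed; everything else passes through.
    """
    kept = {k: v for k, v in params.items() if k not in UNSUPPORTED_PARAMS}
    dropped = sorted(k for k in params if k in UNSUPPORTED_PARAMS)
    return kept, dropped


if __name__ == "__main__":
    kept, dropped = strip_unsupported(
        {"temperature": 0.7, "top_k": 40, "seed": 1}
    )
    print(kept, dropped)
```

Returning the dropped keys alongside the filtered parameters lets the caller log or warn about what was removed instead of discarding it silently.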

Actual usage statistics