Nemotron-4 340B Instruct 查看AI模型的详细信息和价格

上下文长度 4,096 令牌， nvidia 来自提供

4,096

上下文令牌

$0.00

提示价格

$0.00

输出价格

0/16

功能支持

模型介绍

Nemotron-4-340B-Instruct is an English-language chat model optimized for synthetic data generation. This large language model (LLM) is a fine-tuned version of Nemotron-4-340B-Base, designed for single and multi-turn chat use-cases with a 4,096 token context length. The base model was pre-trained on 9 trillion tokens from diverse English texts, 50+ natural languages, and 40+ coding languages. The instruct model underwent additional alignment steps: 1. Supervised Fine-tuning (SFT) 2. Direct Preference Optimization (DPO) 3. Reward-aware Preference Optimization (RPO) The alignment process used approximately 20K human-annotated samples, while 98% of the data for fine-tuning was synthetically generated. Detailed information about the synthetic data generation pipeline is available in the [technical report](https://arxiv.org/html/2406.11704v1).

基本信息

开发商

nvidia

模型系列

Other

发布日期

2024-06-23

上下文长度

4,096 令牌

变体

standard

价格信息

提示令牌

$0.00 / 1M 令牌

完成令牌

$0.00 / 1M 令牌

支持功能

不支持 (16)

图像输入

Top K

种子

频率惩罚

存在惩罚

重复惩罚

响应格式

Min P

Logit偏置

工具使用

Logprobs

Top Logprobs

结构化输出

推理

网络搜索选项

Top A

实际使用量统计