Llama 3.1 Nemotron Nano 8B v1 查看AI模型的详细信息和价格

上下文长度 131,072 令牌， nvidia 来自提供

131,072

上下文令牌

$0.00

提示价格

$0.00

输出价格

0/16

功能支持

模型介绍

Llama-3.1-Nemotron-Nano-8B-v1 is a compact large language model (LLM) derived from Meta's Llama-3.1-8B-Instruct, specifically optimized for reasoning tasks, conversational interactions, retrieval-augmented generation (RAG), and tool-calling applications. It balances accuracy and efficiency, fitting comfortably onto a single consumer-grade RTX GPU for local deployment. The model supports extended context lengths of up to 128K tokens. Note: you must include `detailed thinking on` in the system prompt to enable reasoning. Please see [Usage Recommendations](https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-v1#quick-start-and-usage-recommendations) for more.

基本信息

开发商

nvidia

模型系列

Other

发布日期

2025-04-08

上下文长度

131,072 令牌

变体

standard

价格信息

提示令牌

$0.00 / 1M 令牌

完成令牌

$0.00 / 1M 令牌

支持功能

不支持 (16)

图像输入

Top K

种子

频率惩罚

存在惩罚

重复惩罚

响应格式

Min P

Logit偏置

工具使用

Logprobs

Top Logprobs

结构化输出

推理

网络搜索选项

Top A

实际使用量统计

暂无最近使用量数据。

同作者模型 (nvidia)

Nemotron Nano 9B V2 (free)

128,000 令牌

免费

查看详情

Nemotron Nano 9B V2

131,072 令牌

$0.04 / $0.16

查看详情

Llama 3.3 Nemotron Super 49B v1 (free)

131,072 令牌

免费

查看详情

Llama 3.3 Nemotron Super 49B v1

131,072 令牌

$0.00 / $0.00

查看详情