Llama 3.1 Nemotron Nano 8B v1 Detaillierte Informationen und Preise für AI-Modelle anzeigen

Kontext Länge 131,072 Token, nvidia von bereitgestellt

131,072

Kontext-Token

$0.00

Prompt-Preis

$0.00

Ausgabepreis

0/16

Funktionsunterstützung

Modell-Übersicht

Llama-3.1-Nemotron-Nano-8B-v1 is a compact large language model (LLM) derived from Meta's Llama-3.1-8B-Instruct, specifically optimized for reasoning tasks, conversational interactions, retrieval-augmented generation (RAG), and tool-calling applications. It balances accuracy and efficiency, fitting comfortably onto a single consumer-grade RTX GPU for local deployment. The model supports extended context lengths of up to 128K tokens. Note: you must include `detailed thinking on` in the system prompt to enable reasoning. Please see [Usage Recommendations](https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-v1#quick-start-and-usage-recommendations) for more.

Grundinformationen

Entwickler

nvidia

Modellserie

Other

Veröffentlichungsdatum

2025-04-08

Kontextlänge

131,072 Token

Variante

standard

Preisinformationen

Prompt-Token

$0.00 / 1M Token

Vervollständigungs-Token

$0.00 / 1M Token

Unterstützte Funktionen

Nicht unterstützt (16)

Bildeingabe

Top K

Seed

Häufigkeitsstrafe

Presence Penalty

Wiederholungsstrafe

Antwortformat

Min P

Logit-Bias

Tool-Nutzung

Logprobs

Top Logprobs

Strukturierte Ausgaben

Schlussfolgerung

Web-Suchoptionen

Top A

Tatsächliche Nutzungsstatistiken

Keine aktuellen Nutzungsdaten verfügbar.

Modelle desselben Autors (nvidia)

Nemotron Nano 9B V2 (free)

128,000 Token

Kostenlos

Details anzeigen

Nemotron Nano 9B V2

131,072 Token

$0.04 / $0.16

Details anzeigen

Llama 3.3 Nemotron Super 49B v1 (free)

131,072 Token

Kostenlos

Details anzeigen

Llama 3.3 Nemotron Super 49B v1

131,072 Token

$0.00 / $0.00

Details anzeigen