Llama 3.1 Nemotron 70B Instruct Check detailed information and pricing for AI models
Context Length 131,072 tokens, nvidia from provided
131,072
Context Tokens
$0.12
Prompt Price
$0.30
Output Price
11/16
Feature Support
Model Overview
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).
Basic Information
Developer
nvidia
Model Series
Llama3
Release Date
2024-10-15
Context Length
131,072 tokens
Max Completion Tokens
131,072 tokens
Variant
standard
Pricing Information
Prompt Tokens
$0.12 / 1M tokens
Completion Tokens
$0.30 / 1M tokens
Data Policy
Supported Features
Supported (11)
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Response Format
Min P
Logit Bias
Tool Usage
Logprobs
Top Logprobs
Unsupported (5)
Image Input
Structured Outputs
Reasoning
Web Search Options
Top A
Actual Usage Statistics
#153
Out of 345 total models
924.96M
Total Tokens Last 30 Days
30.83M
Daily Average Usage
2%
Weekly Usage Change