Nemotron Nano 9B V2 Check detailed information and pricing for AI models

Context Length 131,072 tokens, nvidia from provided

131,072
Context Tokens
$0.04
Prompt Price
$0.16
Output Price
9/16
Feature Support

Model Overview

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Basic Information

Developer
nvidia
Model Series
Other
Release Date
2025-09-05
Context Length
131,072 tokens
Variant
standard

Pricing Information

Prompt Tokens
$0.04 / 1M tokens
Completion Tokens
$0.16 / 1M tokens

Supported Features

Supported (9)

Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Response Format
Min P
Tool Usage
Reasoning

Unsupported (7)

Image Input
Logit Bias
Logprobs
Top Logprobs
Structured Outputs
Web Search Options
Top A

Other Variants

Actual Usage Statistics

No recent usage data available.

Models by Same Author (nvidia)

Llama 3.1 Nemotron Nano 8B v1
131,072 tokens
$0.00 / $0.00
Llama 3.3 Nemotron Super 49B v1 (free)
131,072 tokens
Free
Llama 3.3 Nemotron Super 49B v1
131,072 tokens
$0.00 / $0.00
Llama 3.1 Nemotron Ultra 253B v1 (free)
131,072 tokens
Free
Llama 3.1 Nemotron Ultra 253B v1
131,072 tokens
$0.60 / $1.80

Similar Price Range Models

Skyfall 36B V2
thedrummer
32,768 tokens
$0.04 / $0.16
Mistral Small 3.1 24B
mistralai
131,072 tokens
$0.04 / $0.15
Mistral Small 3
mistralai
32,768 tokens
$0.04 / $0.15
Command R7B (12-2024)
cohere
128,000 tokens
$0.04 / $0.15
gpt-oss-20b
openai
131,000 tokens
$0.04 / $0.15