Nemotron Nano 9B V2 (free) Check detailed information and pricing for AI models

Context Length 128,000 tokens, nvidia from provided

128,000
Context Tokens
Free
Prompt Price
Free
Output Price
4/16
Feature Support

Model Overview

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Basic Information

Developer
nvidia
Model Series
Other
Release Date
2025-09-05
Context Length
128,000 tokens
Variant
free

Pricing Information

This model is free to use

Supported Features

Supported (4)

Response Format
Tool Usage
Structured Outputs
Reasoning

Unsupported (12)

Image Input
Top K
Seed
Frequency Penalty
Presence Penalty
Repetition Penalty
Min P
Logit Bias
Logprobs
Top Logprobs
Web Search Options
Top A

Other Variants

Actual Usage Statistics

No recent usage data available.

Models by Same Author (nvidia)

Llama 3.1 Nemotron Nano 8B v1
131,072 tokens
$0.00 / $0.00
Llama 3.3 Nemotron Super 49B v1 (free)
131,072 tokens
Free
Llama 3.3 Nemotron Super 49B v1
131,072 tokens
$0.00 / $0.00
Llama 3.1 Nemotron Ultra 253B v1 (free)
131,072 tokens
Free
Llama 3.1 Nemotron Ultra 253B v1
131,072 tokens
$0.60 / $1.80