Nemotron Nano 9B V2 (free) Check detailed information and pricing for AI models

Context Length 128,000 tokens, nvidia from provided

128,000

Context Tokens

Free

Prompt Price

Free

Output Price

4/16

Feature Support

Model Overview

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

Basic Information

Developer

nvidia

Model Series

Other

Release Date

2025-09-05

Context Length

128,000 tokens

Variant

free

Pricing Information

This model is free to use

Data Policy

Supported Features

Supported (4)

Response Format

Tool Usage

Structured Outputs

Reasoning

Unsupported (12)

Image Input

Top K

Seed

Frequency Penalty

Presence Penalty

Repetition Penalty

Min P

Logit Bias

Logprobs

Top Logprobs

Web Search Options

Top A

Other Variants

Nemotron Nano 9B V2

standard

$0.04 / $0.16

Actual Usage Statistics

No recent usage data available.

Models by Same Author (nvidia)

Llama 3.1 Nemotron Nano 8B v1

131,072 tokens

$0.00 / $0.00

View Details

Llama 3.3 Nemotron Super 49B v1 (free)

131,072 tokens

Free

View Details

Llama 3.3 Nemotron Super 49B v1

131,072 tokens

$0.00 / $0.00

View Details

Llama 3.1 Nemotron Ultra 253B v1 (free)

131,072 tokens

Free

View Details

Llama 3.1 Nemotron Ultra 253B v1

131,072 tokens

$0.60 / $1.80

View Details