Loading...
Loading...
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.
Price/1M
$0.07
208th cheapest
77% below median
Top 31%
Context Window
131K
145th largest
Top 63%
Input
$0.04
per 1M tokens
Output
$0.16
per 1M tokens
Blended
$0.07
per 1M tokens
Cheaper than 69% of models. Median price is $0.31/1M tokens.
Daily
$0.07
Monthly
$2.10
Context Window
131K
tokens
Larger than 37% of models
Context Window Comparison