deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2

Deepinfra text

Input: $0.0400 per 1M input tokens
Output: $0.1600 per 1M output tokens
Cache read: n/a (per 1M cached read tokens)
Cache write: n/a (per 1M cache write tokens)
Context window: 131,072 tokens
Max output: 131,072 tokens
Effective date: Jun 15, 2026

Estimate a workload

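At this model's rates, a workload costs (input tokens × $0.04 + output tokens × $0.16) / 1,000,000. For example, 1,000,000 input tokens and 1,000,000 output tokens cost $0.04 + $0.16 = $0.20.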
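The estimate can also be reproduced offline. The following is a minimal sketch, assuming the listed rates of $0.04 and $0.16 per 1M tokens still apply and that no cached-token pricing is involved; `estimate_cost` is a hypothetical helper name, not part of any provider SDK.

```python
from decimal import Decimal

# Rates copied from this listing, in USD per 1M tokens.
# Assumption: they are still current; check the provider's pricing page.
INPUT_USD_PER_M = Decimal("0.04")
OUTPUT_USD_PER_M = Decimal("0.16")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one workload (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # 2M input tokens and 500k output tokens.
    print(f"${estimate_cost(2_000_000, 500_000):.4f}")
```

`Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise when summed over many requests.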

Price history

Blended $ per 1M tokens (3:1 input-to-output ratio)

One price on record so far (Jun 15, 2026). This chart fills in as the price changes over time. We check daily.

Effective     Input  Output  Blended
Jun 15, 2026  $0.04  $0.16   $0.07
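The blended figure is consistent with a weighted average of three parts input price to one part output price. Reading "(3:1)" this way is an assumption, though it does reproduce the $0.07 shown. A short sketch of that arithmetic, with `blended_price` as a hypothetical helper:

```python
from decimal import Decimal


def blended_price(input_rate: Decimal, output_rate: Decimal,
                  input_parts: int = 3, output_parts: int = 1) -> Decimal:
    """Weighted average price per 1M tokens (hypothetical helper).

    Assumption: the 3:1 ratio means 3 input tokens per 1 output token.
    """
    total_parts = input_parts + output_parts
    weighted = input_parts * input_rate + output_parts * output_rate
    return weighted / total_parts


if __name__ == "__main__":
    # (3 * 0.04 + 1 * 0.16) / 4
    print(blended_price(Decimal("0.04"), Decimal("0.16")))
```

Under the opposite reading (three parts output to one part input) the same function gives $0.13, which does not match the table.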

Source: litellm. Confirm against the provider's official pricing before relying on these figures.