Skip to content

Nvidia Nemotron Nano 12B V2 VL

Nvidia Nemotron Nano 12B V2 VL is NVIDIA's open 12B multimodal reasoning model with a hybrid Mamba-Transformer architecture, OCRBenchV2 results, and specialized support for document intelligence, video understanding, and RAG pipelines.

ReasoningTool UseVision (Image)
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'nvidia/nemotron-nano-12b-v2-vl',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
DeepInfra
131K
0.1s
298tps
$0.20/M$0.60/M
+1
12/01/2024
Amazon Bedrock
131K
0.2s
$0.20/M$0.60/M
+1
12/01/2024