nvidia/nemotron-3-nano-omni/video

Video reasoning variant of NVIDIA's Nemotron 3 Nano Omni, a 30B A3B hybrid Transformer-Mamba MoE. It accepts a video plus a prompt and returns text.
Inference
Commercial use

Result

A woman dressed in a navy blue blazer and white blouse stands in a studio setting with large softbox lights visible on either side of her. She begins by spreading her arms wide with palms facing up, then brings her hands together in front of her waist. As she speaks, she gestures with her hands, at one point raising her index finger to emphasize a point before clasping her hands again and smiling at the camera.


Your request will cost $0.006 per 1,000 tokens.
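
As a rough illustration of the pricing, the sketch below estimates the cost of a request from its token count. It assumes billing is strictly linear in tokens with no minimum charge or rounding, which the page does not confirm; `estimate_cost` and `PRICE_PER_1000_TOKENS` are hypothetical names used only for this example.

```python
from decimal import Decimal

# Listed price in USD per 1,000 tokens (taken from the pricing line).
PRICE_PER_1000_TOKENS = Decimal("0.006")


def estimate_cost(tokens: int) -> Decimal:
    """Estimate the USD cost of a request.

    Assumption: cost scales linearly with the token count, with no
    minimum charge and no rounding applied by the provider.
    """
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return PRICE_PER_1000_TOKENS * Decimal(tokens) / Decimal(1000)


if __name__ == "__main__":
    # Print estimates for a few illustrative token counts.
    for count in (1000, 2500, 10000):
        print(count, "tokens ->", estimate_cost(count), "USD")
```

Decimal arithmetic is used here so that small per-token prices are not distorted by binary floating-point rounding.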
