nvidia/cosmos-3-super/image-to-video

Cosmos3 is a collection of Omnimodal world models capable of generating dynamic, high-quality video, image, audio, and action commands from combinations of text, image, video, and action trajectory inputs.

Learn more about Cosmos

Inference

Commercial use

Schema

LLMs

Playground API Examples

Prompt examples

Examples are generated using the Cosmos 3 Super Image to Video. You can customize them by clicking on the "Playground" button.

lone astronaut in a detailed white spacesuit standing on the gray cratered lunar surface, vivid blue and white Earth rising in the pitch-black starry sky, cinematic lighting, photorealistic, sharp reflections on the visor

num_inference_steps28

guidance_scale6

image_size832x480

Playground