google/gemini-omni-flash/reference-to-video
Generates video with audio from combined multimodal references. Accepts text, images, audio, and video together as input to guide subject, motion, style, and sound in the output.
Inference
Commercial use
Partner
Prompt examples
Examples are generated using the Gemini Omni Flash. You can customize them by clicking on the "Playground" button.