google/gemini-omni-flash/image-to-video

Animates a still image into video with audio. Extends a single frame into coherent motion, grounded in Gemini's physical understanding of how scenes and subjects behave.
Inference
Commercial use
Partner

Input

Type # to reference inputs.

Result

Idle

What would you like to do next?

Billing is based on total token consumption. Input tokens (text/audio/video) cost $1.875 per 1 million tokens. Output tokens cost $21.875 per 1 million tokens. For 720p video this costs approximately $0.13 per second of video.

Logs