google/gemini-omni-flash/edit

Edits generated video across multiple conversational turns while preserving scene coherence. Applies iterative changes through natural-language instructions without regenerating the full sequence from scratch.

Learn more about Gemini Omni Flash

Inference

Commercial use

Partner

Schema

LLMs

Playground API Examples

Input

Prompt*

Type # to reference inputs.

Video URL*

Hint: Drag and drop video files from your computer, video from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: mp4, mov, webm, m4v, gif

Result

Idle

What would you like to do next?

Download

{
  "video": {
    "url": "https://v3b.fal.media/files/b/0aa06543/gOhvwdhScjGzIWSH3M6Hx_e5b3e0d6942d426da3b9c7fb8deb9375.mp4",
    "content_type": "video/mp4",
    "file_name": "e5b3e0d6942d426da3b9c7fb8deb9375.mp4",
    "file_size": 1972639
  }
}

Billing is based on total token consumption. Input tokens (text/audio/video) cost $1.875 per 1 million tokens. Output tokens cost $21.875 per 1 million tokens. For 720p video this costs approximately $0.13 per second of video.

google/gemini-omni-flash/edit

Input

Result

What would you like to do next?

Logs