fal-ai/scene-finder

Search any video with a text prompt - Scene Finder locates the matching moments and returns their time segments and extracted frames.
Inference
Commercial use

Input

Type # to reference inputs.

Additional Settings

Customize your input with more control.

Result

Idle
{
  "segments": [
    {
      "start": 10.4,
      "end": 12.5
    }
  ],
  "images": [
    {
      "url": "",
      "content_type": "image/png",
      "file_name": "z9RV14K95DvU.png",
      "file_size": 4404019,
      "width": 1024,
      "height": 1024
    }
  ]
}

What would you like to do next?

Scene Finder is billed by the length of the input video — $0.0042 per second of input video.

Logs