fal-ai/silero-vad

Detect speech presence and timestamps with accuracy and speed using the ultra-lightweight Silero VAD model
Inference
Commercial use

Input

Result

Idle
{
  "has_speech": true,
  "timestamps": [
    {
      "start": 0.13,
      "end": 1.982
    },
    {
      "start": 2.434,
      "end": 3.998
    }
  ]
}

What would you like to do next?

Your request will cost $0.00001 per second.

Logs