fal-ai/silero-vad

Detect speech presence and timestamps with accuracy and speed using the ultra-lightweight Silero VAD model
Inference
Commercial use

Input

Result

Idle
{
  "has_speech": true,
  "timestamps": [
    {
      "end": 1.982,
      "start": 0.13
    },
    {
      "end": 3.998,
      "start": 2.434
    }
  ]
}

What would you like to do next?

Your request will cost $0.00001 per second.

Logs