fal-ai/elevenlabs/speech-to-text/scribe-v2
Use Scribe-V2 from ElevenLabs to do blazingly fast speech to text inferences!
Inference
Commercial use
Partner
Input
Hint: Drag and drop audio files from your computer, audio from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: mp3, ogg, wav, m4a, aac
Additional Settings
Customize your input with more control.
Result
Idle
Hey, this is a test recording for Scribe version two, which is now available on fal.aiWhat would you like to do next?
{
"text": "Hey, this is a test recording for Scribe version two, which is now available on fal.ai",
"language_code": "eng",
"language_probability": 1,
"words": [
{
"text": "Hey,",
"start": 0.079,
"end": 0.539,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 0.539,
"end": 0.599,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "this",
"start": 0.599,
"end": 0.679,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 0.679,
"end": 0.739,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "is",
"start": 0.739,
"end": 0.799,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 0.799,
"end": 0.939,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "a",
"start": 0.939,
"end": 0.939,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 0.939,
"end": 0.959,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "test",
"start": 0.959,
"end": 1.179,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 1.179,
"end": 1.219,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "recording",
"start": 1.22,
"end": 1.719,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 1.719,
"end": 1.719,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "for",
"start": 1.719,
"end": 1.86,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 1.86,
"end": 1.879,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "Scribe",
"start": 1.879,
"end": 2.24,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 2.24,
"end": 2.319,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "version",
"start": 2.319,
"end": 2.759,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 2.759,
"end": 2.779,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "two,",
"start": 2.779,
"end": 3.379,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 3.379,
"end": 3.399,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "which",
"start": 3.399,
"end": 3.519,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 3.519,
"end": 3.539,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "is",
"start": 3.539,
"end": 3.659,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 3.659,
"end": 3.699,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "now",
"start": 3.699,
"end": 3.839,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 3.839,
"end": 3.839,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "available",
"start": 3.839,
"end": 4.319,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 4.319,
"end": 4.339,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "on",
"start": 4.339,
"end": 4.579,
"type": "word",
"speaker_id": "speaker_0"
},
{
"text": " ",
"start": 4.579,
"end": 4.599,
"type": "spacing",
"speaker_id": "speaker_0"
},
{
"text": "fal.ai",
"start": 4.599,
"end": 5.699,
"type": "word",
"speaker_id": "speaker_0"
}
]
}Your request will cost $0.008 per input audio minutes. If keyterm is used, you request will cost %30 more.