
ElevenLabs: The Voice of AI

One of the most natural-sounding AI voice models. Text-to-speech, music generation, dubbing, and transcription, all through a single serverless API.



One API. Every Audio Modality.

Text to Speech

Voices That Sound Human

Eleven V3 produces speech with natural intonation, emotional nuance, and accurate pacing. It includes over 20 built-in voices across narration, conversational, character, and broadcast styles. Turbo v2.5 delivers latency under 75 ms for real-time applications.

Music & Sound

Studio-Quality Audio from Text

Generate full music tracks in any genre from a text description. Control structure with section prompts, choose vocal or instrumental mode, and export in 19 formats.

Speech to Text

99-Language Transcription

Scribe v2 transcribes audio in 99 languages with word-level timestamps, speaker diarization, and audio event detection. At $0.008 per minute, it is one of the most cost-effective transcription APIs available.


Examples

Hear what ElevenLabs can create

Hit play on any example below, then copy the prompt and try it yourself in the TTS or Music playground.

Text to Speech
Natural dialogue with emotion & pacing

"The old lighthouse keeper adjusted his glasses and peered through the storm. 'Thirty years,' he muttered, 'and the sea still finds new ways to surprise me.'"

Music Generation
Lo-fi jazz hip-hop with texture & mood

"Lo-fi jazz hip-hop with a dusty vinyl crackle, mellow Rhodes piano chords, a slow boom-bap drum pattern, and a wandering saxophone melody that feels like 2am in a coffee shop"

Text to Speech
Broadcast narration with professional tone

"Breaking news update: financial markets responded positively to the announcement, with major indices climbing throughout the afternoon session. Analysts expect continued momentum into next week."

Music Generation
Dark cinematic trailer score

"Dark cinematic trailer music with deep brass stabs, thunderous taiko drums, a rising tension string ostinato, and a massive sub-bass drop that hits like a shockwave"

For Developers

A few lines of code.
Lifelike audio.

fal.ai handles the infrastructure: fast inference, auto-scaling, and a developer-friendly API. No GPUs to manage.

  • Serverless: scales to zero, scales to millions
  • Pay per use, no minimums
  • Python and JavaScript SDKs, plus REST API

import fal_client

# Call the Eleven V3 text-to-speech endpoint with a prompt and a built-in voice.
result = fal_client.run(
    "fal-ai/elevenlabs/tts/eleven-v3",
    arguments={
        "text": "Hello from fal.ai!",
        "voice": "Sarah",
    },
)

# result["audio"]["url"] → your generated audio

FAQ

Common questions about ElevenLabs

What ElevenLabs models are available on fal.ai?

fal.ai offers the full ElevenLabs audio suite: Eleven V3 (latest TTS), Turbo v2.5 (low-latency TTS), Multilingual v2 (29 languages), text-to-dialogue, music generation, speech-to-text (Scribe v1 & v2), voice changer, and dubbing.

How natural does the text-to-speech sound?

ElevenLabs is widely regarded as one of the most natural-sounding AI voice platforms. Eleven V3, its latest model, produces speech with nuanced intonation, emotional awareness, and natural pacing. It supports over 20 built-in voices across narration, conversational, character, and broadcast styles.

What languages are supported?

Multilingual v2 supports 29 languages with high accent accuracy. Turbo v2.5 supports 32 languages with the lowest latency. Scribe v2 transcribes 99 languages. Dubbing auto-translates audio into Spanish, French, German, Japanese, Portuguese, and Chinese.

Can I generate music?

Yes. ElevenLabs Music generates studio-quality tracks from text prompts in any genre. It supports both vocal and instrumental modes, section control for structuring compositions, and 19 output formats. Pricing is $0.80 per minute of generated audio.

How much does ElevenLabs cost on fal.ai?

Pricing is pay-per-use with no minimums:

  • TTS: $0.10 per 1,000 characters (Eleven V3, Multilingual v2) or $0.05 per 1,000 characters (Turbo v2.5)
  • Speech-to-text: $0.008/min (Scribe v2) or $0.03/min (Scribe v1)
  • Music: $0.80/min
  • Voice changer: $0.30/min
  • Dubbing: $0.90/min
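
For budgeting, the rates above can be turned into a rough estimator. This is a minimal sketch, not an official calculator: the dictionary keys are hypothetical labels chosen for this example (not API identifiers), and it assumes billing scales linearly with characters or minutes with no rounding, which the page does not state.

```python
from decimal import Decimal

# Prices in USD as quoted in this FAQ; confirm current rates on fal.ai.
# The keys below are illustrative labels, not endpoint names.
TTS_PRICE_PER_1K_CHARS = {
    "eleven-v3": Decimal("0.10"),
    "multilingual-v2": Decimal("0.10"),
    "turbo-v2.5": Decimal("0.05"),
}
PRICE_PER_MINUTE = {
    "scribe-v2": Decimal("0.008"),
    "scribe-v1": Decimal("0.03"),
    "music": Decimal("0.80"),
    "voice-changer": Decimal("0.30"),
    "dubbing": Decimal("0.90"),
}


def tts_cost(model: str, text: str) -> Decimal:
    """Estimate the cost of synthesizing `text` (assumes linear per-character billing)."""
    return TTS_PRICE_PER_1K_CHARS[model] * len(text) / 1000


def audio_cost(service: str, minutes: float) -> Decimal:
    """Estimate the cost of a per-minute service (assumes no rounding to whole minutes)."""
    return PRICE_PER_MINUTE[service] * Decimal(str(minutes))
```

For example, `tts_cost("eleven-v3", script)` for a 2,500-character script comes to $0.25, and `audio_cost("scribe-v2", 60)` for an hour of audio comes to $0.48 under these assumptions.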

How do I get started with the API?

Install the fal.ai SDK (Python or JavaScript), grab an API key from your dashboard, and make your first request in a few lines of code. The API is serverless, so there is no infrastructure to set up. Check the API documentation for all available parameters and voice options.
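
Those steps might look like the following in Python. This is a sketch under stated assumptions: the endpoint ID and the `text` and `voice` arguments come from the example earlier on this page, the `FAL_KEY` environment variable is how the Python client picks up credentials, and the response is assumed to be a dict shaped like `{"audio": {"url": ...}}`. The helper names `build_tts_arguments` and `synthesize` are made up for illustration.

```python
import os

# Endpoint ID taken from the example earlier on this page.
TTS_ENDPOINT = "fal-ai/elevenlabs/tts/eleven-v3"


def build_tts_arguments(text: str, voice: str = "Sarah") -> dict:
    """Build the request payload; rejects empty text before any API call is made."""
    if not text.strip():
        raise ValueError("text must not be empty")
    return {"text": text, "voice": voice}


def synthesize(text: str, voice: str = "Sarah") -> str:
    """Call the TTS endpoint and return the URL of the generated audio."""
    if not os.environ.get("FAL_KEY"):
        raise RuntimeError("Set FAL_KEY to your fal.ai API key first")
    # Imported lazily so the payload helper works without the SDK installed
    # (install it with: pip install fal-client).
    import fal_client

    result = fal_client.run(TTS_ENDPOINT, arguments=build_tts_arguments(text, voice))
    # Assumption: the response contains an "audio" object with a "url" field.
    return result["audio"]["url"]
```

With the key exported in your shell, `synthesize("Hello from fal.ai!")` should return a URL you can download or stream. Check the API documentation for the exact response schema.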

Can I use ElevenLabs for commercial projects?

Yes. Content generated through the fal.ai API can be used in commercial projects. Check fal.ai's terms of service for full details on usage rights and licensing.

Ready to build with voice?

Start generating lifelike audio with ElevenLabs on fal.ai.