ElevenLabs

The Voice of AI

One of the most natural-sounding AI voice models. Text-to-speech, music generation, dubbing, and transcription, all through a single serverless API.

One API. Every Audio Modality.

Text to Speech

Voices That Sound Human

Eleven V3 produces speech with natural intonation, emotional nuance, and accurate pacing. Over 20 built-in voices across narration, conversational, character, and broadcast styles. Turbo v2.5 delivers under 75ms latency for real-time applications.

Music & Sound

Studio-Quality Audio from Text

Generate full music tracks in any genre from a text description. Control structure with section prompts, choose vocal or instrumental mode, and export in 19 formats.

Speech to Text

99-Language Transcription

Scribe v2 transcribes audio in 99 languages with word-level timestamps, speaker diarization, and audio event detection. At $0.008 per minute, it is one of the most cost-effective transcription APIs available.
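As a rough illustration of how word-level timestamps and speaker diarization can be combined, the sketch below merges consecutive words from the same speaker into turns. The `TranscribedWord` shape and its field names are assumptions made for this example, not the documented Scribe response schema, so check the actual API output before relying on them.

```typescript
// Hypothetical shape of one transcribed word. The field names are
// assumptions for illustration; verify them against the real response.
interface TranscribedWord {
  text: string;
  start: number; // start time in seconds
  end: number; // end time in seconds
  speakerId: string;
}

// One uninterrupted stretch of speech from a single speaker.
interface SpeakerTurn {
  speakerId: string;
  start: number;
  end: number;
  text: string;
}

// Merge consecutive words that share a speaker into a single turn.
function groupBySpeaker(words: TranscribedWord[]): SpeakerTurn[] {
  const turns: SpeakerTurn[] = [];
  for (const word of words) {
    const last = turns[turns.length - 1];
    if (last !== undefined && last.speakerId === word.speakerId) {
      // Same speaker as the previous word: extend the current turn.
      last.text += " " + word.text;
      last.end = word.end;
    } else {
      // Speaker changed (or this is the first word): start a new turn.
      turns.push({
        speakerId: word.speakerId,
        start: word.start,
        end: word.end,
        text: word.text,
      });
    }
  }
  return turns;
}
```

Each resulting turn carries the start time of its first word and the end time of its last, which is usually enough to render a speaker-labeled transcript or subtitles.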

Examples

Hear what ElevenLabs can create

Hit play on any example below, then copy the prompt and try it yourself in the TTS or Music playground.

Text to Speech

Natural dialogue with emotion & pacing

"The old lighthouse keeper adjusted his glasses and peered through the storm. 'Thirty years,' he muttered, 'and the sea still finds new ways to surprise me.'"

Music Generation

Lo-fi jazz hip-hop with texture & mood

"Lo-fi jazz hip-hop with a dusty vinyl crackle, mellow Rhodes piano chords, a slow boom-bap drum pattern, and a wandering saxophone melody that feels like 2am in a coffee shop"

Text to Speech

Broadcast narration with professional tone

"Breaking news update: financial markets responded positively to the announcement, with major indices climbing throughout the afternoon session. Analysts expect continued momentum into next week."

Music Generation

Dark cinematic trailer score

"Dark cinematic trailer music with deep brass stabs, thunderous taiko drums, a rising tension string ostinato, and a massive sub-bass drop that hits like a shockwave"

API Documentation

How to get access to the ElevenLabs API

The client handles the request lifecycle for you: it submits the job, delivers status updates while it runs, and returns the final result when generation completes.

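// Import the fal client (install the @fal-ai/client package first).
import { fal } from "@fal-ai/client";

// subscribe() submits the request, waits while it is queued and running,
// and resolves once generation completes.
const result = await fal.subscribe("fal-ai/elevenlabs/tts/eleven-v3", {
  input: {
    text: "Hello from fal.ai!",
    voice: "Sarah",
  },
  logs: true, // include log lines in queue status updates
  onQueueUpdate: (update) => {
    // Print log messages while the request is being processed.
    if (update.status === "IN_PROGRESS") {
      update.logs.map((log) => log.message).forEach(console.log);
    }
  },
});

console.log(result.data); // the model output
console.log(result.requestId); // ID of this request

FAQ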

Common questions about ElevenLabs

What ElevenLabs models are available on fal.ai?

fal.ai offers the full ElevenLabs audio suite: Eleven V3 (latest TTS), Turbo v2.5 (low-latency TTS), Multilingual v2 (29 languages), text-to-dialogue, music generation, speech-to-text (Scribe v1 & v2), voice changer, and dubbing.

How natural does the text-to-speech sound?

ElevenLabs is widely regarded as one of the most natural-sounding AI voice platforms. Eleven V3, its latest model, produces speech with nuanced intonation, emotional awareness, and natural pacing. It supports over 20 built-in voices across four styles: narration, conversational, character, and broadcast.

What languages are supported?

Multilingual v2 supports 29 languages with high accent accuracy. Turbo v2.5 supports 32 languages with the lowest latency. Scribe v2 transcribes 99 languages. Dubbing auto-translates audio into Spanish, French, German, Japanese, Portuguese, and Chinese.

Can I generate music?

Yes. ElevenLabs Music generates studio-quality tracks from text prompts in any genre. It supports both vocal and instrumental modes, section control for structuring compositions, and 19 output formats. Pricing is $0.80 per minute of generated audio.

How much does ElevenLabs cost on fal.ai?

TTS: $0.10 per 1,000 characters (Eleven V3, Multilingual v2) or $0.05 per 1,000 characters (Turbo v2.5). Speech-to-text: $0.008/min (Scribe v2) or $0.03/min (Scribe v1). Music: $0.80/min. Voice changer: $0.30/min. Dubbing: $0.90/min. Pay-per-use with no minimums.
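To make the arithmetic concrete, here is a minimal cost-estimator sketch built from the prices listed above. The key names (`"eleven-v3"`, `"scribe-v2"`, and so on) are illustrative labels rather than fal.ai endpoint IDs, and the sketch assumes usage is billed linearly with no rounding to whole units, which the pricing text does not specify.

```typescript
// Prices in USD, copied from the FAQ answer above.
// Keys are illustrative labels, not fal.ai endpoint IDs.
const PRICE_PER_1K_CHARS = {
  "eleven-v3": 0.1,
  "multilingual-v2": 0.1,
  "turbo-v2.5": 0.05,
} as const;

const PRICE_PER_MINUTE = {
  "scribe-v2": 0.008,
  "scribe-v1": 0.03,
  "music": 0.8,
  "voice-changer": 0.3,
  "dubbing": 0.9,
} as const;

type TtsModel = keyof typeof PRICE_PER_1K_CHARS;
type AudioService = keyof typeof PRICE_PER_MINUTE;

// Reject negative or non-finite usage figures.
function assertUsage(value: number, label: string): void {
  if (!Number.isFinite(value) || value < 0) {
    throw new RangeError(label + " must be a non-negative finite number");
  }
}

// Text-to-speech is priced per 1,000 input characters.
// Assumption: linear proration, no rounding to whole blocks.
function estimateTtsCost(model: TtsModel, characters: number): number {
  assertUsage(characters, "characters");
  return (characters / 1000) * PRICE_PER_1K_CHARS[model];
}

// All other services are priced per minute of audio.
// Assumption: linear proration, no rounding to whole minutes.
function estimateAudioCost(service: AudioService, minutes: number): number {
  assertUsage(minutes, "minutes");
  return minutes * PRICE_PER_MINUTE[service];
}
```

For example, 12,000 characters on Turbo v2.5 works out to 12 × $0.05 = $0.60, and an hour of Scribe v2 transcription to 60 × $0.008 = $0.48. Results are floating-point values, so round them before displaying.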

How do I get started with the API?

Install the fal.ai SDK (Python or JavaScript), grab an API key from your dashboard, and make your first request in a few lines of code. The API is serverless, so there is no infrastructure to set up. Check the API documentation for all available parameters and voice options.

Can I use ElevenLabs for commercial projects?

Yes. Content generated through the fal.ai API can be used in commercial projects. Check fal.ai's terms of service for full details on usage rights and licensing.

Get in touch about ElevenLabs

Want to learn more about integrating ElevenLabs into your workflow? Leave your details and our team will reach out.

Contact Sales