fal-ai/stable-audio-3/medium/text-to-audio

Stable Audio 3 Medium is a 1.4-billion-parameter latent diffusion model that generates high-quality stereo music up to 6 minutes long from text prompts. It is trained on fully licensed data for safe commercial use.

Input

Prompt (required)

Result

{
  "audio": {
    "url": "https://v3b.fal.media/files/b/0a9b9d34/4S7dEpDIOl72FTQS9TPlc_tmpf9z8rn4x.mp3",
    "content_type": "application/octet-stream",
    "file_name": "tmpf9z8rn4x.mp3",
    "file_size": 433884
  },
  "seed": 1525096763,
  "prompt": "A cinematic piano and string sketch with gentle pulses, hopeful harmony, and spacious reverb."
}
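
The result payload above can be read with a few lines of Python. The following is a minimal sketch, not an official client: `AudioResult`, `parse_result`, and `generate` are hypothetical helper names introduced here for illustration. `generate` assumes the `fal-client` package is installed and a `FAL_KEY` environment variable is set, and it assumes `prompt` is the only argument needed, since that is the only required input shown on this page.

```python
from dataclasses import dataclass
from typing import Any

MODEL_ID = "fal-ai/stable-audio-3/medium/text-to-audio"


@dataclass(frozen=True)
class AudioResult:
    """Fields taken from the sample result shown on this page."""
    url: str
    file_name: str
    file_size: int  # size in bytes, as reported by the API
    seed: int
    prompt: str


def parse_result(payload: dict[str, Any]) -> AudioResult:
    """Flatten the nested result payload into a typed record."""
    audio = payload["audio"]
    return AudioResult(
        url=audio["url"],
        file_name=audio["file_name"],
        file_size=int(audio["file_size"]),
        seed=int(payload["seed"]),
        prompt=payload["prompt"],
    )


def generate(prompt: str) -> AudioResult:
    """Submit a prompt and wait for the result.

    Assumption: requires the third-party `fal-client` package and a
    FAL_KEY environment variable. Imported lazily so the parsing code
    above still works without it.
    """
    import fal_client

    result = fal_client.subscribe(MODEL_ID, arguments={"prompt": prompt})
    return parse_result(result)


# The sample payload from this page, used to exercise parse_result offline.
SAMPLE = {
    "audio": {
        "url": "https://v3b.fal.media/files/b/0a9b9d34/4S7dEpDIOl72FTQS9TPlc_tmpf9z8rn4x.mp3",
        "content_type": "application/octet-stream",
        "file_name": "tmpf9z8rn4x.mp3",
        "file_size": 433884,
    },
    "seed": 1525096763,
    "prompt": "A cinematic piano and string sketch with gentle pulses, hopeful harmony, and spacious reverb.",
}

if __name__ == "__main__":
    parsed = parse_result(SAMPLE)
    print(parsed.file_name, parsed.file_size, parsed.seed)
```

Note that the sample reports `content_type` as `application/octet-stream` even though the file is an MP3, so a caller may prefer to infer the format from the file extension rather than from that field.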

Each request costs $0.0376 per generated audio file.
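
For budgeting, the per-audio price quoted above can be multiplied out for a batch. This is a small sketch assuming the price is flat per generated file and does not vary with duration or settings, which this page does not state either way. `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# Price per generated audio file, as quoted on this page (USD).
PRICE_PER_AUDIO = Decimal("0.0376")


def estimate_cost(num_audios: int) -> Decimal:
    """Return the estimated total in USD for a number of generations.

    Assumption: flat per-file pricing with no volume discounts.
    """
    if num_audios < 0:
        raise ValueError("num_audios must be non-negative")
    return PRICE_PER_AUDIO * num_audios


if __name__ == "__main__":
    print(f"${estimate_cost(100)}")
```

Under that assumption, 100 generations would come to $3.76. `Decimal` is used instead of `float` so that currency totals do not pick up binary rounding error.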
