fal-ai/stable-audio-3/small/music/text-to-audio

Stable Audio 3 Small Music is a 459 million parameter latent diffusion model that generates full stereo music compositions up to 2 minutes from text prompts, lightweight enough for on-device deployment.
Inference
Commercial use

Prompt examples

Examples are generated using the Stable Audio 3 Small Music Text to Audio. You can customize them by clicking on the "Playground" button.

22-second instrumental cue: velvet trip-hop noir with tremolo guitar, celeste sparkle, and heavy slow drums. Clean stereo mix, memorable motif, no vocals, no lyrics.
num_inference_steps8
guidance_scale1
seed734546
Playground