fal-ai/stable-audio-3/small/music/text-to-audio

Stable Audio 3 Small Music is a 459 million parameter latent diffusion model that generates full stereo music compositions up to 2 minutes from text prompts, lightweight enough for on-device deployment.

Inference

Commercial use

Schema

LLMs

Playground API Examples

Prompt examples

Examples are generated using the Stable Audio 3 Small Music Text to Audio. You can customize them by clicking on the "Playground" button.

22-second instrumental cue: velvet trip-hop noir with tremolo guitar, celeste sparkle, and heavy slow drums. Clean stereo mix, memorable motif, no vocals, no lyrics.

num_inference_steps8

guidance_scale1

seed734546

Playground