fal-ai/stable-audio-3/small/music/text-to-audio
Stable Audio 3 Small Music is a 459-million-parameter latent diffusion model that generates full stereo music compositions of up to 2 minutes from text prompts. It is lightweight enough for on-device deployment.
Inference
Commercial use
Prompt examples
Examples are generated with the Stable Audio 3 Small Music text-to-audio model. To customize one, click the "Playground" button.
22-second instrumental cue: velvet trip-hop noir with tremolo guitar, celeste sparkle, and heavy slow drums. Clean stereo mix, memorable motif, no vocals, no lyrics.
num_inference_steps: 8
guidance_scale: 1
seed: 734546
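The example settings above can be reproduced outside the Playground. The following is a minimal sketch, not the official integration: it assumes the endpoint accepts a `prompt` field alongside the `num_inference_steps`, `guidance_scale`, and `seed` parameters listed above, and that the `fal-client` Python package is installed with a `FAL_KEY` environment variable set. `build_arguments` and `generate` are hypothetical helper names introduced here for illustration. The response format is not described on this page, so the result is returned as-is.

```python
# Endpoint ID as shown at the top of this page.
MODEL_ID = "fal-ai/stable-audio-3/small/music/text-to-audio"


def build_arguments(prompt, num_inference_steps=8, guidance_scale=1, seed=None):
    """Assemble a request payload mirroring the example's settings.

    The "prompt" key is an assumption; the other keys are the parameter
    names listed with the example above.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if num_inference_steps < 1:
        raise ValueError("num_inference_steps must be at least 1")
    arguments = {
        "prompt": prompt,
        "num_inference_steps": num_inference_steps,
        "guidance_scale": guidance_scale,
    }
    # Only send a seed when one is given, so unseeded runs stay random.
    if seed is not None:
        arguments["seed"] = seed
    return arguments


def generate(arguments):
    """Submit the request and wait for the result (needs network + FAL_KEY)."""
    # Imported lazily so the payload helper works without the package.
    import fal_client

    return fal_client.subscribe(MODEL_ID, arguments=arguments)


# Payload for the example shown above.
EXAMPLE_ARGUMENTS = build_arguments(
    prompt=(
        "22-second instrumental cue: velvet trip-hop noir with tremolo guitar, "
        "celeste sparkle, and heavy slow drums. Clean stereo mix, memorable "
        "motif, no vocals, no lyrics."
    ),
    seed=734546,
)
```

Calling `generate(EXAMPLE_ARGUMENTS)` would submit the request. Keeping the seed fixed is the usual way to make a run repeatable, while omitting it lets each run vary.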