fal-ai/stable-audio-3/medium/text-to-audio

Stable Audio 3 Medium is a 1.4-billion-parameter latent diffusion model that generates high-quality stereo music, up to 6 minutes long, from text prompts. It is trained on fully licensed data for safe commercial use.

Prompt examples

Examples are generated using Stable Audio 3. You can customize them in the Playground.

A cinematic piano and string sketch with gentle pulses, hopeful harmony, and spacious reverb.
num_inference_steps: 12
guidance_scale: 1
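
The example settings can also be sent programmatically. The following is a minimal sketch, not an official snippet. It assumes the `fal-client` Python package is installed and a `FAL_KEY` environment variable is set. The parameter names `num_inference_steps` and `guidance_scale` come from this page; the `prompt` field name, the helper names `build_arguments` and `generate`, and the validation rules are assumptions made for illustration. The shape of the response is not documented here, so the sketch returns it unmodified.

```python
from typing import Any, Dict

# Model identifier as shown at the top of this page.
MODEL_ID = "fal-ai/stable-audio-3/medium/text-to-audio"


def build_arguments(
    prompt: str,
    num_inference_steps: int = 12,
    guidance_scale: float = 1.0,
) -> Dict[str, Any]:
    """Build the request payload (hypothetical helper).

    Defaults mirror the example settings on this page. The "prompt"
    key is an assumption about the endpoint's input schema.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if num_inference_steps < 1:
        raise ValueError("num_inference_steps must be at least 1")
    return {
        "prompt": prompt,
        "num_inference_steps": num_inference_steps,
        "guidance_scale": guidance_scale,
    }


def generate(prompt: str, **overrides: Any) -> Any:
    """Submit the request and wait for the result (hypothetical helper).

    fal_client is imported lazily so build_arguments can be used and
    tested without the package installed or network access.
    """
    import fal_client  # pip install fal-client; reads FAL_KEY from the environment

    return fal_client.subscribe(MODEL_ID, arguments=build_arguments(prompt, **overrides))
```

Calling `generate("A cinematic piano and string sketch with gentle pulses, hopeful harmony, and spacious reverb.")` would submit the example prompt with the settings shown above; pass `num_inference_steps=...` or `guidance_scale=...` to override them.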