fal-ai/stable-audio-3/medium/text-to-audio
Stable Audio 3 Medium is a 1.4-billion-parameter latent diffusion model that generates high-quality stereo music up to 6 minutes long from text prompts. It is trained on fully licensed data for safe commercial use.
Inference
Commercial use
Prompt examples
Examples are generated using Stable Audio 3. You can customize them by clicking the "Playground" button.
num_inference_steps: 12
guidance_scale: 1
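As a rough illustration of how the example settings above could be packaged into a request, here is a minimal Python sketch. The endpoint ID and the `num_inference_steps` and `guidance_scale` names come from this page; the `prompt` field name, the `build_arguments` helper, and the validation rules are assumptions made for illustration and are not confirmed by the page.

```python
# Endpoint ID as shown at the top of this page.
ENDPOINT = "fal-ai/stable-audio-3/medium/text-to-audio"


def build_arguments(prompt, num_inference_steps=12, guidance_scale=1.0):
    """Build a request payload using the example settings from this page.

    Hypothetical helper: the "prompt" key and the checks below are
    assumptions, not a documented schema.
    """
    if not prompt or not prompt.strip():
        raise ValueError("prompt must not be empty")
    if num_inference_steps < 1:
        raise ValueError("num_inference_steps must be at least 1")
    return {
        "prompt": prompt.strip(),
        "num_inference_steps": int(num_inference_steps),
        "guidance_scale": float(guidance_scale),
    }


if __name__ == "__main__":
    arguments = build_arguments("Warm lo-fi piano with soft vinyl crackle")
    print(ENDPOINT)
    print(arguments)
```

If the `fal_client` package is installed and an API key is configured, a payload like this would typically be passed as `fal_client.subscribe(ENDPOINT, arguments=arguments)`. Check the endpoint's own schema for the exact input fields before relying on the names assumed here.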