fal-ai/stable-audio-3/medium/text-to-audio

Stable Audio 3 Medium is a 1.4 billion parameter latent diffusion model that generates high-quality stereo music up to 6 minutes from text prompts, trained on fully licensed data for safe commercial use.

Inference

Commercial use

Schema

LLMs

Playground API Examples

Prompt examples

Examples are generated using the Stable Audio 3. You can customize them by clicking on the "Playground" button.

A cinematic piano and string sketch with gentle pulses, hopeful harmony, and spacious reverb.

num_inference_steps12

guidance_scale1

Playground