fal-ai/stable-audio-3/medium/base/text-to-audio

Stable Audio 3 Medium Base is the foundational 1.4 billion parameter text-to-audio checkpoint generating stereo music up to 6 minutes, intended as the unmodified base for custom fine-tuning workflows.
Inference
Commercial use

Prompt examples

Examples are generated using the Stable Audio 3 Medium Base Text to Audio. You can customize them by clicking on the "Playground" button.

A hopeful cinematic piano piece that slowly opens into strings and subtle electronic percussion. Keep the character smooth, warm, and believable. Give it a clear coastal road feeling.
num_inference_steps25
guidance_scale7
Playground