fal-ai/stable-audio-3/medium/audio-inpainting

Stable Audio 3 Medium audio inpainting is a 1.4 billion parameter latent diffusion model that fills in or reworks selected segments of a stereo track guided by text prompts, supporting single- and multi-segment editing.

Inference

Commercial use

Schema

LLMs

Playground API Examples

Prompt examples

Examples are generated using the Stable Audio 3 Medium Audio Inpainting. You can customize them by clicking on the "Playground" button.

rainy rooftop future soul groove

num_inference_steps8

guidance_scale1

seed730102

Playground

Fill only the silent gap with rainy rooftop future-soul instrumental with muted trumpet stabs, soft Rhodes chords, and crisp rim clicks; create a musical phrase that lands on the surrounding beat and keeps the key coherent. No vocals.

num_inference_steps8

guidance_scale1

seed730102

Playground

Regenerate the masked region as a polished song section that keeps the same emotional direction. Make the result feel soft, focused, and naturally mixed. Give it a fresh city view feeling.

num_inference_steps12

guidance_scale1

Playground

num_inference_steps8

guidance_scale1

seed730102

Playground