CSM-1B Text to Audio

fal-ai/csm-1b
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs.
Inference
Research only

Input

Additional Settings

Customize your input with more control.

Result

Idle

Loading pricing info...

Logs