CSM-1B Text to Audio

fal-ai/csm-1b
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates residual vector quantization (RVQ) audio codes from text and audio inputs.
Inference
Commercial use

Your request will cost $0.03 per 1,000 characters.
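As a rough illustration of the stated rate of $0.03 per 1,000 characters, the sketch below estimates the cost of a request from the length of its input text. It assumes billing is strictly proportional to character count, with no minimum charge, rounding, or separate charge for audio inputs; the page does not confirm any of this. The function name `estimate_cost` is hypothetical and is not part of any fal API.

```python
from decimal import Decimal

# Rate taken from the pricing line on this page: $0.03 per 1,000 characters.
PRICE_PER_1000_CHARS = Decimal("0.03")


def estimate_cost(text: str) -> Decimal:
    """Estimate the request cost in USD for the given input text.

    Assumption (not confirmed by the page): cost scales linearly with
    the number of characters, with no minimum charge and no rounding.
    """
    return PRICE_PER_1000_CHARS * len(text) / 1000


if __name__ == "__main__":
    sample = "Hello from Sesame."
    print(f"{len(sample)} characters -> ${estimate_cost(sample)}")
```

Decimal arithmetic is used instead of floats so that small per-request amounts do not pick up binary rounding noise when summed over many requests.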