CSM-1B Text to Audio
fal-ai/csm-1b
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs.
Inference
Commercial use
Input
Additional Settings
Customize your input with more control.
Result
Idle
Waiting for your input...
What would you like to do next?
Your request will cost $0.03 per 1000 character.