CSM-1B Text to Audio
fal-ai/csm-1b
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs.
Inference
Research only
Input
Additional Settings
Customize your input with more control.
Result
Idle
Waiting for your input...
Your request will cost $0.03 per 1000 character.