CSM-1B Text to Audio
fal-ai/csm-1b
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ (residual vector quantization) audio codes from text and audio inputs.
Inference
Research only
Your request will cost $0.03 per thousand characters.
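As a rough guide, the stated rate can be turned into a per-request estimate. The sketch below is only an illustration: the helper name `estimate_cost` is hypothetical, and it assumes the charge scales linearly with the character count of the input text, with no minimum charge or rounding to whole thousands. Actual billing may differ.

```python
from decimal import Decimal

# Rate taken from the pricing line: $0.03 per 1,000 characters.
PRICE_PER_1000_CHARS = Decimal("0.03")


def estimate_cost(text: str) -> Decimal:
    """Estimate the request cost in USD for the given input text.

    Assumption (not confirmed by the page): cost is pro-rated linearly
    by character count, with no minimum charge and no rounding.
    """
    return PRICE_PER_1000_CHARS * len(text) / 1000


if __name__ == "__main__":
    sample = "Hello from the conversational speech model. " * 50
    print(f"{len(sample)} characters, about ${estimate_cost(sample)}")
```

`Decimal` is used instead of `float` so that small currency amounts are not distorted by binary floating-point rounding.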