CSM-1B Text to Audio
fal-ai/csm-1b
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs.
Inference
Research only
Input
Additional Settings
Customize your input with more control.
Result
Idle
Loading pricing info...
Logs
Related Models
fal-ai/yue
text-to-audio
YuE is a groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into full songs.
music
fal-ai/diffrhythm
text-to-audio
DiffRhythm is a blazing fast model for transforming lyrics into full songs. It boasts the capability to generate full songs in less than 30 seconds.
new
music
fal-ai/zonos
text-to-audio
Clone voice of any person and speak anything in their voise using zonos' voice cloning.
voice cloning