DiffRhythm: Lyrics to Song (Text to Audio)
fal-ai/diffrhythm
DiffRhythm is a fast model that turns lyrics into full songs, generating a complete song in under 30 seconds.
Inference
Commercial use
Input
Hint: you can drag and drop file(s) here, or provide a base64-encoded data URL. Accepted file types: mp3, ogg, wav, m4a, aac.
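As a minimal sketch of the base64 data URL option mentioned in the hint, the snippet below encodes audio bytes into a `data:` URL using only the Python standard library. The helper name `audio_to_data_url` and the extension-to-MIME mapping are assumptions made for illustration; the page itself only lists the accepted extensions, and nothing here calls the model's API.

```python
import base64
from pathlib import Path

# Assumed MIME types for the accepted extensions (mp3, ogg, wav, m4a, aac).
AUDIO_MIME_TYPES = {
    ".mp3": "audio/mpeg",
    ".ogg": "audio/ogg",
    ".wav": "audio/wav",
    ".m4a": "audio/mp4",
    ".aac": "audio/aac",
}


def audio_to_data_url(filename: str, data: bytes) -> str:
    """Build a base64 data URL from audio bytes, keyed on the file extension.

    Hypothetical helper, not part of any fal library.
    """
    suffix = Path(filename).suffix.lower()
    if suffix not in AUDIO_MIME_TYPES:
        raise ValueError(f"unsupported audio type: {suffix or filename}")
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{AUDIO_MIME_TYPES[suffix]};base64,{encoded}"


if __name__ == "__main__":
    # Stand-in bytes; in practice, use Path("reference.mp3").read_bytes().
    print(audio_to_data_url("reference.mp3", b"abc"))
    # prints data:audio/mpeg;base64,YWJj
```

Base64 inflates the payload by roughly a third, so for large audio files drag-and-drop upload is likely the more practical route.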
Related Models
fal-ai/mmaudio-v2/text-to-audio
text-to-audio
MMAudio generates synchronized audio given text inputs. It can generate sounds described by a prompt.
audio
fast
fal-ai/kokoro/british-english
text-to-audio
A high-quality British English text-to-speech model offering natural and expressive voice synthesis.
speech
fal-ai/kokoro/french
text-to-audio
An expressive and natural French text-to-speech model for both European and Canadian French.
speech