fal-ai/zonos2
Zonos2 is a text-to-speech model that clones a voice from a short sample and speaks naturally across many languages.
Inference
Commercial use
Input
Hint: Drag and drop audio files from your computer, audio from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: mp3, ogg, wav, m4a, aac
Additional Settings
Customize your input with more control.