fal-ai/flashtalk
Audio-driven talking avatar generation powered by the SoulX-FlashTalk 14B model.
Commercial use
Input
Image: upload a file or provide a URL. Accepted file types: jpg, jpeg, png, webp, gif, avif

Audio: upload a file or provide a URL. Accepted file types: mp3, ogg, wav, m4a, aac
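The accepted file types listed above can be checked locally before submitting anything. The following is a minimal sketch, not part of the service itself: the helper names `extension_of` and `is_accepted` are hypothetical, and the check looks only at the file extension, whereas the service may inspect the actual file contents.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions copied from the input hints on this page.
ACCEPTED = {
    "image": {"jpg", "jpeg", "png", "webp", "gif", "avif"},
    "audio": {"mp3", "ogg", "wav", "m4a", "aac"},
}


def extension_of(path_or_url: str) -> str:
    """Return the lowercase extension of a local path or URL, without the dot."""
    # urlparse drops any query string or fragment, leaving only the path part.
    path = urlparse(path_or_url).path
    return PurePosixPath(path).suffix.lstrip(".").lower()


def is_accepted(path_or_url: str, kind: str) -> bool:
    """Hypothetical helper: True if the extension is listed for `kind`."""
    return extension_of(path_or_url) in ACCEPTED[kind]


if __name__ == "__main__":
    print(is_accepted("avatar.PNG", "image"))
    print(is_accepted("https://example.com/a/voice.mp3?token=1", "audio"))
    print(is_accepted("clip.flac", "audio"))
```

An extension match is only a cheap first filter: a file can carry an accepted extension and still fail to decode on the server.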
Your request will cost $0.02 per second.
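As a rough illustration of the per-second rate, the sketch below multiplies a number of billed seconds by $0.02. It assumes billing is linear in seconds, with no minimum charge and no rounding up to whole seconds. The page does not say whether "per second" refers to output length or compute time, so the `billed_seconds` parameter is left generic, and `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# Rate taken from the pricing line on this page.
RATE_PER_SECOND = Decimal("0.02")


def estimate_cost(billed_seconds: float) -> Decimal:
    """Estimate the cost in USD, assuming simple linear per-second billing."""
    if billed_seconds < 0:
        raise ValueError("billed_seconds must be non-negative")
    # Convert through str() so float inputs do not bring binary rounding noise.
    cost = Decimal(str(billed_seconds)) * RATE_PER_SECOND
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    for seconds in (10, 30, 60):
        print(f"{seconds} s: ${estimate_cost(seconds)}")
```

Under these assumptions, one minute of billed time comes to $1.20.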