AI Avatar Image to Video
fal-ai/ai-avatar/multi-text
The MultiTalk model generates a multi-person conversation video from an image and text inputs. It converts the text to speech for each person and produces a realistic conversation scene.
Inference
Commercial use
Input
Hint: you can drag and drop file(s) here, or provide a base64 encoded data URL. Accepted file types: jpg, jpeg, png, webp, gif, avif.
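As a minimal sketch of the base64 data URL option mentioned above, the following Python builds a data URL from image bytes using the standard `data:<mime>;base64,<payload>` form. The helper names `to_data_url` and `file_to_data_url` are hypothetical and are not part of any fal API; the extension-to-MIME mapping is an assumption based on the accepted file types listed above.

```python
import base64
from pathlib import Path

# Assumed mapping from the accepted file extensions to their MIME types.
ACCEPTED_MIME_TYPES = {
    "jpg": "image/jpeg",
    "jpeg": "image/jpeg",
    "png": "image/png",
    "webp": "image/webp",
    "gif": "image/gif",
    "avif": "image/avif",
}


def to_data_url(data: bytes, extension: str) -> str:
    """Encode raw image bytes as a base64 data URL (hypothetical helper)."""
    ext = extension.lower().lstrip(".")
    if ext not in ACCEPTED_MIME_TYPES:
        raise ValueError(f"unsupported file type: {extension!r}")
    payload = base64.b64encode(data).decode("ascii")
    return f"data:{ACCEPTED_MIME_TYPES[ext]};base64,{payload}"


def file_to_data_url(path: str) -> str:
    """Read an image file from disk and return it as a data URL."""
    p = Path(path)
    return to_data_url(p.read_bytes(), p.suffix)
```

The resulting string can be supplied wherever the page accepts a data URL in place of an uploaded file. Base64 inflates the payload by roughly a third, so for large images a hosted URL may be the more practical choice.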

Your request will cost $0.3 per second of output video.
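The per-second price above makes a rough cost estimate straightforward. The sketch below assumes billing is strictly proportional to output duration, with no minimum charge and no rounding up to whole seconds; the page does not confirm either point. The function name `estimate_cost_usd` is hypothetical.

```python
from decimal import Decimal

# Price stated on the page: $0.3 per second of output video.
PRICE_PER_SECOND_USD = Decimal("0.3")


def estimate_cost_usd(output_seconds: float) -> Decimal:
    """Estimate the request cost, assuming purely proportional billing."""
    if output_seconds < 0:
        raise ValueError("output_seconds must be non-negative")
    # Convert via str() to avoid binary floating-point artifacts.
    cost = PRICE_PER_SECOND_USD * Decimal(str(output_seconds))
    return cost.quantize(Decimal("0.01"))
```

For example, under these assumptions a 10-second output video would come to about $3.00.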