Ai Avatar Image to Video

fal-ai/ai-avatar/multi-text
MultiTalk model generates a multi-person conversation video from an image and text inputs. Converts text to speech for each person, generating a realistic conversation scene.
Inference
Commercial use

Input

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

Your request will cost $0.3 per second of output video.

For 720p price will be doubled.

Logs