fal-ai/ai-avatar

MultiTalk model generates a talking avatar video from an image and audio file. The avatar lip-syncs to the provided audio with natural facial expressions.
Inference
Commercial use

Input

Type # to reference inputs.

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

Your request will cost $0.2 per second.

For 720p price will be doubled.

Logs