Infinitalk Text to Video
fal-ai/infinitalk/single-text
Infinitalk model generates a talking avatar video from a text and audio file. The avatar lip-syncs to the provided audio with natural facial expressions.
Inference
Commercial use
Input
Hint: Drag and drop image files from your computer, images from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: jpg, jpeg, png, webp, gif, avif

Additional Settings
Customize your input with more control.
Result
Idle
What would you like to do next?
Your request will cost $0 per compute second.
For 720p price will be doubled.