fal-ai/ai-avatar/single-text

The MultiTalk model generates a talking avatar video from an image and text. It converts the text to speech automatically, then generates a video of the avatar speaking with lip-sync.
Inference
Commercial use

Your request will cost $0.20 per second.

At 720p, the price is doubled.
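
The pricing rule can be sketched as a small calculation. This is only an illustration: it assumes that "per second" means per second of generated video, and the function name `estimate_cost`, the resolution label `"720p"` as a string, and the default `"480p"` label are all hypothetical, not part of the fal API.

```python
# Rates come from the pricing note; the billing unit (seconds of
# generated video) is an assumption, not something the page confirms.
BASE_RATE_USD_PER_SECOND = 0.20
HD_MULTIPLIER = 2  # the price is doubled at 720p


def estimate_cost(seconds: float, resolution: str = "480p") -> float:
    """Return an estimated cost in USD for a clip of the given length.

    Any resolution label other than "720p" is billed at the base rate.
    """
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    rate = BASE_RATE_USD_PER_SECOND
    if resolution == "720p":
        rate *= HD_MULTIPLIER
    # Round to whole cents to avoid floating-point noise in the result.
    return round(rate * seconds, 2)


if __name__ == "__main__":
    print(estimate_cost(10))          # base rate
    print(estimate_cost(10, "720p"))  # doubled rate
```

Under these assumptions, a 10-second clip comes to $2.00 at the base rate and $4.00 at 720p.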
