FLUX.2 is now live!

Kling LipSync Audio-to-Video Text to Video

fal-ai/kling-video/lipsync/audio-to-video
Kling LipSync is an audio-to-video model that generates realistic lip movements from audio input.
Inference
Commercial use
Partner

Input

Result

Idle
This generation takes approximately 12m.

What would you like to do next?

Your request will be priced $0.014 per input video seconds, rolling up to closest 5 second increment. For example, if your video's duration is 3 seconds, it will be billed as a 5 second video

Logs