Kling LipSync Audio-to-Video Text to Video
fal-ai/kling-video/lipsync/audio-to-video
Kling LipSync is an audio-to-video model that generates realistic lip movements from audio input.
Inference
Commercial use
Partner
Input
Hint: Drag and drop video files from your computer, video from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: mp4, mov, webm, m4v, gif
Hint: Drag and drop audio files from your computer, audio from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: mp3, ogg, wav, m4a, aac
Result
Idle
This generation takes approximately 12m.
What would you like to do next?
Your request will be priced $0.014 per input video seconds, rolling up to closest 5 second increment. For example, if your video's duration is 3 seconds, it will be billed as a 5 second video