Kling LipSync Text-to-Video
fal-ai/kling-video/lipsync/text-to-video
Kling LipSync is a text-to-video model that generates realistic lip movements from text input.
Inference
Commercial use
Partner
Input
Provide the input video as a file upload or as a URL. Accepted file types: mp4, mov, webm, m4v, gif
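As a rough illustration of the accepted file types above, the following sketch checks a file name or URL against that list before uploading. The helper name `has_accepted_extension` is hypothetical and not part of any fal API, and it assumes the extension alone is a useful pre-check; the service may well inspect the actual file contents instead.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions taken from the "Accepted file types" list above.
ACCEPTED_EXTENSIONS = {"mp4", "mov", "webm", "m4v", "gif"}


def has_accepted_extension(path_or_url: str) -> bool:
    """Return True if the file name or URL ends in an accepted extension.

    Hypothetical client-side helper: it only looks at the name, not the content.
    """
    # urlparse drops any query string or fragment; a plain file name passes through as the path.
    path = urlparse(path_or_url).path
    suffix = PurePosixPath(path).suffix.lower().lstrip(".")
    return suffix in ACCEPTED_EXTENSIONS
```

A check like this can only catch obvious mistakes early; a file with the right extension can still be rejected by the service.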
Result
Generation takes approximately 12 minutes.
Requests are priced at $0.014 per second of input video, rounded up to the nearest 5-second increment. For example, a 3-second video is billed as a 5-second video.
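The pricing rule can be sketched as a small calculation. This is an estimate based only on the figures stated here; the function names are hypothetical, and it assumes a duration that is already an exact multiple of 5 seconds is not rounded up further.

```python
import math
from decimal import Decimal

PRICE_PER_SECOND = Decimal("0.014")  # USD per billed second, from the pricing note
INCREMENT_SECONDS = 5                # billing granularity, from the pricing note


def billed_seconds(duration_seconds: float) -> int:
    """Round the input video duration up to the next 5-second increment."""
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    return math.ceil(duration_seconds / INCREMENT_SECONDS) * INCREMENT_SECONDS


def estimated_cost(duration_seconds: float) -> Decimal:
    """Estimated price in USD for one request (assumption: no other fees apply)."""
    return PRICE_PER_SECOND * billed_seconds(duration_seconds)
```

For the 3-second example, `billed_seconds(3)` gives 5 and `estimated_cost(3)` comes to $0.07. `Decimal` is used so the result does not pick up floating-point rounding noise.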