fal-ai/kling-video/lipsync/text-to-video

Kling LipSync is a text-to-video model that generates realistic lip movements from text input.

Inference

Commercial use

Partner

Schema

LLMs

Playground API

Input

Video Url*

Hint: Drag and drop video files from your computer, video from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: mp4, mov, webm, m4v, gif

Text*

Voice Id*

Additional Settings

Customize your input with more control.

Result

Idle

This generation takes approximately 12m.

What would you like to do next?

Your request will be priced $0.014 per input video seconds, rolling up to closest 5 second increment. For example, if your video's duration is 3 seconds, it will be billed as a 5 second video

fal-ai/kling-video/lipsync/text-to-video

Input

Result

What would you like to do next?

Logs