Skip to main content
Endpoint: POST https://fal.run/fal-ai/kling-video/v3/standard/image-to-video Endpoint ID: fal-ai/kling-video/v3/standard/image-to-video

Try it in the Playground

Run this model interactively with your own prompts.

Quick Start

import fal_client

def on_queue_update(update):
    if isinstance(update, fal_client.InProgress):
        for log in update.logs:
           print(log["message"])

result = fal_client.subscribe(
    "fal-ai/kling-video/v3/standard/image-to-video",
    arguments={
        "start_image_url": "https://storage.googleapis.com/falserverless/example_inputs/kling-v3/standard-i2v/start_image.png"
    },
    with_logs=True,
    on_queue_update=on_queue_update,
)
print(result)

Input Schema

prompt
string
Text prompt for video generation. Either prompt or multi_prompt must be provided, but not both.
multi_prompt
list<KlingV3MultiPromptElement>
List of prompts for multi-shot video generation. If provided, divides the video into multiple shots.
start_image_url
string
required
URL of the image to be used for the video
duration
DurationEnum
default:"5"
The duration of the generated video in seconds Default value: "5"Possible values: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
generate_audio
boolean
default:"true"
Whether to generate native audio for the video. Supports Chinese and English voice output. Other languages are automatically translated to English. For English speech, use lowercase letters; for acronyms or proper nouns, use uppercase. Default value: true
end_image_url
string
URL of the image to be used for the end of the video
elements
list<KlingV3ComboElementInput>
Elements (characters/objects) to include in the video. Each example can either be an image set (frontal + reference images) or a video. Reference in prompt as @Element1, @Element2, etc.
shot_type
string
default:"customize"
The type of multi-shot video generation. Required when multi_prompt is provided. Default value: "customize"
negative_prompt
string
default:"blur, distort, and low quality"
Default value: "blur, distort, and low quality"
cfg_scale
float
default:"0.5"
The CFG (Classifier Free Guidance) scale is a measure of how close you want the model to stick to your prompt. Default value: 0.5Range: 0 to 1

Output Schema

video
File
required
The generated video

Input Example

{
  "prompt": "Camera slowly orbits around the vase. Soft light shifts across the ceramic surface. The pampas grass sways gently. Shadows move elegantly. Smooth continuous motion, premium feel.",
  "multi_prompt": null,
  "start_image_url": "https://storage.googleapis.com/falserverless/example_inputs/kling-v3/standard-i2v/start_image.png",
  "duration": "12",
  "generate_audio": true,
  "elements": [
    {
      "frontal_image_url": "https://v3b.fal.media/files/b/0a8cfd5f/-kZL-ha3Iuelku5IHXC-A_glasses.png",
      "reference_image_urls": [
        "https://v3b.fal.media/files/b/0a8cfd62/psPCmzrD1y9vDgdyNfKAL_glasses_back.png"
      ]
    },
    {
      "video_url": "https://v3b.fal.media/files/b/0a8cfd66/b03SOiQvKLlFx_jqdNZ9z_child_video.mp4"
    }
  ],
  "shot_type": "customize",
  "negative_prompt": "blur, distort, and low quality",
  "cfg_scale": 0.5
}

Output Example

{
  "video": {
    "content_type": "video/mp4",
    "file_name": "out.mp4",
    "file_size": 3149129,
    "url": "https://storage.googleapis.com/falserverless/example_outputs/kling-v3/standard-i2v/out.mp4"
  }
}

Limitations

  • cfg_scale range: 0 to 1
  • shot_type restricted to: customize, intelligent
  • aspect_ratio restricted to: 16:9, 9:16, 1:1
  • character_orientation restricted to: image, video