Kling Video V3 Standard API

Image To Video
Text To Video
Motion Control

Endpoint: POST https://fal.run/fal-ai/kling-video/v3/standard/image-to-video Endpoint ID: fal-ai/kling-video/v3/standard/image-to-video

Try it in the Playground

Run this model interactively with your own prompts.

Quick Start

import fal_client

def on_queue_update(update):
    if isinstance(update, fal_client.InProgress):
        for log in update.logs:
           print(log["message"])

result = fal_client.subscribe(
    "fal-ai/kling-video/v3/standard/image-to-video",
    arguments={
        "start_image_url": "https://storage.googleapis.com/falserverless/example_inputs/kling-v3/standard-i2v/start_image.png"
    },
    with_logs=True,
    on_queue_update=on_queue_update,
)
print(result)

import { fal } from "@fal-ai/client";

const result = await fal.subscribe("fal-ai/kling-video/v3/standard/image-to-video", {
  input: {
      start_image_url: "https://storage.googleapis.com/falserverless/example_inputs/kling-v3/standard-i2v/start_image.png"
    },
  logs: true,
  onQueueUpdate: (update) => {
    if (update.status === "IN_PROGRESS") {
      update.logs.map((log) => log.message).forEach(console.log);
    }
  },
});
console.log(result.data);
console.log(result.requestId);

curl --request POST \
  --url https://fal.run/fal-ai/kling-video/v3/standard/image-to-video \
  --header "Authorization: Key $FAL_KEY" \
  --header "Content-Type: application/json" \
  --data '{
  "start_image_url": "https://storage.googleapis.com/falserverless/example_inputs/kling-v3/standard-i2v/start_image.png"
}'

Input Schema

prompt

string

Text prompt for video generation. Either prompt or multi_prompt must be provided, but not both.

multi_prompt

list<KlingV3MultiPromptElement>

List of prompts for multi-shot video generation. If provided, divides the video into multiple shots.

start_image_url

string

required

URL of the image to be used for the video

duration

DurationEnum

default:"5"

The duration of the generated video in seconds Default value: "5"Possible values: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15

generate_audio

boolean

default:"true"

Whether to generate native audio for the video. Supports Chinese and English voice output. Other languages are automatically translated to English. For English speech, use lowercase letters; for acronyms or proper nouns, use uppercase. Default value: true

end_image_url

string

URL of the image to be used for the end of the video

elements

list<KlingV3ComboElementInput>

Elements (characters/objects) to include in the video. Each example can either be an image set (frontal + reference images) or a video. Reference in prompt as @Element1, @Element2, etc.

shot_type

string

default:"customize"

The type of multi-shot video generation. Required when multi_prompt is provided. Default value: "customize"

negative_prompt

string

default:"blur, distort, and low quality"

Default value: "blur, distort, and low quality"

cfg_scale

float

default:"0.5"

The CFG (Classifier Free Guidance) scale is a measure of how close you want the model to stick to your prompt. Default value: 0.5Range: 0 to 1

Output Schema

video

File

required

The generated video

Input Example

{
  "prompt": "Camera slowly orbits around the vase. Soft light shifts across the ceramic surface. The pampas grass sways gently. Shadows move elegantly. Smooth continuous motion, premium feel.",
  "multi_prompt": null,
  "start_image_url": "https://storage.googleapis.com/falserverless/example_inputs/kling-v3/standard-i2v/start_image.png",
  "duration": "12",
  "generate_audio": true,
  "elements": [
    {
      "frontal_image_url": "https://v3b.fal.media/files/b/0a8cfd5f/-kZL-ha3Iuelku5IHXC-A_glasses.png",
      "reference_image_urls": [
        "https://v3b.fal.media/files/b/0a8cfd62/psPCmzrD1y9vDgdyNfKAL_glasses_back.png"
      ]
    },
    {
      "video_url": "https://v3b.fal.media/files/b/0a8cfd66/b03SOiQvKLlFx_jqdNZ9z_child_video.mp4"
    }
  ],
  "shot_type": "customize",
  "negative_prompt": "blur, distort, and low quality",
  "cfg_scale": 0.5
}

Output Example

{
  "video": {
    "content_type": "video/mp4",
    "file_name": "out.mp4",
    "file_size": 3149129,
    "url": "https://storage.googleapis.com/falserverless/example_outputs/kling-v3/standard-i2v/out.mp4"
  }
}

Endpoint: POST https://fal.run/fal-ai/kling-video/v3/standard/text-to-video Endpoint ID: fal-ai/kling-video/v3/standard/text-to-video

Try it in the Playground

Run this model interactively with your own prompts.

Quick Start

import fal_client

def on_queue_update(update):
    if isinstance(update, fal_client.InProgress):
        for log in update.logs:
           print(log["message"])

result = fal_client.subscribe(
    "fal-ai/kling-video/v3/standard/text-to-video",
    arguments={},
    with_logs=True,
    on_queue_update=on_queue_update,
)
print(result)

import { fal } from "@fal-ai/client";

const result = await fal.subscribe("fal-ai/kling-video/v3/standard/text-to-video", {
  input: {},
  logs: true,
  onQueueUpdate: (update) => {
    if (update.status === "IN_PROGRESS") {
      update.logs.map((log) => log.message).forEach(console.log);
    }
  },
});
console.log(result.data);
console.log(result.requestId);

curl --request POST \
  --url https://fal.run/fal-ai/kling-video/v3/standard/text-to-video \
  --header "Authorization: Key $FAL_KEY" \
  --header "Content-Type: application/json" \
  --data '{}'

Input Schema

prompt

string

Text prompt for video generation. Either prompt or multi_prompt must be provided, but not both.

duration

DurationEnum

default:"5"

The duration of the generated video in seconds Default value: "5"Possible values: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15

multi_prompt

list<KlingV3MultiPromptElement>

List of prompts for multi-shot video generation. If provided, overrides the single prompt and divides the video into multiple shots with specified prompts and durations.

generate_audio

boolean

default:"true"

shot_type

ShotTypeEnum

default:"customize"

The type of multi-shot video generation Default value: "customize"Possible values: customize, intelligent

aspect_ratio

AspectRatioEnum

default:"16:9"

The aspect ratio of the generated video frame Default value: "16:9"Possible values: 16:9, 9:16, 1:1

negative_prompt

string

default:"blur, distort, and low quality"

Default value: "blur, distort, and low quality"

cfg_scale

float

default:"0.5"

The CFG (Classifier Free Guidance) scale is a measure of how close you want the model to stick to your prompt. Default value: 0.5Range: 0 to 1

Output Schema

video

File

required

The generated video

Input Example

{
  "prompt": "Cinematic drone shot flying through ancient stone ruins covered in moss and vines at golden hour. Camera starts low, rises through crumbling archways, revealing a vast misty valley beyond. Volumetric light rays pierce through gaps in the stone. Epic scale, photorealistic, 8K quality.",
  "duration": "5",
  "multi_prompt": null,
  "generate_audio": true,
  "shot_type": "customize",
  "aspect_ratio": "16:9",
  "negative_prompt": "blur, distort, and low quality",
  "cfg_scale": 0.5
}

Output Example

{
  "video": {
    "content_type": "video/mp4",
    "file_name": "output.mp4",
    "file_size": 6797486,
    "url": "https://storage.googleapis.com/falserverless/example_outputs/kling-v3/standard-t2v/out.mp4"
  }
}

Endpoint: POST https://fal.run/fal-ai/kling-video/v3/standard/motion-control Endpoint ID: fal-ai/kling-video/v3/standard/motion-control

Try it in the Playground

Run this model interactively with your own prompts.

Quick Start

import fal_client

def on_queue_update(update):
    if isinstance(update, fal_client.InProgress):
        for log in update.logs:
           print(log["message"])

result = fal_client.subscribe(
    "fal-ai/kling-video/v3/standard/motion-control",
    arguments={
        "image_url": "https://v3b.fal.media/files/b/0a90ee31/VHtWK5BZMa-XoT6gahJKS_077.png",
        "video_url": "https://v3b.fal.media/files/b/0a90edae/3nvl30ic9g2otKRcOV5nO_output.mp4",
        "character_orientation": "video"
    },
    with_logs=True,
    on_queue_update=on_queue_update,
)
print(result)

import { fal } from "@fal-ai/client";

const result = await fal.subscribe("fal-ai/kling-video/v3/standard/motion-control", {
  input: {
      image_url: "https://v3b.fal.media/files/b/0a90ee31/VHtWK5BZMa-XoT6gahJKS_077.png",
      video_url: "https://v3b.fal.media/files/b/0a90edae/3nvl30ic9g2otKRcOV5nO_output.mp4",
      character_orientation: "video"
    },
  logs: true,
  onQueueUpdate: (update) => {
    if (update.status === "IN_PROGRESS") {
      update.logs.map((log) => log.message).forEach(console.log);
    }
  },
});
console.log(result.data);
console.log(result.requestId);

curl --request POST \
  --url https://fal.run/fal-ai/kling-video/v3/standard/motion-control \
  --header "Authorization: Key $FAL_KEY" \
  --header "Content-Type: application/json" \
  --data '{
  "image_url": "https://v3b.fal.media/files/b/0a90ee31/VHtWK5BZMa-XoT6gahJKS_077.png",
  "video_url": "https://v3b.fal.media/files/b/0a90edae/3nvl30ic9g2otKRcOV5nO_output.mp4",
  "character_orientation": "video"
}'

Input Schema

prompt

string

image_url

string

required

Reference image URL. The characters, backgrounds, and other elements in the generated video are based on this reference image. Characters should have clear body proportions, avoid occlusion, and occupy more than 5% of the image area.

video_url

string

required

Reference video URL. The character actions in the generated video will be consistent with this reference video. Should contain a realistic style character with entire body or upper body visible, including head, without obstruction. Duration limit depends on character_orientation: 10s max for ‘image’, 30s max for ‘video’.

keep_original_sound

boolean

default:"true"

Whether to keep the original sound from the reference video. Default value: true

character_orientation

CharacterOrientationEnum

required

Controls whether the output character’s orientation matches the reference image or video. ‘video’: orientation matches reference video - better for complex motions (max 30s). ‘image’: orientation matches reference image - better for following camera movements (max 10s).Possible values: image, video

elements

list<KlingV3ImageElementInput>

Optional element for facial consistency binding. Upload a facial element to enhance identity preservation in the generated video. Only 1 element is supported. Reference in prompt as @Element1. Element binding is only supported when character_orientation is ‘video’.

Output Schema

video

File

required

The generated video

Input Example

{
  "prompt": "An man dancing",
  "image_url": "https://v3b.fal.media/files/b/0a90ee31/VHtWK5BZMa-XoT6gahJKS_077.png",
  "video_url": "https://v3b.fal.media/files/b/0a90edae/3nvl30ic9g2otKRcOV5nO_output.mp4",
  "keep_original_sound": true,
  "character_orientation": "video",
  "elements": null
}

Output Example

{
  "video": {
    "url": "https://v3b.fal.media/files/b/0a90ee68/7-xpX-LTxX_nRHL776FrZ_output.mp4"
  }
}

Kling Video v3 Text to Video [Pro] — Video Generation
Kling Video v3 Image to Video [Pro] — Video Generation
Kling Video — Video Generation

Limitations

cfg_scale range: 0 to 1
shot_type restricted to: customize, intelligent
aspect_ratio restricted to: 16:9, 9:16, 1:1
character_orientation restricted to: image, video

Try it in the Playground

​Quick Start

​Input Schema

​Output Schema

​Input Example

​Output Example

Try it in the Playground

​Quick Start

​Input Schema

​Output Schema

​Input Example

​Output Example

Try it in the Playground

​Quick Start

​Input Schema

​Output Schema

​Input Example

​Output Example

​Related

​Limitations

Quick Start

Input Schema

Output Schema

Input Example

Output Example

Quick Start

Input Schema

Output Schema

Input Example

Output Example

Quick Start

Input Schema

Output Schema

Input Example

Output Example

Related

Limitations