LTX Video (preview) Image to Video

fal-ai/ltx-video/image-to-video
Generate videos from images using LTX Video
Research only

Your request will cost $0.02 per video.

LTX Video | [image-to-video]

Lightricks' LTX Video transforms static images into 5-second video clips at $0.02 per generation. It trades extended duration for rapid inference and cost efficiency: image-to-video requests run with 30 inference steps by default (configurable from 1 to 50) and an adjustable guidance scale. Built for creators who need quick video extensions from existing imagery without frame-by-frame animation workflows.

Use Cases: Social Media Content Creation | Product Demo Animation | Storyboard Prototyping


Performance

At $0.02 per video generation, LTX Video delivers 50 generations per dollar, positioning it as a cost-efficient entry point for image-to-video workflows compared with the alternatives listed under How It Stacks Up, which range from $0.03 to $0.15 per generation.

Metric | Result | Context
Output Duration | 5 seconds | Standard for rapid social content iteration
Inference Steps | 30 (configurable 1-50) | Balances quality vs speed; adjustable via API
Cost per Video | $0.02 | 50 generations per $1.00 on fal
Guidance Scale | 2-10 (default 3) | Controls prompt adherence strength
Related Endpoints | Text-to-Video | Text-based generation without source image
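
The step and guidance ranges in the table can be checked client-side before a request is sent. The following is a minimal sketch, not the service's own validation: the `InferenceSettings` class is hypothetical, and the field names `num_inference_steps` and `guidance_scale` are assumptions that should be confirmed against the API documentation.

```python
from dataclasses import dataclass

# Ranges and defaults as listed in the table above.
STEPS_RANGE = (1, 50)
GUIDANCE_RANGE = (2.0, 10.0)


@dataclass(frozen=True)
class InferenceSettings:
    """Hypothetical container for the two tunable inference controls."""

    num_inference_steps: int = 30  # assumed field name; default from the table
    guidance_scale: float = 3.0    # assumed field name; default from the table

    def __post_init__(self) -> None:
        lo, hi = STEPS_RANGE
        if not lo <= self.num_inference_steps <= hi:
            raise ValueError(f"num_inference_steps must be in [{lo}, {hi}]")
        lo_g, hi_g = GUIDANCE_RANGE
        if not lo_g <= self.guidance_scale <= hi_g:
            raise ValueError(f"guidance_scale must be in [{lo_g}, {hi_g}]")
```

Rejecting out-of-range values locally avoids paying for a round trip that the endpoint would likely refuse anyway.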

Image-Driven Motion Synthesis at API Scale

LTX Video applies latent diffusion to the temporal dimension, extending single frames into short-form video through learned motion patterns rather than explicit keyframe animation. Unlike text-only video models that generate from scratch, this architecture preserves source image composition while synthesizing camera movement, object motion, and environmental dynamics.

What this means for you:

  • Source Image Fidelity: Maintains visual consistency with uploaded imagery; character appearance, lighting conditions, and scene composition carry through to generated motion

  • Prompt-Guided Motion Control: Natural language descriptions direct movement patterns (e.g., "drifts weightlessly," "slow and graceful") without technical animation parameters

  • Negative Prompt Refinement: Built-in quality filters suppress common artifacts (motion smear, anatomy distortions, fused fingers) through inverse guidance

  • Deterministic Generation: Seed parameter enables reproducible outputs for iterative refinement workflows
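
As a rough illustration of how the prompt, negative prompt, and seed described above might be combined into one request, here is a minimal sketch. The helper names `build_arguments` and `submit` are hypothetical, and the field names (`image_url`, `prompt`, `negative_prompt`, `seed`) are assumptions to verify against the API documentation. `submit` relies on the fal Python client (`fal-client` package) and is not exercised here because it needs network access and credentials.

```python
from typing import Any, Dict, Optional

ENDPOINT = "fal-ai/ltx-video/image-to-video"


def build_arguments(
    image_url: str,
    prompt: str,
    negative_prompt: Optional[str] = None,
    seed: Optional[int] = None,
) -> Dict[str, Any]:
    """Assemble request arguments; field names are assumed, not confirmed."""
    if not image_url or not prompt:
        raise ValueError("image_url and prompt are both required")
    arguments: Dict[str, Any] = {"image_url": image_url, "prompt": prompt}
    # Optional fields are left out so the service can apply its own defaults.
    if negative_prompt is not None:
        arguments["negative_prompt"] = negative_prompt
    if seed is not None:
        arguments["seed"] = seed
    return arguments


def submit(arguments: Dict[str, Any]) -> Any:
    """Send the request with the fal Python client (needs network and an API key)."""
    import fal_client  # imported lazily so the builder works without the package

    return fal_client.subscribe(ENDPOINT, arguments=arguments)
```

Reusing the same seed with identical arguments is what makes a run repeatable; changing only the prompt while holding the seed fixed is one way to iterate on motion without changing everything else at once.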


Technical Specifications

Spec | Details
Architecture | LTX Video
Input Formats | Image URL (JPEG, PNG, WebP, GIF, AVIF) + text prompt
Output Formats | Video file (5-second duration)
Inference Control | 1-50 steps, guidance scale 2-10
License | Research only
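
A quick pre-flight check on the image URL can catch obviously unsupported inputs. The sketch below is a heuristic only: `looks_supported` is a hypothetical helper that inspects the file extension in the URL path, and the service may well accept URLs without an extension or detect the format from the file contents instead.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions corresponding to the input formats listed above.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp", ".gif", ".avif"}


def looks_supported(image_url: str) -> bool:
    """Return True if the URL is http(s) and its path ends in a listed extension."""
    parsed = urlparse(image_url)
    if parsed.scheme not in ("http", "https"):
        return False
    # Query strings and fragments are ignored; only the path is inspected.
    suffix = PurePosixPath(parsed.path).suffix.lower()
    return suffix in SUPPORTED_EXTENSIONS
```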

API Documentation | Quickstart Guide | Enterprise Pricing


How It Stacks Up

Kling Video v2.6 Image to Video ($0.05) – LTX Video gives up extended duration and resolution options in exchange for a 2.5x lower price ($0.02 versus $0.05 per generation). Kling v2.6 prioritizes longer-form content (up to 10 seconds) and higher fidelity for polished commercial work where production value justifies premium pricing.

Pixverse Image to Video ($0.05) – LTX Video offers faster iteration cycles through lower per-generation cost, ideal for rapid prototyping and social content workflows. Pixverse emphasizes stylistic control and visual effects integration for creators requiring advanced motion customization at $0.05 per inference.

LongCat Video Image to Video ($0.03) – LTX Video is priced comparably and has a simpler configuration, with fewer parameters to adjust. LongCat specifies 720p output and exposes more controls, which suits users who need explicit resolution targeting at a slightly higher cost.

MiniMax Hailuo 2.3 Pro ($0.15) – LTX Video prioritizes cost efficiency for volume workflows at 7.5x lower pricing. Hailuo 2.3 Pro targets premium production requirements with advanced temporal coherence and extended duration capabilities where per-generation cost is secondary to output quality.
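
The price multiples quoted above follow directly from the listed per-generation prices. The sketch below reproduces that arithmetic; the prices are the ones stated on this page, while the helper names `price_multiple` and `generations_for` are hypothetical. `Decimal` is used so that the currency arithmetic stays exact.

```python
from decimal import Decimal

# Per-generation prices in USD as listed in this comparison.
PRICES = {
    "LTX Video": Decimal("0.02"),
    "LongCat Video": Decimal("0.03"),
    "Kling Video v2.6": Decimal("0.05"),
    "Pixverse": Decimal("0.05"),
    "MiniMax Hailuo 2.3 Pro": Decimal("0.15"),
}


def price_multiple(model: str, baseline: str = "LTX Video") -> Decimal:
    """How many times more expensive `model` is than `baseline`."""
    return PRICES[model] / PRICES[baseline]


def generations_for(budget_usd: str, model: str) -> int:
    """Whole generations affordable for a budget given as a decimal string."""
    return int(Decimal(budget_usd) // PRICES[model])


if __name__ == "__main__":
    for name in PRICES:
        print(name, price_multiple(name), generations_for("1.00", name))
```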