LTX Video 2.0 Pro Image to Video

fal-ai/ltx-2/image-to-video
Create high-fidelity video with audio from images with LTX-2 Pro

Your request will cost $0.06 per second for 1080p, $0.12 per second for 1440p or $0.24 per second for 2160p.

LTX Video 2.0 Pro | [image-to-video]

Lightricks' LTX Video 2.0 Pro generates high-fidelity video with synthesized audio from a single image, priced at $0.06-$0.24 per second depending on resolution. It trades raw speed for production-grade quality and targets creators who need broadcast-ready output with synchronized audio rather than rapid prototyping. It is built for professional video workflows where visual fidelity and audio integration justify the cost premium.

Use Cases: Social Media Content Creation | Marketing Video Production | Creative Storytelling


Performance

LTX Video 2.0 Pro sits at the premium end of image-to-video generation, trading cost efficiency for integrated audio and a choice of three output resolutions.

Metric | Result | Context
Resolution Options | 1080p, 1440p, 2160p | Three quality tiers with proportional pricing
Duration Range | 6-10 seconds | Configurable output length in 2-second increments
Cost per Second | $0.06 (1080p), $0.12 (1440p), $0.24 (2160p) | Resolution-based pricing: 16.67 seconds per $1 at 1080p, 8.33 seconds at 1440p, 4.17 seconds at 2160p
Frame Rate Options | 25 fps, 50 fps | Standard or high-frame-rate output
Audio Generation | Integrated | Synchronized audio synthesis included by default
Related Endpoints | LTX Video 2.0 Fast | Speed-optimized variant for faster iteration at half the cost ($0.03-$0.12/second)
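
The per-second pricing above can be turned into a quick budget check. The following is a minimal Python sketch, assuming that billing is simply price per second multiplied by clip length with no rounding or minimum charge (the page does not state this); the function names are illustrative only.

```python
from decimal import Decimal

# Per-second prices in USD, taken from the pricing figures on this page.
PRICE_PER_SECOND = {
    "1080p": Decimal("0.06"),
    "1440p": Decimal("0.12"),
    "2160p": Decimal("0.24"),
}


def estimate_cost(resolution: str, duration_seconds: int) -> Decimal:
    """Estimate the price of one clip in USD (assumes plain per-second billing)."""
    if resolution not in PRICE_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    return PRICE_PER_SECOND[resolution] * duration_seconds


def seconds_per_dollar(resolution: str) -> Decimal:
    """Seconds of video that one dollar buys, rounded to two decimal places."""
    return (Decimal(1) / PRICE_PER_SECOND[resolution]).quantize(Decimal("0.01"))


if __name__ == "__main__":
    for res in PRICE_PER_SECOND:
        print(res, estimate_cost(res, 10), seconds_per_dollar(res))
```

Using `Decimal` rather than floats keeps the cent values exact, which matters when summing many clips.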

Production-Ready Video with Integrated Audio

LTX Video 2.0 Pro differentiates itself by synthesizing audio natively rather than treating it as a separate post-processing step. Where many image-to-video models output silent clips that need audio layered on manually, this model generates a synchronized soundscape during video creation, which removes that extra step from the workflow.

What this means for you:

  • Multi-resolution flexibility: Generate at 1080p for social media, 1440p for web content, or 2160p for broadcast-quality output without switching models
  • Synchronized audio synthesis: Audio generation happens inline with video creation, matching visual cues and motion automatically
  • Configurable duration control: Choose 6-, 8-, or 10-second outputs based on platform requirements and budget constraints
  • Variable frame rate support: Standard 25 fps for most use cases or 50 fps for smooth motion in action sequences
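
The options listed above can be checked before a request is sent, so that an unsupported combination fails locally rather than after submission. This is a minimal Python sketch; the dictionary keys (`image_url`, `resolution`, `duration`, `fps`) and the function name are assumptions made for illustration and are not confirmed by this page, so check them against the endpoint's API documentation.

```python
# Allowed values as listed on this page.
ALLOWED_RESOLUTIONS = ("1080p", "1440p", "2160p")
ALLOWED_DURATIONS = (6, 8, 10)  # seconds
ALLOWED_FPS = (25, 50)

# Endpoint identifier as shown at the top of this page.
ENDPOINT_ID = "fal-ai/ltx-2/image-to-video"


def build_arguments(image_url, resolution="1080p", duration=6, fps=25):
    """Validate the options and return a request dictionary.

    The key names are hypothetical; the real API may name them differently.
    """
    if not image_url:
        raise ValueError("image_url must be a non-empty URL or data URI")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    if duration not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}")
    if fps not in ALLOWED_FPS:
        raise ValueError(f"fps must be one of {ALLOWED_FPS}")
    return {
        "image_url": image_url,
        "resolution": resolution,
        "duration": duration,
        "fps": fps,
    }


if __name__ == "__main__":
    args = build_arguments("https://example.com/still.png", "1440p", 8, 50)
    print(ENDPOINT_ID, args)
```

The resulting dictionary would then be handed to whichever HTTP or SDK client is used to call the endpoint.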

Technical Specifications

Spec | Details
Architecture | LTX Video 2.0
Input Formats | PNG, JPEG, WebP, AVIF, HEIF images via URL or base64 data URI
Output Formats | MP4 video with integrated audio
Resolution Options | 1080p (1920×1080), 1440p (2560×1440), 2160p (3840×2160)
Aspect Ratio | 16:9
License | Commercial use permitted under partnership terms
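
For the base64 data URI input path mentioned above, a local image can be encoded with the Python standard library alone. The sketch below is illustrative: the helper name and the extension-to-MIME mapping are assumptions, and any size limit the service places on data URIs is not stated on this page.

```python
import base64
import os

# MIME types for the input formats listed on this page.
MIME_TYPES = {
    ".png": "image/png",
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".webp": "image/webp",
    ".avif": "image/avif",
    ".heif": "image/heif",
}


def to_data_uri(image_bytes: bytes, filename: str) -> str:
    """Encode raw image bytes as a base64 data URI, using the file extension
    to pick the MIME type."""
    suffix = os.path.splitext(filename)[1].lower()
    if suffix not in MIME_TYPES:
        raise ValueError(f"unsupported image extension: {suffix!r}")
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{MIME_TYPES[suffix]};base64,{encoded}"


if __name__ == "__main__":
    # Placeholder bytes; in practice read them from disk with open(path, "rb").
    print(to_data_uri(b"abc", "example.png"))
```

A hosted URL avoids inflating the request body, since base64 encoding adds roughly one third to the payload size.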


How It Stacks Up

LTX Video 2.0 Fast ($0.03-$0.12/second) – LTX Video 2.0 Pro costs twice as much and generates more slowly, in exchange for higher-fidelity output and integrated audio synthesis. The Fast variant prioritizes rapid iteration for prototyping workflows where audio isn't critical.

Kling Video v2.6 Pro Image to Video – LTX Video 2.0 Pro emphasizes audio-visual integration through native sound synthesis, while Kling v2.6 Pro focuses on extended duration capabilities and cinematic motion quality for visual-first productions.

Pixverse Image to Video – LTX Video 2.0 Pro stands out for its multi-resolution output tiers and synchronized audio generation. Pixverse prioritizes consistent style preservation across frames for character-driven content where visual coherence matters more than audio integration.