LTX Video 2.0 Pro Image to Video
Input
Hint: Drag and drop image files from your computer, images from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: jpg, jpeg, png, webp, gif, avif

Customize your input with more control.
Result
What would you like to do next?
Your request will cost $0.06 per second for 1080p, $0.12 per second for 1440p or $0.24 per second for 2160p.
Logs
LTX Video 2.0 Pro | [image-to-video]
Lightricks' LTX Video 2.0 delivers high-fidelity image-to-video generation with audio synthesis at $0.06-$0.24 per second depending on resolution. Trading raw speed for production-grade quality, it targets creators who need broadcast-ready outputs with synchronized audio rather than rapid prototyping iterations. Built for professional video workflows where visual fidelity and audio integration justify the cost premium.
Use Cases: Social Media Content Creation | Marketing Video Production | Creative Storytelling
Performance
LTX Video 2.0 Pro positions at the premium end of image-to-video generation, trading cost efficiency for integrated audio and multi-resolution output flexibility.
| Metric | Result | Context |
|---|---|---|
| Resolution Options | 1080p, 1440p, 2160p | Three quality tiers with proportional pricing |
| Duration Range | 6-10 seconds | Configurable output length in 2-second increments |
| Cost per Second | $0.06 (1080p), $0.12 (1440p), $0.24 (2160p) | Resolution-based pricing: 16.67 seconds per $1 at 1080p, 8.33 seconds at 1440p, 4.17 seconds at 2160p |
| Frame Rate Options | 25 fps, 50 fps | Standard or high-frame-rate output |
| Audio Generation | Integrated | Synchronized audio synthesis included by default |
| Related Endpoints | LTX Video 2.0 Fast | Speed-optimized variant for faster iteration at half the cost ($0.03-$0.12/second) |
Production-Ready Video with Integrated Audio
LTX Video 2.0 Pro differentiates through native audio synthesis rather than treating it as a separate post-processing step. Where most image-to-video models output silent clips requiring manual audio layering, this architecture generates synchronized soundscapes during video creation, eliminating workflow friction for content creators.
What this means for you:
- Multi-resolution flexibility: Generate at 1080p for social media, 1440p for web content, or 2160p for broadcast-quality output without switching models
- Synchronized audio synthesis: Audio generation happens inline with video creation, matching visual cues and motion automatically
- Configurable duration control: Choose 6, 8, or 10-second outputs based on platform requirements and budget constraints
- Variable frame rate support: Standard 25 fps for most use cases or 50 fps for smooth motion in action sequences
Technical Specifications
| Spec | Details |
|---|---|
| Architecture | LTX Video 2.0 |
| Input Formats | PNG, JPEG, WebP, AVIF, HEIF images via URL or base64 data URI |
| Output Formats | MP4 video with integrated audio |
| Resolution Options | 1080p (1920×1080), 1440p (2560×1440), 2160p (3840×2160) |
| Aspect Ratio | 16:9 |
| License | Commercial use permitted under partnership terms |
API Documentation | Quickstart Guide | Enterprise Pricing
How It Stacks Up
LTX Video 2.0 Fast ($0.03-$0.12/second) – LTX Video 2.0 Pro trades generation speed for higher fidelity output and integrated audio synthesis at 2x the cost. The Fast variant prioritizes rapid iteration for prototyping workflows where audio isn't critical.
Kling Video v2.6 Pro Image to Video – LTX Video 2.0 Pro emphasizes audio-visual integration through native sound synthesis, while Kling v2.6 Pro focuses on extended duration capabilities and cinematic motion quality for visual-first productions.
Pixverse Image to Video – LTX Video 2.0 Pro differentiates through multi-resolution output tiers and synchronized audio generation. Pixverse prioritizes consistent style preservation across frames for character-driven content where visual coherence matters more than audio integration.