Model Gallery
Veo 3
Veo 3 by Google, the most advanced AI video generation model in the world. Now available at fal with sound on!
Kling 2.1 Master
Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.
Search trends
Featured Models
Check out some of our most popular models
MiniMax Hailuo-02 Image To Video API (Standard, 768p): Advanced image-to-video generation model with 768p resolution
Seedance 1.0 Pro, a high quality video generation model developed by Bytedance.
Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.
Search Results
40 models found
Generate high quality video clips from text and image prompts using PixVerse v4.5
Generate video clips from your images using Kling 2.0 Master
Wan Effects generates high-quality videos with popular effects from images
Wan-2.1 Pro is a premium image-to-video model that generates high-quality 1080p videos at 30fps with up to 6 seconds duration, delivering exceptional visual quality and motion diversity from images
Veo 2 creates videos from images with realistic motion and very high quality output.
Wan-2.1 is a image-to-video model that generates high-quality videos with high visual quality and motion diversity from images
Generate video clips from your images using Kling 1.6 (pro)
Generate video clips from your images using MiniMax Video model
Kling 2.1 Standard is a cost-efficient endpoint for the Kling 2.1 model, delivering high-quality image-to-video generation
MultiTalk model generates a talking avatar video from an image and text. Converts text to speech automatically, then generates the avatar speaking with lip-sync.
MultiTalk model generates a talking avatar video from an image and audio file. The avatar lip-syncs to the provided audio with natural facial expressions.
MultiTalk model generates a multi-person conversation video from an image and text inputs. Converts text to speech for each person, generating a realistic conversation scene.
MultiTalk model generates a multi-person conversation video from an image and audio files. Creates a realistic scene where multiple people speak in sequence.
MiniMax Hailuo-02 Image To Video API (Pro, 1080p): Advanced image-to-video generation model with 1080p resolution
Seedance 1.0 Lite
Phantom is a unified video generation framework for single and multi-subject references, built on existing text-to-video and image-to-video architectures.
HunyuanAvatar is a High-Fidelity Audio-Driven Human Animation model for Multiple Characters .
Kling 2.1 Pro is an advanced endpoint for the Kling 2.1 model, offering professional-grade videos with enhanced visual fidelity, precise camera movements, and dynamic motion control, perfect for cinematic storytelling.
HunyuanPortrait is a diffusion-based framework for generating lifelike, temporally consistent portrait animations.
Generate video clips from your multiple image references using Kling 1.6 (standard)
Generate video clips from your multiple image references using Kling 1.6 (pro)
Generate videos from prompts and images using LTX Video-0.9.7 13B Distilled and custom LoRA
Generate videos from prompts and images using LTX Video-0.9.7 13B and custom LoRA
Generate videos from prompts and images using LTX Video-0.9.7 and custom LoRA
Create seamless transition between images using PixVerse v4.5
Generate fast high quality video clips from text and image prompts using PixVerse v4.5
Generate high quality video clips with different effects using PixVerse v4.5
HunyuanCustom revolutionizes video generation with unmatched identity consistency across multiple input types. Its innovative fusion modules and alignment networks outperform competitors, maintaining subject integrity while responding flexibly to text, image, audio, and video conditions.
Framepack is an efficient Image-to-video model that autoregressively generates videos.
Vidu Q1 Start-End to Video generates smooth transition 1080p videos between specified start and end images.
Vidu Q1 Image to Video generates high-quality 1080p videos with exceptional visual quality and motion diversity from a single image
MAGI-1 generates videos from images with exceptional understanding of physical interactions and prompting
Generate high quality video clips with different effects using PixVerse v4
MAGI-1 distilled generates videos faster from images with exceptional understanding of physical interactions and prompting
Framepack is an efficient Image-to-video model that autoregressively generates videos.
Wan-2.1 flf2v generates dynamic videos by intelligently bridging a given first frame to a desired end frame through smooth, coherent motion sequences.
Framepack is an efficient Image-to-video model that autoregressively generates videos.