Model Gallery
Veo 3
Veo 3 by Google, the most advanced AI video generation model in the world. Now available at fal with sound on!
Kling 2.1 Master
Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.
Featured Models
Check out some of our most popular models
MiniMax Hailuo-02 Image To Video API (Standard, 768p): Advanced image-to-video generation model with 768p resolution
Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.
Generate video clips from your images using Kling 2.0 Master
Search Results
79 models found
Wan Effects generates high-quality videos with popular effects from images
Wan-2.1 Pro is a premium image-to-video model that generates 1080p videos from images at 30 fps, up to 6 seconds long, with exceptional visual quality and motion diversity
Veo 2 creates videos from images with realistic motion and very high quality output.
Wan-2.1 is an image-to-video model that generates videos with high visual quality and motion diversity from images
Generate video clips from your images using Kling 1.6 (pro)
Generate video clips from your images using the MiniMax Video model
Seedance 1.0 Pro, a high-quality video generation model developed by ByteDance.
Kling 2.1 Standard is a cost-efficient endpoint for the Kling 2.1 model, delivering high-quality image-to-video generation
Generate high quality video clips from text and image prompts using PixVerse v4.5
Generate video clips from your multiple image references using Vidu Q1
MultiTalk model generates a talking avatar video from an image and text. Converts text to speech automatically, then generates the avatar speaking with lip-sync.
MultiTalk model generates a talking avatar video from an image and audio file. The avatar lip-syncs to the provided audio with natural facial expressions.
MultiTalk model generates a multi-person conversation video from an image and text inputs. Converts text to speech for each person, generating a realistic conversation scene.
MultiTalk model generates a multi-person conversation video from an image and audio files. Creates a realistic scene where multiple people speak in sequence.
MiniMax Hailuo-02 Image To Video API (Pro, 1080p): Advanced image-to-video generation model with 1080p resolution
Seedance 1.0 Lite
Phantom is a unified video generation framework for single and multi-subject references, built on existing text-to-video and image-to-video architectures.
HunyuanAvatar is a high-fidelity, audio-driven human animation model for multiple characters.
Kling 2.1 Pro is an advanced endpoint for the Kling 2.1 model, offering professional-grade videos with enhanced visual fidelity, precise camera movements, and dynamic motion control, perfect for cinematic storytelling.
HunyuanPortrait is a diffusion-based framework for generating lifelike, temporally consistent portrait animations.
Generate video clips from your multiple image references using Kling 1.6 (standard)
Generate video clips from your multiple image references using Kling 1.6 (pro)
Generate videos from prompts and images using LTX Video-0.9.7 13B Distilled and custom LoRA
Generate videos from prompts and images using LTX Video-0.9.7 13B and custom LoRA
Generate videos from prompts and images using LTX Video-0.9.7 and custom LoRA
Create seamless transitions between images using PixVerse v4.5
Quickly generate high-quality video clips from text and image prompts using PixVerse v4.5
Generate high quality video clips with different effects using PixVerse v4.5
HunyuanCustom revolutionizes video generation with unmatched identity consistency across multiple input types. Its innovative fusion modules and alignment networks outperform competitors, maintaining subject integrity while responding flexibly to text, image, audio, and video conditions.
Framepack is an efficient image-to-video model that autoregressively generates videos.
Vidu Q1 Start-End to Video generates smooth transition 1080p videos between specified start and end images.
Vidu Q1 Image to Video generates high-quality 1080p videos with exceptional visual quality and motion diversity from a single image
MAGI-1 generates videos from images with exceptional understanding of physical interactions and prompting
Generate high quality video clips with different effects using PixVerse v4
MAGI-1 distilled generates videos faster from images with exceptional understanding of physical interactions and prompting
Framepack is an efficient image-to-video model that autoregressively generates videos.
Wan-2.1 flf2v generates dynamic videos by intelligently bridging a given first frame to a desired end frame through smooth, coherent motion sequences.
Framepack is an efficient image-to-video model that autoregressively generates videos.
Quickly generate high-quality video clips from text and image prompts using PixVerse v4
Generate high quality video clips from text and image prompts using PixVerse v4
Generate high quality video clips with different effects using PixVerse v3.5
Create seamless transitions between images using PixVerse v3.5
Ray2 Flash is a fast video generative model capable of creating realistic visuals with natural, coherent motion.
Pika v2 Turbo creates videos from images with high quality output.
Pika v2.2 creates videos from images with high quality output.
Pika Scenes v2.2 creates videos from images with high quality output.
Pika v2.1 creates videos from images with high quality output.
Pika Effects are AI-powered video effects designed to modify objects, characters, and environments in a fun, engaging, and visually compelling manner.
Vidu Image to Video generates high-quality videos with exceptional visual quality and motion diversity from a single image
Vidu Start-End to Video generates smooth transition videos between specified start and end images.
Vidu Reference to Video creates videos by combining reference images with a prompt.
Vidu Template to Video lets you create different effects by applying motion templates to your images.
Add custom LoRAs to Wan-2.1, an image-to-video model that generates videos with high visual quality and motion diversity from images
Image to Video for the high-quality Hunyuan Video I2V model.
Generate video clips that follow the initial image and natural language descriptions more accurately, with camera movement instructions for shot control.
SkyReels V1 is the first and most advanced open-source human-centric video foundation model, built by fine-tuning HunyuanVideo on O(10M) high-quality film and television clips.
Ray2 is a large-scale video generative model capable of creating realistic visuals with natural, coherent motion.
Image to Video for the Hunyuan Video model using a custom trained LoRA.
Generate high quality video clips from text and image prompts quickly using PixVerse v3.5 Fast
Generate high quality video clips from text and image prompts using PixVerse v3.5
Generate video clips maintaining consistent, realistic facial features and identity across dynamic video content
Generate video clips from your images using Kling 1.6 (std)
Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Generate video clips from your images using the MiniMax Video model
Generate videos from images using LTX Video
Generate videos from images and prompts using CogVideoX-5B
Generate video clips from your images using Kling 1.0 (pro)
Generate video clips from your images using Kling 1.5 (pro)
Generate video clips from your images using Kling 1.0
Generate short video clips from your images using SVD v1.1
Interpolate between image frames
Transfer expression from a video to a portrait.
Generate video clips from your images using Luma Dream Machine v1.5
MuseTalk is a real-time high quality audio-driven lip-syncing model. Use MuseTalk to animate a face with your own audio.
Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Generate short video clips from your images using SVD v1.1 at Lightning Speed