Model Gallery

See all available model APIs provided by fal.ai
Text to Video

Veo 3

Veo 3 by Google, the most advanced AI video generation model in the world. Now available at fal with sound on!

Kling 2.1 Master

Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Category

All Models

Explore all available models provided by fal.ai

background texture
fal-ai/wan-pro/image-to-video
image-to-video

Wan-2.1 Pro is a premium image-to-video model that generates high-quality 1080p videos at 30fps with up to 6 seconds duration, delivering exceptional visual quality and motion diversity from images

image to video
motion
background texture
fal-ai/wan-i2v
image-to-video

Wan-2.1 is a image-to-video model that generates high-quality videos with high visual quality and motion diversity from images

image to video
motion
background texture
fal-ai/kling-video/v1.6/pro/image-to-video
image-to-video

Generate video clips from your images using Kling 1.6 (pro)

background texture
fal-ai/flux-lora-fast-training
training

Train styles, people and other subjects at blazing speeds.

lora
personalization
background texture
fal-ai/playai/tts/dialog
text-to-audio

Generate natural-sounding multi-speaker dialogues, and audio. Perfect for expressive outputs, storytelling, games, animations, and interactive media.

audio
background texture
fal-ai/flux-pro/v1.1-ultra
text-to-image

FLUX1.1 [pro] ultra is the newest version of FLUX1.1 [pro], maintaining professional-grade image quality while delivering up to 2K resolution with improved photo realism.

high-res
realism
background texture
fal-ai/recraft/v3/text-to-image
text-to-image

Recraft V3 is a text-to-image model with the ability to generate long texts, vector art, images in brand style, and much more. As of today, it is SOTA in image generation, proven by Hugging Face's industry-leading Text-to-Image Benchmark by Artificial Analysis.

vector
typography
style
background texture
fal-ai/minimax/video-01/image-to-video
image-to-video

Generate video clips from your images using MiniMax Video model

motion
transformation
background texture
fal-ai/flux-kontext-trainer
training

LoRA trainer for FLUX.1 Kontext [dev]

new
background texture
fal-ai/minimax/hailuo-02/standard/text-to-video
text-to-video

MiniMax Hailuo-02 Text To Video API (Standard, 768p): Advanced video generation model with 768p resolution

new
background texture
fal-ai/bytedance/seedance/v1/pro/image-to-video
image-to-video

Seedance 1.0 Pro, a high quality video generation model developed by Bytedance.

new
background texture
fal-ai/kling-video/v2.1/standard/image-to-video
image-to-video

Kling 2.1 Standard is a cost-efficient endpoint for the Kling 2.1 model, delivering high-quality image-to-video generation

background texture
fal-ai/tavus/hummingbird-lipsync/v0
video-to-video

Generate lip sync using Tavus' state-of-the-art model for high-quality synchronization.

background texture
fal-ai/kling-video/v2/master/text-to-video
text-to-video

Generate video clips from your prompts using Kling 2.0 Master

background texture
fal-ai/hidream-i1-full
text-to-image

HiDream-I1 full is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.

background texture
fal-ai/hidream-i1-dev
text-to-image

HiDream-I1 dev is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.

background texture
fal-ai/hidream-i1-fast
text-to-image

HiDream-I1 fast is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within 16 steps.

background texture
fal-ai/flux/dev
text-to-image

FLUX.1 [dev] is a 12 billion parameter flow transformer that generates high-quality images from text. It is suitable for personal and commercial use.

background texture
fal-ai/mmaudio-v2
video-to-video

MMAudio generates synchronized audio given video and/or text inputs. It can be combined with video models to get videos with audio.

ai video
fast
background texture
fal-ai/ideogram/v2
text-to-image

Generate high-quality images, posters, and logos with Ideogram V2. Features exceptional typography handling and realistic outputs optimized for commercial and creative use.

realism
typography
background texture
fal-ai/flux-lora-portrait-trainer
training

FLUX LoRA training optimized for portrait generation, with bright highlights, excellent prompt following and highly detailed results.

lora
personalization
background texture
fal-ai/stable-diffusion-v35-large
text-to-image

Stable Diffusion 3.5 Large is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.

diffusion
typography
style
background texture
fal-ai/flux-lora/inpainting
text-to-image

Super fast endpoint for the FLUX.1 [dev] inpainting model with LoRA support, enabling rapid and high-quality image inpaingting using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora
personalization
background texture
fal-ai/flux-general
text-to-image

A versatile endpoint for the FLUX.1 [dev] model that supports multiple AI extensions including LoRA, ControlNet conditioning, and IP-Adapter integration, enabling comprehensive control over image generation through various guidance methods.

lora
controlnet
ip-adapter
background texture
fal-ai/flux-lora
text-to-image

Super fast endpoint for the FLUX.1 [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora
personalization
background texture
fal-ai/flux/dev/image-to-image
image-to-image

FLUX.1 Image-to-Image is a high-performance endpoint for the FLUX.1 [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
background texture
fal-ai/aura-sr
image-to-image

Upscale your images with AuraSR.

upscaling
high-res
background texture
fal-ai/clarity-upscaler
image-to-image

Clarity upscaler for upscaling images with high very fidelity.

upscaling
background texture
fal-ai/vidu/q1/reference-to-video
image-to-video

Generate video clips from your multiple image references using Vidu Q1

new
stylized
transform
background texture
fal-ai/bria/reimagine
image-to-image

Structure Reference allows generating new images while preserving the structure of an input image, guided by text prompts. Perfect for transforming sketches, illustrations, or photos into new illustrations. Trained exclusively on licensed data for safe and risk-free commercial use.

new
background texture
fal-ai/pixverse/sound-effects
video-to-video

Add immersive sound effects and background music to your videos using PixVerse sound effects generation

new
audio
utility
background texture
fal-ai/image-editing/realism
image-to-image

Add details to faces, enhance face features, remove blur.

new
stylized
transform
realism
background texture
fal-ai/thinksound/audio
video-to-video

Generate realistic audio from a video with an optional text prompt

new
audio-generation
video-to-audio
background texture
fal-ai/thinksound
video-to-video

Generate realistic audio for a video with an optional text prompt and combine

new
audio-generation
video-to-audio
background texture
fal-ai/post-processing/vignette
image-to-image

Add a darkening vignette effect around the edges of the image with adjustable strength

new
stylized
transform
background texture
fal-ai/post-processing/solarize
image-to-image

Apply solarization effect by inverting pixel values above a threshold

new
stylized
transform
background texture
fal-ai/post-processing/sharpen
image-to-image

Apply sharpening effects with three modes: basic unsharp mask, smart sharpening with edge preservation, and Contrast Adaptive Sharpening (CAS).

new
stylized
transform
background texture
fal-ai/post-processing/parabolize
image-to-image

Apply a parabolic distortion effect with configurable coefficient and vertex position.

new
stylized
transform
background texture
fal-ai/post-processing/grain
image-to-image

Apply film grain effect with different styles (modern, analog, kodak, fuji, cinematic, newspaper) and customizable intensity and scale

new
stylized
transform
background texture
fal-ai/post-processing/dodge-burn
image-to-image

Apply dodge and burn effects with multiple modes and adjustable intensity.

new
stylized
transform
background texture
fal-ai/post-processing/dissolve
image-to-image

Blend two images together using smooth linear interpolation with a configurable blend factor.

new
stylized
transform
background texture
fal-ai/post-processing/desaturate
image-to-image

Reduce color saturation using different methods (luminance Rec.709, luminance Rec.601, average, lightness) with adjustable factor.

new
stylized
transform
background texture
fal-ai/post-processing/color-tint
image-to-image

Apply various color tints (sepia, red, green, blue, cyan, magenta, yellow, purple, orange, warm, cool, lime, navy, vintage, rose, teal, maroon, peach, lavender, olive) with adjustable strength.

new
stylized
transform
background texture
fal-ai/post-processing/color-correction
image-to-image

Adjust color temperature, brightness, contrast, saturation, and gamma values for color correction.

new
stylized
transform
background texture
fal-ai/post-processing/chromatic-aberration
image-to-image

Create chromatic aberration by shifting red, green, and blue channels horizontally or vertically with customizable shift amounts.

new
stylized
transform
background texture
fal-ai/post-processing/blur
image-to-image

Apply Gaussian or Kuwahara blur effects with adjustable radius and sigma parameters

new
stylized
transform
background texture
fal-ai/pixverse/extend/fast
video-to-video

PixVerse Extend model is a video extending tool for your videos using with high-quality video extending techniques

new
utility
editing
background texture
fal-ai/pixverse/extend
video-to-video

PixVerse Extend model is a video extending tool for your videos using with high-quality video extending techniques

new
utility
editing
background texture
fal-ai/pixverse/lipsync
video-to-video

Generate realistic lipsync animations from audio using advanced algorithms for high-quality synchronization with PixVerse Lipsync model

new
animation
lip sync
background texture
fal-ai/image-editing/youtube-thumbnails
image-to-image

Generate YouTube thumbnails with custom text

new
stylized
transform
background texture
bria/video/background-removal
video-to-video

Automatically remove backgrounds from videos -perfect for creating clean, professional content without a green screen.

new
background-removal
background texture
fal-ai/luma-dream-machine/ray-2/modify
video-to-video

Ray2 Modify is a video generative model capable of restyling or retexturing the entire shot, from turning live-action into CG or stylized animation, to changing wardrobe, props, or the overall aesthetic and swap environments or time periods, giving you control over background, location, or even weather.

new
modify
restyle
background texture
fal-ai/bytedance/seededit/v3/edit-image
image-to-image

SeedEdit 3.0 is an image editing model independently developed by ByteDance. It excels in accurately following editing instructions and effectively preserving image content, especially excelling in handling real images

new
image-editing
image-to-image
background texture
fal-ai/image-editing/broccoli-haircut
image-to-image

Transform your character's hair into broccoli style while keeping the original characters likeness

new
stylized
transform
background texture
fal-ai/image-editing/wojak-style
image-to-image

Transform your photos into wojak style while keeping the original characters likeness

new
stylized
transform
background texture
fal-ai/image-editing/plushie-style
image-to-image

Transform your photos into cool plushies while keeping the original characters likeness

new
stylized
transform
background texture
fal-ai/flux-kontext/dev
image-to-image

Frontier image editing model.

new
background texture
fal-ai/flux-kontext-lora/image-to-image
image-to-image

Super fast image-to-image endpoint for the FLUX.1 Kontext [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

new
image-to-image
background texture
fal-ai/flux-kontext-lora/text-to-image
text-to-image

Super fast text-to-image endpoint for the FLUX.1 Kontext [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

new
text-to-image
background texture
fal-ai/flux-kontext-lora
image-to-image

Fast endpoint for the FLUX.1 Kontext [dev] model with LoRA support, enabling rapid and high-quality image editing using pre-trained LoRA adaptations for specific styles, brand identities, and product-specific outputs.

new
image-editing
image-to-image
background texture
fal-ai/omnigen-v2
text-to-image

OmniGen is a unified image generation model that can generate a wide range of images from multi-modal prompts. It can be used for various tasks such as Image Editing, Personalized Image Generation, Virtual Try-On, Multi Person Generation and more!

new
multimodal
editing
try-on
background texture
fal-ai/fashn/tryon/v1.6
image-to-image

FASHN v1.6 delivers precise virtual try-on capabilities, accurately rendering garment details like text and patterns at 864x1296 resolution from both on-model and flat-lay photo references.

new
try-on
fashion
clothing
background texture
fal-ai/ai-avatar/single-text
image-to-video

MultiTalk model generates a talking avatar video from an image and text. Converts text to speech automatically, then generates the avatar speaking with lip-sync.

new
stylized
transform
background texture
fal-ai/ai-avatar
image-to-video

MultiTalk model generates a talking avatar video from an image and audio file. The avatar lip-syncs to the provided audio with natural facial expressions.

new
stylized
transform
background texture
fal-ai/ai-avatar/multi-text
image-to-video

MultiTalk model generates a multi-person conversation video from an image and text inputs. Converts text to speech for each person, generating a realistic conversation scene.

new
stylized
transform
background texture
fal-ai/ai-avatar/multi
image-to-video

MultiTalk model generates a multi-person conversation video from an image and audio files. Creates a realistic scene where multiple people speak in sequence.

new
stylized
transform
background texture
fal-ai/video-understanding
vision

A video understanding model to analyze video content and answer questions about what's happening in the video based on user prompts.

new
utility
vision
background texture
fal-ai/wan-vace-14b/reframe
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
reframe
background texture
fal-ai/wan-vace-14b/outpainting
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/wan-vace-14b/inpainting
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/wan-vace-14b/pose
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/wan-vace-14b/depth
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/chain-of-zoom
image-to-image

Extreme Super-Resolution via Scale Autoregression and Preference Alignment

new
background texture
tripo3d/tripo/v2.5/multiview-to-3d
image-to-3d

State of the art Multiview to 3D Object generation

new
stylized
multiview
background texture
fal-ai/minimax/hailuo-02/pro/image-to-video
image-to-video

MiniMax Hailuo-02 Image To Video API (Pro, 1080p): Advanced image-to-video generation model with 1080p resolution

new
background texture
fal-ai/minimax/hailuo-02/pro/text-to-video
text-to-video

MiniMax Hailuo-02 Text To Video API (Pro, 1080p): Advanced video generation model with 1080p resolution

new
background texture
fal-ai/pasd
image-to-image

Pixel-Aware Diffusion Model for Realistic Image Super-Resolution and Personalized Stylization

new
utility
editing
background texture
bria/text-to-image/3.2
text-to-image

Bria’s Text-to-Image model, trained exclusively on licensed data for safe and risk-free commercial use. Excels in Text-Rendering and Aesthetics.

new
image generation
background texture
fal-ai/object-removal/bbox
image-to-image

Removes box-selected objects and their visual effects, seamlessly reconstructing the scene with contextually appropriate content.

new
utility
editing
background texture
fal-ai/object-removal/mask
image-to-image

Removes mask-selected objects and their visual effects, seamlessly reconstructing the scene with contextually appropriate content.

new
utility
editing
background texture
fal-ai/object-removal
image-to-image

Removes objects and their visual effects using natural language, replacing them with contextually appropriate content

new
utility
editing
background texture
fal-ai/bytedance/seedance/v1/pro/text-to-video
text-to-video

Seedance 1.0 Pro, a high quality video generation model developed by Bytedance.

new
background texture
fal-ai/hunyuan3d-v21
image-to-3d

Hunyuan3D-2.1 is a scalable 3D asset creation system that advances state-of-the-art 3D generation through Physically-Based Rendering (PBR).

new
image-to-3d
background texture
fal-ai/bytedance/seedance/v1/lite/image-to-video
image-to-video

Seedance 1.0 Lite

new
background texture
fal-ai/bytedance/seedance/v1/lite/text-to-video
text-to-video

Seedance 1.0 Lite

new
background texture
fal-ai/recraft/vectorize
image-to-image

Converts a given raster image to SVG format using Recraft model.

new
stylized
transform
background texture
fal-ai/imagen4/preview/fast
text-to-image

Imagen 4's fast and cost-effective version. Best quality per $

new
background texture
fal-ai/wan-trainer/t2v
training

Train custom LoRAs for Wan-2.1 T2V 1.3B

new
lora
training
background texture
fal-ai/wan-trainer/t2v-14b
training

Train custom LoRAs for Wan-2.1 T2V 14B

new
lora
training
background texture
fal-ai/wan-trainer/i2v-720p
training

Train custom LoRAs for Wan-2.1 I2V 720P

new
lora
training
background texture
fal-ai/wan-trainer/flf2v-720p
training

Train custom LoRAs for Wan-2.1 FLF2V 720P

new
lora
training