Model Gallery

See all available model APIs provided by fal.ai
Can't find a model?Suggest a model

Kling V1.5

New Kling models are here! You can generate video clips from your prompts or images using Kling 1.5 (pro)

Featured Models

Check out some of our most popular models

fal-ai/flux-pro/v1.1-ultra
text-to-image

FLUX1.1 [pro] ultra is the newest version of FLUX1.1 [pro], maintaining professional-grade image quality while delivering up to 2K resolution with improved photo realism.

flux
high resolution
realism
fal-ai/flux-lora-fast-training
training

Train styles, people and other subjects at blazing speeds.

flux
lora
fast
fal-ai/flux-lora-portrait-trainer
training

FLUX LoRA training optimized for portrait generation, with bright highlights, excellent prompt following and highly detailed results.

flux
lora
finetuning
fal-ai/recraft-v3
text-to-image

Recraft V3 is a text-to-image model with the ability to generate long texts, vector art, images in brand style, and much more. As of today, it is SOTA in image generation, proven by Hugging Face’s industry-leading Text-to-Image Benchmark by Artificial Analysis.

image generation
vector art
typograph
fal-ai/minimax-video/image-to-video
image-to-video

Generate video clips from your images using MiniMax Video model

video generation
minimax
image to video
fal-ai/aura-flow
text-to-image

AuraFlow v0.3 is an open-source flow-based text-to-image generation model that achieves state-of-the-art results on GenEval. The model is currently in beta.

image generation
typograph
high quality
fal-ai/flux/dev/image-to-image
image-to-image

FLUX.1 Image-to-Image is a high-performance endpoint for the FLUX.1 [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

image generation
flux
dev
fal-ai/flux-lora
text-to-image

Super fast endpoint for the FLUX.1 [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

flux
dev
lora
fal-ai/flux-lora/inpainting
text-to-image

Super fast endpoint for the FLUX.1 [dev] inpainting model with LoRA support, enabling rapid and high-quality image inpaingting using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

flux
dev
lora

All Models

Explore all available models provided by fal.ai

background texture
fal-ai/hyper3d/rodin
image-to-3d

Rodin by Hyper3D generates realistic and production ready 3D models from text or images.

stylized
background texture
fal-ai/flux/dev
text-to-image

FLUX.1 [dev] is a 12 billion parameter flow transformer that generates high-quality images from text. It is suitable for personal and commercial use.

image generation
flux
dev
generation
high quality
fast
optimized
background texture
fal-ai/flux/schnell
text-to-image

FLUX.1 [schnell] is a 12 billion parameter flow transformer that generates high-quality images from text in 1 to 4 steps, suitable for personal and commercial use.

flux
schnell
high quality
optimized
fast
optimized
background texture
fal-ai/flux-subject
text-to-image

Super fast endpoint for the FLUX.1 [schnell] model with subject input capabilities, enabling rapid and high-quality image generation for personalization, specific styles, brand identities, and product-specific outputs.

flux
schnell
stylization
personalization
high quality
ipadapter
customization
background texture
fal-ai/flux/schnell/redux
image-to-image

FLUX.1 [schnell] Redux is a high-performance endpoint for the FLUX.1 [schnell] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux
schnell
image transformation
style transfer
high performance
fast
background texture
fal-ai/flux/dev/redux
image-to-image

FLUX.1 [dev] Redux is a high-performance endpoint for the FLUX.1 [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux
dev
image transformation
style transfer
lora
background texture
fal-ai/flux-pro/v1/redux
image-to-image

FLUX.1 [pro] Redux is a high-performance endpoint for the FLUX.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux
pro
image transformation
style transfer
background texture
fal-ai/flux-pro/v1.1/redux
image-to-image

FLUX1.1 [pro] Redux is a high-performance endpoint for the FLUX1.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux
pro
image transformation
style transfer
enhanced
background texture
fal-ai/flux-pro/v1.1-ultra/redux
image-to-image

FLUX1.1 [pro] ultra Redux is a high-performance endpoint for the FLUX1.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux
pro ultra
image transformation
style transfer
high resolution
background texture
fal-ai/flux-pro/v1/fill
image-to-image

FLUX.1 [pro] Fill is a high-performance endpoint for the FLUX.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux
pro
image editing
inpainting
high performance
background texture
fal-ai/flux-pro/v1/canny
image-to-image

Utilize Flux.1 [pro] Controlnet to generate high-quality images with precise control over composition, style, and structure through advanced edge detection and guidance mechanisms.

flux
pro
controlnet
edge detection
image editing
composition control
image conditioning
background texture
fal-ai/flux-pro/v1/depth
image-to-image

Generate high-quality images from depth maps using Flux.1 [pro] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.

flux
pro
depth estimation
3d visualization
utility
high quality
image conditioning
background texture
fal-ai/flux-lora-canny
image-to-image

Utilize Flux.1 [dev] Controlnet to generate high-quality images with precise control over composition, style, and structure through advanced edge detection and guidance mechanisms.

flux
dev
controlnet
edge detection
lora
image editing
image conditioning
background texture
fal-ai/flux-lora-depth
image-to-image

Generate high-quality images from depth maps using Flux.1 [dev] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.

flux
dev
depth estimation
3d visualization
lora
utility
image conditioning
background texture
fal-ai/flux-pro/v1.1
text-to-image

FLUX1.1 [pro] is an enhanced version of FLUX.1 [pro], improved image generation capabilities, delivering superior composition, detail, and artistic fidelity compared to its predecessor.

flux
pro
enhanced
high quality
composition
artistic fidelity
premium quality
background texture
fal-ai/flux-pro/new
text-to-image

FLUX.1 [pro] new is an accelerated version of FLUX.1 [pro], maintaining professional-grade image quality while delivering significantly faster generation speeds.

flux
pro
text to image
high quality
accelerated
fast
background texture
fal-ai/sana
text-to-image

Sana can synthesize high-resolution, high-quality images with strong text-image alignment at a remarkably fast speed, with the ability to generate 4K images in less than a second.

optimized
background texture
fal-ai/omnigen-v1
text-to-image

OmniGen is a unified image generation model that can generate a wide range of images from multi-modal prompts. It can be used for various tasks such as Image Editing, Personalized Image Generation, Virtual Try-On, Multi Person Generation and more!

multimodal
image editing
personalized generation
virtual try-On
multi-person generation
background texture
fal-ai/stable-diffusion-v35-large
text-to-image

Stable Diffusion 3.5 Large is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.

stable diffusion
typograph
high quality
composition
style
background texture
fal-ai/stable-diffusion-v35-medium
text-to-image

Stable Diffusion 3.5 Medium is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.

stable diffusion
typograph
high quality
composition
style
background texture
fal-ai/recraft-v3/create-style
training

Recraft V3 Create Style is capable of creating unique styles for Recraft V3 based on your images.

recraft
style training
vector art
personalization
finetuning
background texture
fal-ai/flux-realism
text-to-image

FLUX Realism LoRA is a specialized fine-tuning adaptation that enhances FLUX models to produce hyper-realistic images with exceptional detail, accurate lighting, and true-to-life textures. Optimized for photographic quality and real-world accuracy.

flux
dev
lora
realism
finetuning
high detail
photorealistic
background texture
fal-ai/flux-lora-fill
image-to-image

FLUX.1 [dev] Fill is a high-performance endpoint for the FLUX.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

flux
dev
image editing
inpainting
high performance
lora
background texture
fal-ai/flux-lora/image-to-image
image-to-image

FLUX LoRA Image-to-Image is a high-performance endpoint that transforms existing images using FLUX models, leveraging LoRA adaptations to enable rapid and precise image style transfer, modifications, and artistic variations.

flux
dev
lora
image transformation
style transfer
high performance
background texture
fal-ai/flux-general
text-to-image

A versatile endpoint for the FLUX.1 [dev] model that supports multiple AI extensions including LoRA, ControlNet conditioning, and IP-Adapter integration, enabling comprehensive control over image generation through various guidance methods.

flux
dev
lora
controlnet
ip-adapter
multi-extension support
reference only
background texture
fal-ai/flux-general/inpainting
image-to-image

FLUX General Inpainting is a versatile endpoint that enables precise image editing and completion, supporting multiple AI extensions including LoRA, ControlNet, and IP-Adapter for enhanced control over inpainting results and sophisticated image modifications.

flux
dev
lora
controlnet
ip-adapter
inpainting
background texture
fal-ai/flux-general/image-to-image
image-to-image

FLUX General Image-to-Image is a versatile endpoint that transforms existing images with support for LoRA, ControlNet, and IP-Adapter extensions, enabling precise control over style transfer, modifications, and artistic variations through multiple guidance methods.

flux
dev
lora
controlnet
ip-adapter
image transformation
background texture
fal-ai/flux-general/differential-diffusion
image-to-image

A specialized FLUX endpoint combining differential diffusion control with LoRA, ControlNet, and IP-Adapter support, enabling precise, region-specific image transformations through customizable change maps.

flux
dev
differential diffusion
lora
controlnet
ip-adapter
background texture
fal-ai/flux-general/rf-inversion
image-to-image

A general purpose endpoint for the FLUX.1 [dev] model, implementing the RF-Inversion pipeline. This can be used to edit a reference image based on a prompt.

flux
dev
rf-inversion
image editing
prompt based editing
lora
background texture
fal-ai/flux-pulid
image-to-image

An endpoint for personalized image generation using Flux as per given description.

flux
dev
pulid
personalized generation
image based
high quality
finetuning
background texture
fal-ai/iclight-v2
image-to-image

An endpoint for re-lighting photos and changing their backgrounds per a given description

flux
dev
image relighting
background change
image enhancement
photo editing
background texture
fal-ai/flux-differential-diffusion
image-to-image

FLUX.1 Differential Diffusion is a rapid endpoint that enables swift, granular control over image transformations through change maps, delivering fast and precise region-specific modifications while maintaining FLUX.1 [dev]'s high-quality output.

flux
dev
differential diffusion
fast
granular control
image transformation
background texture
fal-ai/stable-diffusion-v3-medium
text-to-image

Stable Diffusion 3 Medium (Text to Image) is a Multimodal Diffusion Transformer (MMDiT) model that improves image quality, typography, prompt understanding, and efficiency.

stable diffusion
diffusion model
image generation
optimized
composition
style
background texture
fal-ai/stable-diffusion-v3-medium/image-to-image
image-to-image

Stable Diffusion 3 Medium (Image to Image) is a Multimodal Diffusion Transformer (MMDiT) model that improves image quality, typography, prompt understanding, and efficiency.

stable diffusion
diffusion model
image editing
optimized
composition
style
background texture
fal-ai/fast-sdxl
text-to-image

Run SDXL at the speed of light

stable diffusion
lora
embeddings
high resolution
fast
text to image
composition
style
background texture
fal-ai/lora
text-to-image

Run Any Stable Diffusion model with customizable LoRA weights.

stable diffusion
lora
stylization
customization
text to image
finetuning
background texture
fal-ai/aura-sr
image-to-image

Upscale your images with AuraSR.

upscaling
high Fidelity
image enhancement
super-resolution
optimized
background texture
fal-ai/stable-cascade
text-to-image

Stable Cascade: Image generation on a smaller & cheaper latent space.

stable diffusion
lcm
stylization
efficient
image generation
background texture
fal-ai/minimax-video
text-to-video

Generate video clips from your prompts using MiniMax model

video generation
minimax
text to video
ai video
motion
transformation
background texture
fal-ai/haiper-video-v2
text-to-video

Transform text into hyper-realistic videos with Haiper 2.0. Experience industry-leading resolution, fluid motion, and rapid generation for stunning AI videos.

haiper
video generation
text to video
ai video
hyperrealistic
motion
background texture
fal-ai/haiper-video-v2/image-to-video
image-to-video

Transform text into hyper-realistic videos with Haiper 2.0. Experience industry-leading resolution, fluid motion, and rapid generation for stunning AI videos.

haiper
video generation
image to video
ai video
hyperrealistic
motion
background texture
fal-ai/mochi-v1
text-to-video

Mochi 1 preview is an open state-of-the-art video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation.

video generation
text to video
ai video
high fidelity motion
fast
optimized
background texture
fal-ai/hunyuan-video
text-to-video

Hunyuan Video is an Open video generation model with high visual quality, motion diversity, text-video alignment, and generation stability

video generation
text to video
ai video
high fidelity motion
fast
optimized
background texture
fal-ai/luma-dream-machine
text-to-video

Generate video clips from your prompts using Luma Dream Machine v1.5

video generation
text to video
luma
ai video
motion
transformation
background texture
fal-ai/luma-dream-machine/image-to-video
image-to-video

Generate video clips from your images using Luma Dream Machine v1.5

video generation
image to video
luma
ai video
motion
transformation
background texture
fal-ai/luma-photon
text-to-image

Generate images from your prompts using Luma Photon. Photon is the most creative, personalizable, and intelligent visual models for creatives, bringing a step-function change in the cost of high-quality image generation.

image generation
text to image
luma
ai image
background texture
fal-ai/luma-photon/flash
text-to-image

Generate images from your prompts using Luma Photon Flash. Photon Flash is the most creative, personalizable, and intelligent visual models for creatives, bringing a step-function change in the cost of high-quality image generation.

image generation
text to image
luma
ai image
fast
background texture
fal-ai/kling-video/v1/standard/text-to-video
text-to-video

Generate video clips from your prompts using Kling 1.0

video generation
text to video
kling
ai video
standard
motion
background texture
fal-ai/kling-video/v1/standard/image-to-video
image-to-video

Generate video clips from your images using Kling 1.0

video generation
image to video
kling
ai video
standard
motion
background texture
fal-ai/kling-video/v1/pro/text-to-video
text-to-video

Generate video clips from your prompts using Kling 1.0 (pro)

video generation
text to video
kling
ai video
professional
motion
background texture
fal-ai/kling-video/v1/pro/image-to-video
image-to-video

Generate video clips from your images using Kling 1.0 (pro)

video generation
image to video
kling
ai video
professional
motion
background texture
fal-ai/kling-video/v1.5/pro/image-to-video
image-to-video

Generate video clips from your images using Kling 1.5 (pro)

video generation
image to video
kling
ai video
professional
improved
background texture
fal-ai/kling-video/v1.5/pro/text-to-video
text-to-video

Generate video clips from your prompts using Kling 1.5 (pro)

video generation
text to video
kling
ai video
professional
improved
background texture
fal-ai/cogvideox-5b
text-to-video

Generate videos from prompts using CogVideoX-5B

video generation
text to video
cogvideox
optimized
video processing
ai video
background texture
fal-ai/cogvideox-5b/video-to-video
video-to-video

Generate videos from videos and prompts using CogVideoX-5B

video processing
video to video
cogvideox
optimized
video editing
ai video
background texture
fal-ai/cogvideox-5b/image-to-video
image-to-video

Generate videos from images and prompts using CogVideoX-5B

video generation
image to video
cogvideox
optimized
video processing
ai video
background texture
fal-ai/ltx-video
text-to-video

Generate videos from prompts using LTX Video

video generation
text to video
ltx
optimized
fast
ai video
background texture
fal-ai/ltx-video/image-to-video
image-to-video

Generate videos from images using LTX Video

video generation
image to video
ltx
optimized
fast
ai video
background texture
fal-ai/stable-video
image-to-video

Generate short video clips from your images using SVD v1.1

video generation
image to video
stable video diffusion (svg)
optimized
ai video
fast
background texture
fal-ai/fast-svd/text-to-video
text-to-video

Generate short video clips from your prompts using SVD v1.1

video generation
text to video
stable video diffusion (svg)
optimized
ai video
fast
background texture
fal-ai/fast-svd-lcm
image-to-video

Generate short video clips from your images using SVD v1.1 at Lightning Speed

video generation
image to video
stable video diffusion (svg)
optimized
ai video
turbo
background texture
fal-ai/birefnet/v2
image-to-image

bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS)

background removal
image segmentation
high resolution
utility
improved
rembg
background texture
fal-ai/fast-svd-lcm/text-to-video
text-to-video

Generate short video clips from your images using SVD v1.1 at Lightning Speed

video generation
image to video
lcm
stable video diffusion (svg)
optimized
ai video
turbo
background texture
fal-ai/creative-upscaler
image-to-image

Create creative upscaled images.

upscaling
creative
image enhancement
super resolution
high hidelity
optimized
background texture
fal-ai/clarity-upscaler
image-to-image

Clarity upscaler for images with high fidelity.

upscaling
high Fidelity
image enhancement
super resolution
clarity enhancement
optimized
background texture
fal-ai/ccsr
image-to-image

SOTA Image Upscaler

upscaling
image enhancement
super resolution
high hidelity
optimized
background texture
fal-ai/fast-turbo-diffusion
text-to-image

Run SDXL at the speed of light

stable diffusion
turbo
real-time
optimized
fast
high quality
background texture
fal-ai/fast-turbo-diffusion/image-to-image
image-to-image

Run SDXL at the speed of light

stable diffusion
turbo
real-time
optimized
fast
image editing
background texture
fal-ai/fast-turbo-diffusion/inpainting
image-to-image

Run SDXL at the speed of light

stable diffusion
turbo
real-time
inpainting
optimized
fast
background texture
fal-ai/fast-lcm-diffusion
text-to-image

Run SDXL at the speed of light

lcm
stable diffusion
turbo
real-time
optimized
fast
background texture
fal-ai/fast-lcm-diffusion/image-to-image
image-to-image

Run SDXL at the speed of light

lcm
stable diffusion
turbo
real-time
optimized
image editing
fast
background texture
fal-ai/fast-lcm-diffusion/inpainting
image-to-image

Run SDXL at the speed of light

lcm
stable diffusion
turbo
real-time
optimized
image editing
fast
background texture
fal-ai/whisper
speech-to-text

Whisper is a model for speech transcription and translation.

speech to text
transcription
translation
whisper v3
audio processing
optimized
background texture
fal-ai/wizper
speech-to-text

[Experimental] Whisper v3 Large -- but optimized by our inference wizards. Same WER, double the performance!

speech to text
transcription
whisper v3
audio processing
optimized
fast
background texture
fal-ai/fast-lightning-sdxl
text-to-image

Run SDXL at the speed of light

stable diffusion
lightning
real-time
optimized
fast
background texture
fal-ai/fast-lightning-sdxl/image-to-image
image-to-image

Run SDXL at the speed of light

stable diffusion
lightning
optimized
fast
image editing
background texture
fal-ai/fast-lightning-sdxl/inpainting
image-to-image

Run SDXL at the speed of light

stable diffusion
lightning
optimized
fast
inpainting
background texture
fal-ai/hyper-sdxl
text-to-image

Hyper-charge SDXL's performance and creativity.

stable diffusion
hyper
real-time
optimized
fast
background texture
fal-ai/hyper-sdxl/image-to-image
image-to-image

Hyper-charge SDXL's performance and creativity.

stable diffusion
hyper
optimized
fast
image editing
background texture
fal-ai/hyper-sdxl/inpainting
image-to-image

Hyper-charge SDXL's performance and creativity.

stable diffusion
hyper
optimized
fast
inpainting
background texture
fal-ai/playground-v25
text-to-image

State-of-the-art open-source model in aesthetic quality

text to image
playground v2.5
artistic
aesthetic quality
composition
style
background texture
fal-ai/playground-v25/image-to-image
image-to-image

State-of-the-art open-source model in aesthetic quality

image to image
playground v2.5
artistic
aesthetic quality
composition
style
background texture
fal-ai/playground-v25/inpainting
image-to-image

State-of-the-art open-source model in aesthetic quality

inpainting
playground v2.5
artistic
aesthetic quality
composition
style
background texture
fal-ai/amt-interpolation
video-to-video

Interpolate between video frames

video interpolation
amt
video processing
frame interpolation
motion smoothing
video editing
background texture
fal-ai/amt-interpolation/frame-interpolation
image-to-video

Interpolate between image frames

video interpolation
amt
video processing
frame interpolation
motion smoothing
video editing
background texture
fal-ai/t2v-turbo
text-to-video

Generate short video clips from your prompts

video generation
text to video
ai video
turbo
fast
background texture
fal-ai/sd15-depth-controlnet
image-to-image

SD 1.5 ControlNet

stable diffusion
depth controlnet
image editing
depth manipulation
image manipulation
controlnet
background texture
fal-ai/photomaker
image-to-image

Customizing Realistic Human Photos via Stacked ID Embedding

image editing
customization
photorealistic
personalization
background texture
fal-ai/lcm
text-to-image

Produce high-quality images with minimal inference steps.

text to image
stable diffusion
lcm
real-time
optimized
background texture
fal-ai/lcm-sd15-i2i
image-to-image

Produce high-quality images with minimal inference steps. Optimized for 512x512 input image size.

image to image
stable diffusion
lcm
real-time
optimized
background texture
fal-ai/fooocus
text-to-image

Default parameters with automated optimizations and quality improvements.

text to image
fooocus
stylized
optimized
background texture
fal-ai/animatediff-v2v
video-to-video

Re-animate your videos with evolved consistency!

video to video
animatediff
animation
stylized
video processing
background texture
fal-ai/animatediff-v2v/turbo
video-to-video

Re-animate your videos with evolved consistency!

video to video
animatediff
animation
stylized
video processing
fast
turbo
background texture
fal-ai/fast-animatediff/text-to-video
text-to-video

Animate your ideas!

video to video
animatediff
animation
stylized
video processing
fast
background texture
fal-ai/fast-animatediff/video-to-video
video-to-video

Re-animate your videos!

video to video
animatediff
animation
stylized
video processing
ai video
fast
background texture
fal-ai/fast-animatediff/turbo/text-to-video
text-to-video

Animate your ideas in lightning speed!

text to video
animatediff
animation
stylized
video processing
ai video
fast
turbo
background texture
fal-ai/fast-animatediff/turbo/video-to-video
video-to-video

Re-animate your videos in lightning speed!

video to video
animatediff
animation
stylized
video processing
ai video
fast
turbo
background texture
fal-ai/illusion-diffusion
text-to-image

Create illusions conditioned on image.

text to image
image generation
image conditioning
stylized
diffusion model
ai art
background texture
fal-ai/imageutils/depth
image-to-image

Create depth maps using Midas depth estimation.

depth estimation
midas
utility
depth map generation
3d vision
image analysis
background texture
fal-ai/imageutils/rembg
image-to-image

Remove the background from an image.

background removal
utility
image editing
background subtraction
optimized
background texture
fal-ai/esrgan
image-to-image

Upscale images by a given factor.

upscaling
image enhancement
super-resolution
high resolution
optimized
background texture
fal-ai/fast-sdxl-controlnet-canny
text-to-image

Generate Images with ControlNet.

stable diffusion
controlnet
text to image
image generation
image manipulation
sdxl
fast
background texture
fal-ai/fast-sdxl-controlnet-canny/image-to-image
image-to-image

Generate Images with ControlNet.

stable diffusion
controlnet
image to image
image editing
image manipulation
sdxl
background texture
fal-ai/fast-sdxl-controlnet-canny/inpainting
image-to-image

Generate Images with ControlNet.

stable diffusion
controlnet
inpainting
image editing
image manipulation
sdxl
background texture
fal-ai/inpaint
image-to-image

Inpaint images with SD and SDXL

inpainting
image editing
stable diffusion
sdxl
image restoration
background texture
fal-ai/animatediff-sparsectrl-lcm
text-to-video

Animate Your Drawings with Latent Consistency Models!

text to video
animatediff
sparsectrl
lcm
animation
stylized
background texture
fal-ai/pulid
image-to-image

Tuning-free ID customization.

pulid
image editing
customization
personalization
sdxl
background texture
fal-ai/ip-adapter-face-id
image-to-image

High quality zero-shot personalization

ip-adapter
personalization
customization
image editing
background texture
fal-ai/imageutils/marigold-depth
image-to-image

Create depth maps using Marigold depth estimation.

depth estimation
marigold
utility
depth map generation
3d vision
image analysis
background texture
fal-ai/stable-audio
text-to-audio

Open source text-to-audio model.

text to audio
audio generation
audio synthesis
music generation
background texture
fal-ai/diffusion-edge
text-to-image

Diffusion based high quality edge detection

edge detection
image analysis
background texture
fal-ai/triposr
image-to-3d

State of the art Image to 3D Object generation

image to 3d
3d object generation
3d modeling
triposr
background texture
fal-ai/fooocus/upscale-or-vary
text-to-image

Default parameters with automated optimizations and quality improvements.

text to image
fooocus
upscaling
image variation
stylized
optimized
background texture
fal-ai/fooocus/image-prompt
text-to-image

Default parameters with automated optimizations and quality improvements.

text to image
fooocus
image prompt
image to image
stylized
optimized
background texture
fal-ai/fooocus/inpaint
text-to-image

Default parameters with automated optimizations and quality improvements.

text to image
fooocus
inpainting
stylized
optimized
image editing
background texture
fal-ai/retoucher
image-to-image

Automatically retouches faces to smooth skin and remove blemishes.

face retoucher
image editing
enhancement
face manipulation
background texture
fal-ai/any-llm
llm

Use any large language model from our selected catalogue (powered by OpenRouter)

large language model (LLM)
text to text
text generation
language modeling
streaming
background texture
fal-ai/any-llm/vision
vision

Use any vision language model from our selected catalogue (powered by OpenRouter)

vision language model (VLM)
image to text
multimodal
vision
language
streaming
image understanding
background texture
fal-ai/llavav15-13b
vision

Vision

vision language model (VLM)
llava
multimodal
vision
language
image understanding
background texture
fal-ai/llava-next
vision

Vision

vision language model (VLM)
llava
multimodal
vision
language
image understanding
background texture
fal-ai/imageutils/nsfw
vision

Predict the probability of an image being NSFW.

nsfw filter
image classification
safety
content moderation
utility
background texture
fal-ai/fast-fooocus-sdxl
text-to-image

Fooocus extreme speed mode as a standalone app.

text to image
fooocus
sdxl
fast
optimized
Stylized
background texture
fal-ai/fast-fooocus-sdxl/image-to-image
text-to-image

Fooocus extreme speed mode as a standalone app.

image to image
fooocus
sdxl
fast
optimized
stylized
background texture
fal-ai/face-to-sticker
image-to-image

Create stickers from faces.

sticker generation
face detection
image editing
fun effects
background texture
fal-ai/moondream/batched
vision

Answer questions from the images.

vision language model (VLM)
moondream
multimodal
vision
language
image understanding
background texture
fal-ai/sadtalker
image-to-video

Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

talking face animation
audio-driven animation
realistic animation
image to video
background texture
fal-ai/musetalk
image-to-video

MuseTalk is a real-time high quality audio-driven lip-syncing model. Use MuseTalk to animate a face with your own audio.

talking face animation
audio-driven animation
lip sync
real-time
high quality
background texture
fal-ai/sadtalker/reference
image-to-video

Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

talking face animation
audio-driven animation
realistic animation
reference based
background texture
fal-ai/layer-diffusion
text-to-image

SDXL with an alpha channel.

text to image
sdxl
image generation
image synthesis
diffusion model
background texture
fal-ai/stable-diffusion-v15
text-to-image

Stable Diffusion v1.5

text to image
stable diffusion
image generation
diffusion
background texture
fal-ai/lora/image-to-image
image-to-image

Run Any Stable Diffusion model with customizable LoRA weights.

stable diffusion
lora
image to image
stylization
customization
fine-tuning
background texture
fal-ai/fast-sdxl/image-to-image
image-to-image

Run SDXL at the speed of light

stable diffusion
sdxl
image to image
high resolution
fast
lora
ip-adapter
controlnet
background texture
fal-ai/fast-sdxl/inpainting
image-to-image

Run SDXL at the speed of light

stable diffusion
sdxl
inpainting
high resolution
fast
lora
ip-adapter
controlnet
background texture
fal-ai/lora/inpaint
image-to-image

Run Any Stable Diffusion model with customizable LoRA weights.

stable diffusion
lora
inpainting
stylization
customization
fine-tuning
background texture
fal-ai/pixart-sigma
text-to-image

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

text to image
pixart sigma
generation
diffusion
4k
background texture
fal-ai/dreamshaper
text-to-image

Dreamshaper model.

text to image
dreamshaper
stylized
image generation
diffusion
background texture
fal-ai/realistic-vision
text-to-image

Generate realistic images.

text to image
realistic vision
photorealistic
image generation
diffusion
background texture
fal-ai/lightning-models
text-to-image

Collection of SDXL Lightning models.

text to image
sdxl lightning
stable diffusion
fast
optimized
background texture
fal-ai/omni-zero
image-to-image

Any pose, any style, any identity

image to image
omni zero
style transfer
identity transfer
background texture
fal-ai/cat-vton
image-to-image

Image based Virtual Try-On

image to image
virtual try-on
fashion
clothing
background texture
fal-ai/dwpose
image-to-image

Predict poses.

pose prediction
pose estimation
utility
image analysis
background texture
fal-ai/stable-cascade/sote-diffusion
text-to-image

Anime finetune of Würstchen V3.

text to image
sotediffusion
anime style
finetuned
lcm
stylized
background texture
fal-ai/florence-2-large/caption
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

image captioning
florence-2
vision language model (VLM)
multimodal
vision
language
image understanding
background texture
fal-ai/florence-2-large/detailed-caption
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

image captioning
florence-2
vision language model (VLM)
multimodal
vision
language
image understanding
background texture
fal-ai/florence-2-large/more-detailed-caption
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

image captioning
florence-2
vision language model (VLM)
multimodal
vision
language
image understanding
background texture
fal-ai/florence-2-large/object-detection
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

object detection
florence-2
vision language model (VLM)
multimodal
vision
image analysis
image understanding
background texture
fal-ai/florence-2-large/dense-region-caption
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

dense region captioning
florence-2
vision language model (VLM)
multimodal
vision
language
image understanding
background texture
fal-ai/florence-2-large/region-proposal
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

region proposal
florence-2
vision language model (VLM)
multimodal
vision
image analysis
image understanding
background texture
fal-ai/florence-2-large/caption-to-phrase-grounding
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

phrase grounding
florence-2
vision language model (VLM)
multimodal
vision
language
background texture
fal-ai/florence-2-large/referring-expression-segmentation
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

referring expression segmentation
florence-2
vision language model (VLM)
multimodal
vision
image segmentation
image understanding
background texture
fal-ai/florence-2-large/region-to-segmentation
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

region to segmentation
florence-2
vision language model (VLM)
multimodal
vision
image segmentation
image understanding
background texture
fal-ai/florence-2-large/open-vocabulary-detection
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

open vocabulary detection
florence-2
vision language model (VLM)
multimodal
vision
object detection
image understanding
background texture
fal-ai/florence-2-large/region-to-category
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

region to category
florence-2
vision language model (VLM)
multimodal
vision
image classification
image understanding
background texture
fal-ai/florence-2-large/region-to-description
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

region to description
florence-2
vision language model (VLM)
multimodal
vision
language
image understanding
background texture
fal-ai/florence-2-large/ocr
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

ocr
florence-2
vision language model (VLM)
multimodal
vision
text extraction
image understanding
background texture
fal-ai/florence-2-large/ocr-with-region
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

ocr
florence-2
vision language model (VLM)
multimodal
vision
text extraction
image understanding
background texture
fal-ai/era-3d
image-to-image

A powerful image to novel multiview model with normals.

image to 3d
3d model generation
multiview
era 3d
background texture
fal-ai/live-portrait
image-to-video

Transfer expression from a video to a portrait.

expression transfer
video to image
portrait animation
face animation
live portrait
background texture
fal-ai/live-portrait/image
image-to-image

Transfer expression from a video to a portrait.

expression transfer
image to image
portrait animation
face animation
live portrait
background texture
fal-ai/kolors
text-to-image

Photorealistic Text-to-Image

text to image
kolors
photorealistic
image generation
diffusion
high quality
background texture
fal-ai/kolors/image-to-image
image-to-image

Photorealistic Image-to-Image

image to image
kolors
photorealistic
image editing
diffusion
high quality
background texture
fal-ai/sdxl-controlnet-union
text-to-image

An efficent SDXL multi-controlnet text-to-image model.

stable diffusion
sdxl
controlnet
image conditioning
text to image
efficient
background texture
fal-ai/sdxl-controlnet-union/image-to-image
image-to-image

An efficent SDXL multi-controlnet image-to-image model.

stable diffusion
sdxl
controlnet
image conditioning
image to image
efficient
background texture
fal-ai/sdxl-controlnet-union/inpainting
image-to-image

An efficent SDXL multi-controlnet inpainting model.

stable diffusion
sdxl
controlnet
image conditioning
inpainting
efficient
background texture
fal-ai/sam2/image
image-to-image

SAM 2 is a model for segmenting images and videos in real-time.

image segmentation
segment anything model (sam)
mask generation
Interactive Segmentation
real-time
sam2
background texture
fal-ai/sam2/video
video-to-video

SAM 2 is a model for segmenting images and videos in real-time.

Video Segmentation
segment anything model (sam)
mask generation
Interactive Segmentation
real-time
sam2
background texture
fal-ai/mini-cpm
vision

Multimodal vision-language model for single/multi image understanding

vision language model (vllm)
minicpm
multimodal
image understanding
text generation
vision
background texture
fal-ai/mini-cpm/video
vision

Multimodal vision-language model for video understanding

vision language model (vllm)
minicpm
multimodal
video understanding
text generation
vision
background texture
fal-ai/controlnext
video-to-video

Animate a reference image with a driving video using ControlNeXt.

video to video
controlnext
image animation
motion transfer
svd
background texture
fal-ai/workflowutils/canny
image-to-image

Various image preprocessing tools for ControlNet and other applications.

image preprocessing
controlnet
utility
image editing
image manipulation
background texture
fal-ai/workflowutils/canny
image-to-image

Canny edge detection preprocessor.

canny edge detection
image preprocessing
controlnet
utility
edge detection
legacy
background texture
fal-ai/image-preprocessors/depth-anything/v2
image-to-image

Depth Anything v2 preprocessor.

depth estimation
image preprocessing
depth anything v2
utility
depth map generation
controlnet
background texture
fal-ai/image-preprocessors/hed
image-to-image

Holistically-Nested Edge Detection (HED) preprocessor.

hed
image preprocessing
edge detection
utility
image analysis
controlnet
background texture
fal-ai/image-preprocessors/lineart
image-to-image

Line art preprocessor.

line art extraction
image preprocessing
utility
line art
sketch extraction
controlnet
background texture
fal-ai/image-preprocessors/midas
image-to-image

MiDaS depth estimation preprocessor.

depth estimation
image preprocessing
midas
utility
depth map generation
controlnet
background texture
fal-ai/image-preprocessors/mlsd
image-to-image

M-LSD line segment detection preprocessor.

image preprocessing
m-lsd
utility
line detection
controlnet
background texture
fal-ai/image-preprocessors/pidi
image-to-image

PIDI (Pidinet) preprocessor.

edge detection
image preprocessing
pidi (pidinet)
utility
image analysis
controlnet
background texture
fal-ai/image-preprocessors/sam
image-to-image

Segment Anything Model (SAM) preprocessor.

image segmentation
image preprocessing
segment anything model (sam)
utility
mask generation
controlnet
background texture
fal-ai/image-preprocessors/scribble
image-to-image

Scribble preprocessor.

scribble preprocessing
image preprocessing
utility
image editing
controlnet
sketch based
background texture
fal-ai/image-preprocessors/teed
image-to-image

TEED (Temporal Edge Enhancement Detection) preprocessor.

teed
image preprocessing
edge detection
utility
controlnet
background texture
fal-ai/image-preprocessors/zoe
image-to-image

ZoeDepth preprocessor.

depth estimation
image preprocessing
zoedepth
utility
depth map generation
controlnet
background texture
fal-ai/f5-tts
text-to-audio

F5 TTS

text to speech (tts)
f5
audio generation
speech synthesis
high quality