Model registry

See all available model APIs provided by fal.ai

Show
Stable Diffusion XL

Run SDXL at the speed of light

text-to-image
inference
Stable Diffusion with LoRAs

Run Any Stable Diffusion model with customizable LoRA weights.

text-to-image
inference
stylized
Stable Cascade

Stable Cascade: Image generation on a smaller & cheaper latent space.

text-to-image
inference
stylized
Stable Video Diffusion

Generate short video clips from your images using SVD v1.1

image-to-video
inference
Stable Video Diffusion Turbo

Generate short video clips from your images using SVD v1.1 at Lightning Speed

image-to-video
inference
Creative Upscaler

Create creative upscaled images.

image-to-image
inference
utility
CCSR Upscaler

SOTA Image Upscaler

image-to-image
inference
utility
Stable Diffusion Turbo (v1.5/XL)

Run SDXL at the speed of light

text-to-image
inference
real-time
Latent Consistency Models (v1.5/XL)

Run SDXL at the speed of light

text-to-image
inference
real-time
Stable Diffusion XL Lightning

Run SDXL at the speed of light

text-to-image
inference
real-time
PhotoMaker

Customizing Realistic Human Photos via Stacked ID Embedding

image-to-image
inference
Whisper

Whisper is a model for speech transcription and translation.

speech-to-text
inference
speech
Latent Consistency (SDXL & SDv1.5)

Produce high-quality images with minimal inference steps.

text-to-image
inference
real-time
Optimized Latent Consistency (SDv1.5)

Produce high-quality images with minimal inference steps. Optimized for 512x512 input image size.

image-to-image
inference
real-time
Fooocus

Default parameters with automated optimizations and quality improvements.

text-to-image
inference
stylized
InstantID

Zero-shot Identity-Preserving Generation in Seconds

image-to-image
inference
stylized
AnimateDiff Video-to-Video Evolved

Re-animate your videos with evolved consistency!

video-to-video
inference
stylized
AnimateDiff

Animate your ideas!

text-to-video
inference
stylized
AnimateDiff Turbo

Animate your ideas in lightning speed!

text-to-video
inference
stylized
MetaVoice

MetaVoice-1B is a 1.2B parameter base model trained on 100K hours of speech for TTS (text-to-speech).

text-to-speech
inference
stylized
MusicGen

Create high-quality music by taking cues from text descriptions or melodies.

text-to-audio
inference
stylized
Illusion Diffusion

Create illusions conditioned on image.

text-to-image
inference
stylized
Comfy Workflow Executor

Execute Comfy workflows in fal.

json-to-image
inference
Segment Anything Model

SAM.

image-to-image
inference
masks
TinySAM Distilled Segment Anything Model

TinySAM.

image-to-image
inference
masks
Midas Depth Estimation

Create depth maps using Midas depth estimation.

image-to-image
inference
utility
Remove Background

Remove the background from an image.

image-to-image
inference
utility
Upscale Images

Upscale images by a given factor.

image-to-image
inference
utility
ControlNet SDXL

Generate Images with ControlNet.

image-to-image
inference
Inpainting sdxl and sd

Inpaint images with SD and SDXL

image-to-image
inference
Animatediff SparseCtrl LCM

Animate Your Drawings with Latent Consistency Models!

text-to-video
inference
stylized
Controlled Stable Video Diffusion

Generate short video clips from your images.

image-to-image
inference
stylized
Magic Animate

Generate short video clips from motion sequence.

image-to-image
inference
stylized
Swap Face

Swap a face between two images.

image-to-image
inference
utility
IP Adapter Face ID

High quality zero-shot personalization

image-to-image
inference
stylized
Marigold Depth Estimation

Create depth maps using Marigold depth estimation.

image-to-image
inference
utility
DreamTalk

Animate Faces with Audio Files

video-to-video
inference
utility
XTTS

text-to-audio
inference
utility
DiffusionEdge

Diffusion based high quality edge detection

text-to-image
inference
Stability Zero123 Upscale

Turn an image to 3D rotating video of the object

image-to-video
inference
Stable Diffusion XL Image to Image with LoRAs

Run Stable Diffusion XL with customizable LoRA weights.

image-to-image
inference
stylized
openlrm

Image to 3D Rotating Video and Mesh in Seconds

image-to-video
inference
stylized
InstaSoyjaknow

SOYJAK!!!!!!

image-to-image
inference
stylized
Face Retoucher

Automatically retouches faces to smooth skin and remove blemishes.

image-to-image
inference
utility
LLaVA v1.5 13B

Vision

vision
inference
NSFW Filter

Predict the probability of an image being NSFW.

image-to-json
inference
utility
SUPIR Upscaler

A Powerful Image Upscaler

image-to-image
inference
utility