Video to Video

fal-ai/sync-lipsync

Generate realistic lipsync animations from audio using advanced algorithms for high-quality synchronization.

animation
lip sync
fal-ai/video-upscaler

The video upscaler endpoint uses RealESRGAN on each frame of the input video to upscale the video to a higher resolution.

video generation
video to video
ai video
high fidelity motion
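
The per-frame approach described for fal-ai/video-upscaler can be pictured as a loop over frames. The following is a minimal sketch, not the endpoint's implementation: frames are assumed to be plain 2D lists of pixel values, nearest-neighbour scaling stands in for RealESRGAN, and the names `upscale_frame` and `upscale_video` are hypothetical.

```python
from typing import List

Frame = List[List[int]]  # one grayscale frame: rows of pixel values


def upscale_frame(frame: Frame, factor: int) -> Frame:
    """Nearest-neighbour upscale; a placeholder for a learned upscaler."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    out: Frame = []
    for row in frame:
        # Repeat each pixel horizontally, then repeat the widened row vertically.
        wide = [px for px in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def upscale_video(frames: List[Frame], factor: int) -> List[Frame]:
    """Upscale every frame independently, keeping frame order and count."""
    return [upscale_frame(f, factor) for f in frames]


video = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]  # two 2x2 frames
upscaled = upscale_video(video, 2)
print(len(upscaled), len(upscaled[0]), len(upscaled[0][0]))
```

Because each frame is processed on its own, the frame count and ordering are unchanged; only the spatial resolution grows.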
fal-ai/auto-caption

Automatically generates text captions for your videos from the audio, styled according to your text colour and font specifications.

captioning
video
fal-ai/mmaudio-v2

MMAudio generates synchronized audio given video and/or text inputs. It can be combined with video models to get videos with audio.

ai video
fast
fal-ai/cogvideox-5b/video-to-video

Generate videos from videos and prompts using CogVideoX-5B

editing
fal-ai/ffmpeg-api/compose

Compose videos from multiple media sources using FFmpeg API.

ffmpeg
fal-ai/amt-interpolation

Interpolate between video frames

interpolation
editing
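
To illustrate what "interpolating between frames" means, here is a minimal sketch that inserts linearly blended frames between consecutive frames. This is an assumption-laden stand-in: AMT is a learned, motion-aware model, whereas this example only cross-fades pixel values, and the names `blend` and `interpolate` are hypothetical.

```python
from typing import List

Frame = List[List[float]]  # one grayscale frame: rows of pixel values


def blend(a: Frame, b: Frame, t: float) -> Frame:
    """Linear blend of two same-sized frames at position t in [0, 1]."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("frames must have the same shape")
    return [
        [(1 - t) * pa + t * pb for pa, pb in zip(ra, rb)]
        for ra, rb in zip(a, b)
    ]


def interpolate(frames: List[Frame], n_between: int) -> List[Frame]:
    """Insert n_between blended frames between each consecutive pair."""
    if not frames:
        return []
    out: List[Frame] = []
    for a, b in zip(frames, frames[1:]):
        out.append(a)
        for k in range(1, n_between + 1):
            out.append(blend(a, b, k / (n_between + 1)))
    out.append(frames[-1])
    return out


clip = [[[0.0]], [[10.0]]]  # two 1x1 frames
smooth = interpolate(clip, 1)
print(smooth)
```

With `n_between = 1` the output has one extra frame per original pair, so an N-frame clip becomes 2N - 1 frames.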
fal-ai/animatediff-v2v

Re-animate your videos with evolved consistency!

animation
stylized
fal-ai/animatediff-v2v/turbo

Re-animate your videos with evolved consistency!

animation
stylized
turbo
fal-ai/fast-animatediff/video-to-video

Re-animate your videos!

animation
stylized
fal-ai/fast-animatediff/turbo/video-to-video

Re-animate your videos in lightning speed!

animation
stylized
turbo
fal-ai/dubbing

This endpoint delivers seamlessly localized videos by generating lip-synced dubs in multiple languages, ensuring natural and immersive multilingual experiences

animation
lip sync
dubbing
fal-ai/sam2/video

SAM 2 is a model for segmenting images and videos in real-time.

segmentation
mask
real-time
fal-ai/controlnext

Animate a reference image with a driving video using ControlNeXt.

animation
stylized
fal-ai/latentsync

LatentSync is a video-to-video model that generates lip sync animations from audio using advanced algorithms for high-quality synchronization.

animation
lip sync
110602490/join-audio-video

Join Audio and Video

utils
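
The identifiers in this listing appear to follow an owner/app shape with an optional sub-path (for example `fal-ai/cogvideox-5b/video-to-video` or `110602490/join-audio-video`). The sketch below splits an identifier along those lines. It reflects a reading of this listing only, not a documented specification, and `EndpointId` and `split_endpoint` are hypothetical names.

```python
from typing import NamedTuple


class EndpointId(NamedTuple):
    owner: str
    app: str
    path: str  # empty string when the identifier has no sub-path


def split_endpoint(endpoint: str) -> EndpointId:
    """Split 'owner/app[/path...]' into its parts (assumed layout)."""
    parts = endpoint.strip("/").split("/")
    if len(parts) < 2 or not all(parts):
        raise ValueError(f"expected 'owner/app[/path]', got {endpoint!r}")
    return EndpointId(parts[0], parts[1], "/".join(parts[2:]))


for name in ("fal-ai/sync-lipsync", "fal-ai/cogvideox-5b/video-to-video"):
    print(split_endpoint(name))
```

Grouping entries by `owner` and `app` in this way makes variants such as `/turbo` or `/image-to-image` easier to see as siblings of one base model.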

Text to Audio

fal-ai/minimax-music

Generate music from text prompts using the MiniMax model, which leverages advanced AI techniques to create high-quality, diverse musical compositions.

music
fal-ai/mmaudio-v2/text-to-audio

MMAudio generates synchronized audio given text inputs. It can generate sounds described by a prompt.

audio
fast
fal-ai/stable-audio

Open source text-to-audio model.

music
fal-ai/f5-tts

F5 TTS

speech

Text to Image

fal-ai/flux-pro/v1.1-ultra

FLUX1.1 [pro] ultra is the newest version of FLUX1.1 [pro], maintaining professional-grade image quality while delivering up to 2K resolution with improved photo realism.

high-res
realism
fal-ai/flux-pro/v1.1-ultra-finetuned

FLUX1.1 [pro] ultra fine-tuned is the newest version of FLUX1.1 [pro] with a fine-tuned LoRA, maintaining professional-grade image quality while delivering up to 2K resolution with improved photo realism.

high-res
realism
fal-ai/ideogram/v2

Generate high-quality images, posters, and logos with Ideogram V2. Features exceptional typography handling and realistic outputs optimized for commercial and creative use.

realism
typography
fal-ai/recraft-v3

Recraft V3 is a text-to-image model that can generate long texts, vector art, images in brand style, and much more. As of today, it is SOTA in image generation according to Hugging Face's Text-to-Image Benchmark by Artificial Analysis.

vector
typography
style
fal-ai/aura-flow

AuraFlow v0.3 is an open-source flow-based text-to-image generation model that achieves state-of-the-art results on GenEval. The model is currently in beta.

typography
style
fal-ai/flux/dev

FLUX.1 [dev] is a 12 billion parameter flow transformer that generates high-quality images from text. It is suitable for personal and commercial use.

fal-ai/flux-lora

Super fast endpoint for the FLUX.1 [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora
personalization
fal-ai/flux-lora/inpainting

Super fast endpoint for the FLUX.1 [dev] inpainting model with LoRA support, enabling rapid and high-quality image inpainting using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora
personalization
fal-ai/flux/schnell

FLUX.1 [schnell] is a 12 billion parameter flow transformer that generates high-quality images from text in 1 to 4 steps, suitable for personal and commercial use.

fal-ai/flux-subject

Super fast endpoint for the FLUX.1 [schnell] model with subject input capabilities, enabling rapid and high-quality image generation for personalization, specific styles, brand identities, and product-specific outputs.

personalization
customization
fal-ai/flux-pro/v1.1

FLUX1.1 [pro] is an enhanced version of FLUX.1 [pro] with improved image generation capabilities, delivering superior composition, detail, and artistic fidelity compared to its predecessor.

fal-ai/flux-pro/new

FLUX.1 [pro] new is an accelerated version of FLUX.1 [pro], maintaining professional-grade image quality while delivering significantly faster generation speeds.

fal-ai/sana

Sana can synthesize high-resolution, high-quality images with strong text-image alignment at a remarkably fast speed, with the ability to generate 4K images in less than a second.

fal-ai/omnigen-v1

OmniGen is a unified image generation model that can generate a wide range of images from multi-modal prompts. It can be used for various tasks such as Image Editing, Personalized Image Generation, Virtual Try-On, Multi Person Generation and more!

multimodal
editing
try-on
fal-ai/stable-diffusion-v35-large

Stable Diffusion 3.5 Large is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.

diffusion
typography
style
fal-ai/stable-diffusion-v35-medium

Stable Diffusion 3.5 Medium is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.

diffusion
typography
style
fal-ai/switti

Switti is a scale-wise transformer for fast text-to-image generation that outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being faster than distilled diffusion models.

fal-ai/switti/512

Switti is a scale-wise transformer for fast text-to-image generation that outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being faster than distilled diffusion models.

fal-ai/recraft-20b

Recraft 20b is a new and affordable text-to-image model.

image generation
vector art
typography
style
fal-ai/ideogram/v2/turbo

Accelerated image generation with Ideogram V2 Turbo. Create high-quality visuals, posters, and logos with enhanced speed while maintaining Ideogram's signature quality.

realism
typography
fal-ai/bria/text-to-image/base

Bria's Text-to-Image model, trained exclusively on licensed data for safe and risk-free commercial use. Available also as source code and weights. For access to weights: https://bria.ai/contact-us

image generation
fal-ai/bria/text-to-image/fast

Bria's Text-to-Image model with perfect harmony of latency and quality. Trained exclusively on licensed data for safe and risk-free commercial use. Available also as source code and weights. For access to weights: https://bria.ai/contact-us

image generation
fal-ai/bria/text-to-image/hd

Bria's Text-to-Image model for HD images. Trained exclusively on licensed data for safe and risk-free commercial use. Available also as source code and weights. For access to weights: https://bria.ai/contact-us

image generation
fal-ai/flux-general

A versatile endpoint for the FLUX.1 [dev] model that supports multiple AI extensions including LoRA, ControlNet conditioning, and IP-Adapter integration, enabling comprehensive control over image generation through various guidance methods.

lora
controlnet
ip-adapter
fal-ai/stable-diffusion-v3-medium

Stable Diffusion 3 Medium (Text to Image) is a Multimodal Diffusion Transformer (MMDiT) model that improves image quality, typography, prompt understanding, and efficiency.

diffusion
style
fal-ai/fast-sdxl

Run SDXL at the speed of light

diffusion
lora
embeddings
high-res
style
fal-ai/lora

Run Any Stable Diffusion model with customizable LoRA weights.

diffusion
lora
customization
fal-ai/stable-cascade

Stable Cascade: Image generation on a smaller & cheaper latent space.

diffusion
lcm
fal-ai/luma-photon

Generate images from your prompts using Luma Photon. Photon is the most creative, personalizable, and intelligent visual model for creatives, bringing a step-function change in the cost of high-quality image generation.

fal-ai/luma-photon/flash

Generate images from your prompts using Luma Photon Flash. Photon Flash is the most creative, personalizable, and intelligent visual model for creatives, bringing a step-function change in the cost of high-quality image generation.

fal-ai/fast-turbo-diffusion

Run SDXL at the speed of light

diffusion
turbo
real-time
fal-ai/fast-lcm-diffusion

Run SDXL at the speed of light

lcm
diffusion
turbo
real-time
fal-ai/fast-lightning-sdxl

Run SDXL at the speed of light

diffusion
lightning
real-time
fal-ai/hyper-sdxl

Hyper-charge SDXL's performance and creativity.

diffusion
real-time
fal-ai/playground-v25

State-of-the-art open-source model in aesthetic quality

artistic
style
fal-ai/lcm

Produce high-quality images with minimal inference steps.

diffusion
lcm
real-time
fal-ai/fooocus

Default parameters with automated optimizations and quality improvements.

stylized
fal-ai/illusion-diffusion

Create illusions conditioned on image.

composition
stylized
fal-ai/fast-sdxl-controlnet-canny

Generate Images with ControlNet.

diffusion
controlnet
manipulation
fal-ai/diffusion-edge

Diffusion-based, high-quality edge detection.

detection
fal-ai/fooocus/upscale-or-vary

Default parameters with automated optimizations and quality improvements.

upscaling
vary
stylized
fal-ai/fooocus/image-prompt

Default parameters with automated optimizations and quality improvements.

stylized
fal-ai/fooocus/inpaint

Default parameters with automated optimizations and quality improvements.

stylized
editing
fal-ai/fast-fooocus-sdxl

Fooocus extreme speed mode as a standalone app.

fal-ai/fast-fooocus-sdxl/image-to-image

Fooocus extreme speed mode as a standalone app.

stylized
fal-ai/layer-diffusion

SDXL with an alpha channel.

fal-ai/stable-diffusion-v15

Stable Diffusion v1.5

diffusion
fal-ai/pixart-sigma

Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

diffusion
fal-ai/dreamshaper

Dreamshaper model.

stylized
diffusion
fal-ai/realistic-vision

Generate realistic images.

realism
diffusion
fal-ai/lightning-models

Collection of SDXL Lightning models.

diffusion
lightning
fal-ai/stable-cascade/sote-diffusion

Anime finetune of Würstchen V3.

lcm
stylized
fal-ai/kolors

Photorealistic Text-to-Image

realism
diffusion
fal-ai/sdxl-controlnet-union

An efficient SDXL multi-ControlNet text-to-image model.

diffusion
controlnet
composition
fal-ai/any-sd

Run dedicated SDXL APIs

inference
loras
embeddings

Training

fal-ai/hunyuan-video-lora-training

Train a Hunyuan Video LoRA on people, objects, characters and more!

lora
personalization
fal-ai/flux-lora-fast-training

Train styles, people and other subjects at blazing speeds.

lora
personalization
fal-ai/flux-lora-portrait-trainer

FLUX LoRA training optimized for portrait generation, with bright highlights, excellent prompt following and highly detailed results.

lora
personalization
fal-ai/flux-pro-trainer

FLUX LoRA for Pro endpoints.

lora
personalization
fal-ai/recraft-v3/create-style

Recraft V3 Create Style is capable of creating unique styles for Recraft V3 based on your images.

style
vector
personalization

Text to Video

fal-ai/minimax/video-01-live

Generate video clips from your prompts using MiniMax model

motion
transformation
fal-ai/haiper-video/v2

Transform text into hyper-realistic videos with Haiper 2.0. Experience industry-leading resolution, fluid motion, and rapid generation for stunning AI videos.

motion
fal-ai/haiper-video/v2.5/fast

Transform text into hyper-realistic videos with Haiper 2.5. Experience industry-leading resolution, fluid motion, and rapid generation for stunning AI videos.

motion
fal-ai/minimax/video-01

Generate video clips from your prompts using MiniMax model

motion
transformation
fal-ai/mochi-v1

Mochi 1 preview is an open state-of-the-art video generation model with high-fidelity motion and strong prompt adherence in preliminary evaluation.

fal-ai/hunyuan-video

Hunyuan Video is an open video generation model with high visual quality, motion diversity, text-video alignment, and generation stability.

fal-ai/hunyuan-video-lora

Hunyuan Video is an open video generation model with high visual quality, motion diversity, text-video alignment, and generation stability.

fal-ai/luma-dream-machine

Generate video clips from your prompts using Luma Dream Machine v1.5

motion
transformation
fal-ai/kling-video/v1/standard/text-to-video

Generate video clips from your prompts using Kling 1.0

motion
fal-ai/kling-video/v1/pro/text-to-video

Generate video clips from your prompts using Kling 1.0 (pro)

motion
fal-ai/kling-video/v1.5/pro/text-to-video

Generate video clips from your prompts using Kling 1.5 (pro)

fal-ai/kling-video/v1.6/standard/text-to-video

Generate video clips from your prompts using Kling 1.6 (std)

fal-ai/transpixar

Transform text into stunning videos with TransPixar - an AI model that generates both RGB footage and alpha channels, enabling seamless compositing and creative video effects.
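
An alpha channel is what makes compositing straightforward: each output pixel is a weighted mix of the generated foreground and whatever background is chosen. The sketch below shows the standard "over" operation for a single pixel with straight (non-premultiplied) alpha and an opaque background. It is a generic illustration, not TransPixar code, and the name `over` is hypothetical.

```python
from typing import Tuple

RGB = Tuple[float, float, float]  # channel values in [0, 1]


def over(fg: RGB, alpha: float, bg: RGB) -> RGB:
    """Composite a foreground pixel over an opaque background pixel."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    r, g, b = (alpha * f + (1.0 - alpha) * k for f, k in zip(fg, bg))
    return (r, g, b)


red = (1.0, 0.0, 0.0)
blue = (0.0, 0.0, 1.0)
half = over(red, 0.5, blue)
print(half)
```

Applying this per pixel and per frame places the generated subject onto any backdrop; alpha 1.0 keeps the foreground and alpha 0.0 shows only the background.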

fal-ai/cogvideox-5b

Generate videos from prompts using CogVideoX-5B

fal-ai/ltx-video

Generate videos from prompts using LTX Video

fal-ai/fast-svd/text-to-video

Generate short video clips from your prompts using SVD v1.1

fal-ai/fast-svd-lcm/text-to-video

Generate short video clips from your prompts using SVD v1.1 at lightning speed

lcm
diffusion
turbo
fal-ai/t2v-turbo

Generate short video clips from your prompts

turbo
fal-ai/fast-animatediff/text-to-video

Animate your ideas!

animation
stylized
fal-ai/fast-animatediff/turbo/text-to-video

Animate your ideas in lightning speed!

animation
stylized
turbo
fal-ai/animatediff-sparsectrl-lcm

Animate Your Drawings with Latent Consistency Models!

lcm
animation
stylized

Image to Video

fal-ai/minimax/video-01-live/image-to-video

Generate video clips from your images using MiniMax Video model

motion
transformation
fal-ai/minimax/video-01-subject-reference

Generate video clips maintaining consistent, realistic facial features and identity across dynamic video content

subject
transformation
fal-ai/minimax/video-01/image-to-video

Generate video clips from your images using MiniMax Video model

motion
transformation
fal-ai/haiper-video/v2/image-to-video

Transform text into hyper-realistic videos with Haiper 2.0. Experience industry-leading resolution, fluid motion, and rapid generation for stunning AI videos.

motion
fal-ai/haiper-video/v2.5/image-to-video/fast

Transform text into hyper-realistic videos with Haiper 2.5. Experience industry-leading resolution, fluid motion, and rapid generation for stunning AI videos.

motion
fal-ai/luma-dream-machine/image-to-video

Generate video clips from your images using Luma Dream Machine v1.5

motion
transformation
fal-ai/kling-video/v1/standard/image-to-video

Generate video clips from your images using Kling 1.0

motion
fal-ai/kling-video/v1/pro/image-to-video

Generate video clips from your images using Kling 1.0 (pro)

motion
fal-ai/kling-video/v1.5/pro/image-to-video

Generate video clips from your images using Kling 1.5 (pro)

fal-ai/kling-video/v1.6/standard/image-to-video

Generate video clips from your images using Kling 1.6 (std)

fal-ai/kling-video/v1.6/pro/image-to-video

Generate video clips from your images using Kling 1.6 (pro)

fal-ai/cogvideox-5b/image-to-video

Generate videos from images and prompts using CogVideoX-5B

fal-ai/ltx-video/image-to-video

Generate videos from images using LTX Video

fal-ai/stable-video

Generate short video clips from your images using SVD v1.1

fal-ai/fast-svd-lcm

Generate short video clips from your images using SVD v1.1 at Lightning Speed

turbo
fal-ai/amt-interpolation/frame-interpolation

Interpolate between image frames

interpolation
editing
fal-ai/sadtalker

Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

animation
fal-ai/musetalk

MuseTalk is a real-time high quality audio-driven lip-syncing model. Use MuseTalk to animate a face with your own audio.

animation
lip sync
real-time
fal-ai/sadtalker/reference

Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

animation
fal-ai/live-portrait

Transfer expression from a video to a portrait.

expression
animation

Image to 3D

fal-ai/hyper3d/rodin

Rodin by Hyper3D generates realistic and production ready 3D models from text or images.

stylized
fal-ai/triposr

State of the art Image to 3D Object generation

fal-ai/trellis

Generate 3D models from your images using Trellis. A native 3D generative model enabling versatile and high-quality 3D asset creation.

stylized

Image to Image

fal-ai/flux/dev/image-to-image

FLUX.1 Image-to-Image is a high-performance endpoint for the FLUX.1 [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
fal-ai/flux/schnell/redux

FLUX.1 [schnell] Redux is a high-performance endpoint for the FLUX.1 [schnell] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
fal-ai/flux/dev/redux

FLUX.1 [dev] Redux is a high-performance endpoint for the FLUX.1 [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
lora
fal-ai/flux-pro/v1/redux

FLUX.1 [pro] Redux is a high-performance endpoint for the FLUX.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
fal-ai/flux-pro/v1.1/redux

FLUX1.1 [pro] Redux is a high-performance endpoint for the FLUX1.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
fal-ai/flux-pro/v1.1-ultra/redux

FLUX1.1 [pro] ultra Redux is a high-performance endpoint for the FLUX1.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
high-res
fal-ai/flux-pro/v1/fill

FLUX.1 [pro] Fill is a high-performance endpoint for the FLUX.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

editing
fal-ai/flux-pro/v1/fill-finetuned

FLUX.1 [pro] Fill Fine-tuned is a high-performance endpoint for the FLUX.1 [pro] model with a fine-tuned LoRA that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

editing
fal-ai/flux-pro/v1/canny

Utilize Flux.1 [pro] Controlnet to generate high-quality images with precise control over composition, style, and structure through advanced edge detection and guidance mechanisms.

controlnet
detection
editing
composition
fal-ai/flux-pro/v1/canny-finetuned

Utilize Flux.1 [pro] Controlnet with a fine-tuned LoRA to generate high-quality images with precise control over composition, style, and structure through advanced edge detection and guidance mechanisms.

controlnet
detection
editing
composition
fal-ai/flux-pro/v1/depth

Generate high-quality images from depth maps using Flux.1 [pro] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.

depth
utility
composition
fal-ai/flux-pro/v1/depth-finetuned

Generate high-quality images from depth maps using Flux.1 [pro] depth estimation model with a fine-tuned LoRA. The model produces accurate depth representations for scene understanding and 3D visualization.

depth
utility
composition
fal-ai/flux-lora-canny

Utilize Flux.1 [dev] Controlnet to generate high-quality images with precise control over composition, style, and structure through advanced edge detection and guidance mechanisms.

controlnet
detection
lora
editing
composition
fal-ai/flux-lora-depth

Generate high-quality images from depth maps using Flux.1 [dev] depth estimation model. The model produces accurate depth representations for scene understanding and 3D visualization.

depth
lora
utility
composition
fal-ai/ideogram/v2/edit

Transform existing images with Ideogram V2's editing capabilities. Modify, adjust, and refine images while maintaining high fidelity and realistic outputs with precise prompt control.

realism
typography
fal-ai/ideogram/v2/remix

Reimagine existing images with Ideogram V2's remix feature. Create variations and adaptations while preserving core elements and adding new creative directions through prompt guidance.

realism
typography
fal-ai/ideogram/v2/turbo/edit

Edit images faster with Ideogram V2 Turbo. Quick modifications and adjustments while preserving the high-quality standards and realistic outputs of Ideogram.

realism
typography
fal-ai/ideogram/v2/turbo/remix

Rapidly create image variations with Ideogram V2 Turbo Remix. Fast and efficient reimagining of existing images while maintaining creative control through prompt guidance.

realism
typography
fal-ai/bria/eraser

Bria Eraser enables precise removal of unwanted objects from images while maintaining high-quality outputs. Trained exclusively on licensed data for safe and risk-free commercial use. Access the model's source code and weights: https://bria.ai/contact-us

image editing
object removal
fal-ai/bria/product-shot

Place any product in any scenery with just a prompt or reference image while maintaining high integrity of the product. Trained exclusively on licensed data for safe and risk-free commercial use and optimized for eCommerce.

product photography
fal-ai/bria/background/replace

Bria Background Replace allows for efficient swapping of backgrounds in images via text prompts or reference image, delivering realistic and polished results. Trained exclusively on licensed data for safe and risk-free commercial use

image editing
fal-ai/bria/genfill

Bria GenFill enables high-quality object addition or visual transformation. Trained exclusively on licensed data for safe and risk-free commercial use. Access the model's source code and weights: https://bria.ai/contact-us

image editing
fal-ai/bria/expand

Bria Expand expands images beyond their borders in high quality. Trained exclusively on licensed data for safe and risk-free commercial use. Access the model's source code and weights: https://bria.ai/contact-us

outpainting
fal-ai/bria/background/remove

Bria RMBG 2.0 enables seamless removal of backgrounds from images, ideal for professional editing tasks. Trained exclusively on licensed data for safe and risk-free commercial use. Model weights for commercial use are available here: https://share-eu1.hsforms.com/2GLpEVQqJTI2Lj7AMYwgfIwf4e04?utm_campaign=RMBG%202.0&utm_source=RMBG%20image%20and%20video%20page&utm_medium=button&utm_content=rmbg%20image%20pricing%20form

background removal
image segmentation
high resolution
utility
rembg
fal-ai/flux-lora-fill

FLUX.1 [dev] Fill is a high-performance endpoint for the FLUX.1 [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

editing
lora
fal-ai/flux-lora/image-to-image

FLUX LoRA Image-to-Image is a high-performance endpoint that transforms existing images using FLUX models, leveraging LoRA adaptations to enable rapid and precise image style transfer, modifications, and artistic variations.

lora
style transfer
fal-ai/flux-general/inpainting

FLUX General Inpainting is a versatile endpoint that enables precise image editing and completion, supporting multiple AI extensions including LoRA, ControlNet, and IP-Adapter for enhanced control over inpainting results and sophisticated image modifications.

lora
controlnet
ip-adapter
fal-ai/flux-general/image-to-image

FLUX General Image-to-Image is a versatile endpoint that transforms existing images with support for LoRA, ControlNet, and IP-Adapter extensions, enabling precise control over style transfer, modifications, and artistic variations through multiple guidance methods.

lora
controlnet
ip-adapter
fal-ai/flux-general/differential-diffusion

A specialized FLUX endpoint combining differential diffusion control with LoRA, ControlNet, and IP-Adapter support, enabling precise, region-specific image transformations through customizable change maps.

lora
controlnet
ip-adapter
fal-ai/flux-general/rf-inversion

A general purpose endpoint for the FLUX.1 [dev] model, implementing the RF-Inversion pipeline. This can be used to edit a reference image based on a prompt.

rf-inversion
editing
lora
fal-ai/flux-pulid

An endpoint for personalized image generation using FLUX, guided by a given description.

personalization
style transfer
fal-ai/iclight-v2

An endpoint for re-lighting photos and changing their backgrounds per a given description

relighting
editing
fal-ai/flux-differential-diffusion

FLUX.1 Differential Diffusion is a rapid endpoint that enables swift, granular control over image transformations through change maps, delivering fast and precise region-specific modifications while maintaining FLUX.1 [dev]'s high-quality output.

transformation
fal-ai/stable-diffusion-v3-medium/image-to-image

Stable Diffusion 3 Medium (Image to Image) is a Multimodal Diffusion Transformer (MMDiT) model that improves image quality, typography, prompt understanding, and efficiency.

diffusion
editing
style
fal-ai/aura-sr

Upscale your images with AuraSR.

upscaling
high-res
fal-ai/kling/v1-5/kolors-virtual-try-on

Kling Kolors Virtual TryOn v1.5 is a high-quality, image-based try-on endpoint that can be used for commercial try-on.

try-on
fashion
clothing
fal-ai/birefnet/v2

Bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS).

background removal
segmentation
high-res
utility
fal-ai/creative-upscaler

Create creatively upscaled images.

upscaling
fal-ai/clarity-upscaler

Clarity upscaler for images with high fidelity.

upscaling
fal-ai/ccsr

SOTA Image Upscaler

upscaling
fal-ai/fast-turbo-diffusion/image-to-image

Run SDXL at the speed of light

diffusion
turbo
real-time
editing
fal-ai/fast-turbo-diffusion/inpainting

Run SDXL at the speed of light

diffusion
turbo
real-time
fal-ai/fast-lcm-diffusion/image-to-image

Run SDXL at the speed of light

lcm
diffusion
turbo
real-time
editing
fal-ai/fast-lcm-diffusion/inpainting

Run SDXL at the speed of light

lcm
diffusion
turbo
real-time
editing
fal-ai/fast-lightning-sdxl/image-to-image

Run SDXL at the speed of light

diffusion
lightning
editing
fal-ai/fast-lightning-sdxl/inpainting

Run SDXL at the speed of light

diffusion
lightning
fal-ai/hyper-sdxl/image-to-image

Hyper-charge SDXL's performance and creativity.

diffusion
editing
fal-ai/hyper-sdxl/inpainting

Hyper-charge SDXL's performance and creativity.

diffusion
fal-ai/playground-v25/image-to-image

State-of-the-art open-source model in aesthetic quality

artistic
style
fal-ai/playground-v25/inpainting

State-of-the-art open-source model in aesthetic quality

inpaint
artistic
style
fal-ai/sd15-depth-controlnet

SD 1.5 ControlNet

diffusion
editing
manipulation
controlnet
fal-ai/photomaker

Customizing Realistic Human Photos via Stacked ID Embedding

editing
customization
realism
personalization
fal-ai/lcm-sd15-i2i

Produce high-quality images with minimal inference steps. Optimized for 512x512 input image size.

diffusion
lcm
real-time
background texture
fal-ai/imageutils/depth

Create depth maps using MiDaS depth estimation.

depth
utility
background texture
fal-ai/imageutils/rembg

Remove the background from an image.

background removal
utility
editing
background texture
fal-ai/esrgan

Upscale images by a given factor.

upscaling
high-res
background texture
fal-ai/fast-sdxl-controlnet-canny/image-to-image

Generate Images with ControlNet.

diffusion
controlnet
editing
manipulation
background texture
fal-ai/fast-sdxl-controlnet-canny/inpainting

Generate Images with ControlNet.

diffusion
controlnet
editing
manipulation
background texture
fal-ai/inpaint

Inpaint images with SD and SDXL

editing
diffusion
background texture
fal-ai/pulid

Tuning-free ID customization.

editing
customization
personalization
background texture
fal-ai/ip-adapter-face-id

High-quality zero-shot personalization

ip-adapter
personalization
customization
editing
background texture
fal-ai/imageutils/marigold-depth

Create depth maps using Marigold depth estimation.

depth
utility
background texture
fal-ai/retoucher

Automatically retouches faces to smooth skin and remove blemishes.

editing
background texture
fal-ai/face-to-sticker

Create stickers from faces.

sticker
editing
background texture
fal-ai/lora/image-to-image

Run any Stable Diffusion model with customizable LoRA weights.

diffusion
lora
customization
fine-tuning
background texture
fal-ai/fast-sdxl/image-to-image

Run SDXL at the speed of light

diffusion
high-res
lora
ip-adapter
controlnet
background texture
fal-ai/fast-sdxl/inpainting

Run SDXL at the speed of light

diffusion
high-res
lora
ip-adapter
controlnet
background texture
fal-ai/lora/inpaint

Run any Stable Diffusion model with customizable LoRA weights.

diffusion
lora
customization
fine-tuning
background texture
fal-ai/omni-zero

Any pose, any style, any identity

style transfer
background texture
fal-ai/leffa/virtual-tryon

Leffa Virtual TryOn is a high-quality, image-based try-on endpoint that can be used for commercial try-on.

try-on
fashion
clothing
background texture
fal-ai/leffa/pose-transfer

Leffa Pose Transfer is an endpoint for changing the pose of the subject in an image using a reference image.

pose
utility
background texture
fal-ai/cat-vton

Image-based virtual try-on

try-on
fashion
clothing
background texture
fal-ai/dwpose

Predict poses.

pose
utility
background texture
fal-ai/florence-2-large/object-detection

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

detection
multimodal
vision
background texture
fal-ai/florence-2-large/dense-region-caption

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
background texture
fal-ai/florence-2-large/region-proposal

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
background texture
fal-ai/florence-2-large/caption-to-phrase-grounding

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
background texture
fal-ai/florence-2-large/referring-expression-segmentation

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
segmentation
background texture
fal-ai/florence-2-large/region-to-segmentation

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
segmentation
background texture
fal-ai/florence-2-large/open-vocabulary-detection

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
detection
background texture
fal-ai/florence-2-large/ocr-with-region

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

ocr
multimodal
vision
background texture
fal-ai/era-3d

A powerful image to novel multiview model with normals.

background texture
fal-ai/live-portrait/image

Transfer expression from a video to a portrait.

expression
animation
background texture
fal-ai/kolors/image-to-image

Photorealistic Image-to-Image

realism
editing
diffusion
background texture
fal-ai/sdxl-controlnet-union/image-to-image

An efficient SDXL multi-controlnet image-to-image model.

diffusion
controlnet
composition
background texture
fal-ai/sdxl-controlnet-union/inpainting

An efficient SDXL multi-controlnet inpainting model.

diffusion
controlnet
composition
background texture
fal-ai/sam2/image

SAM 2 is a model for segmenting images and videos in real-time.

segmentation
mask
real-time
background texture
fal-ai/workflowutils/canny

Various image preprocessing tools for ControlNet and other applications.

preprocess
controlnet
utility
editing
background texture
fal-ai/workflowutils/canny

Canny edge detection preprocessor.

preprocess
controlnet
utility
detection
background texture
fal-ai/image-preprocessors/depth-anything/v2

Depth Anything v2 preprocessor.

depth
preprocess
utility
controlnet
background texture
fal-ai/image-preprocessors/hed

Holistically-Nested Edge Detection (HED) preprocessor.

preprocess
detection
utility
controlnet
background texture
fal-ai/image-preprocessors/lineart

Line art preprocessor.

preprocess
utility
sketch
controlnet
background texture
fal-ai/image-preprocessors/midas

MiDaS depth estimation preprocessor.

depth
preprocess
utility
controlnet
background texture
fal-ai/image-preprocessors/mlsd

M-LSD line segment detection preprocessor.

preprocess
utility
controlnet
background texture
fal-ai/image-preprocessors/pidi

PIDI (Pidinet) preprocessor.

detection
preprocess
utility
controlnet
background texture
fal-ai/image-preprocessors/sam

Segment Anything Model (SAM) preprocessor.

segmentation
preprocess
utility
mask
controlnet
background texture
fal-ai/image-preprocessors/scribble

Scribble preprocessor.

preprocess
utility
editing
controlnet
sketch
background texture
fal-ai/image-preprocessors/teed

TEED (Tiny and Efficient Edge Detector) preprocessor.

preprocess
detection
utility
controlnet
background texture
fal-ai/image-preprocessors/zoe

ZoeDepth preprocessor.

depth
preprocess
utility
controlnet
background texture
fashn/tryon

FASHN delivers precise virtual try-on capabilities, accurately rendering garment details like text and patterns at 576x864 resolution from both on-model and flat-lay photo references.

try-on
fashion
clothing
background texture
fal-ai/moondream-next/detection

MoonDreamNext Detection is a multimodal vision-language model for gaze detection, bbox detection, point detection, and more.

multimodal
background texture
fal-ai/recraft-clarity-upscale

Enhances a given raster image using the 'clarity upscale' tool, increasing image resolution and making the image sharper and cleaner.

upscaling
background texture
fal-ai/recraft-creative-upscale

Enhances a given raster image using the 'creative upscale' tool, boosting resolution with a focus on refining small details and faces.

upscaling
background texture
110602490/workflowutils/teed

A fast SOTA edge detector

utils
background texture
110602490/workflowutils/invert-mask

Invert a mask

utils
background texture
110602490/workflowutils/blur-mask

Blur a mask

utils
background texture
110602490/workflowutils/grow-mask

Grow a mask

utils
background texture
110602490/workflowutils/shrink-mask

Shrink a mask

utils
background texture
110602490/workflowutils/transparent-image-to-mask

Convert a transparent image to a mask

utils
background texture
110602490/workflowutils/resize-image

Resize Image

utils
background texture
110602490/workflowutils/resize-to-max-pixels

Resize Image to Max Pixels

utils
background texture
110602490/workflowutils/image-size

Get an Image's Size

utils
background texture
110602490/workflowutils/composite-image

Composite Image

utils
background texture
110602490/workflowutils/insightface

face

utils
background texture
110602490/workflowutils/rgba-to-rgb

Convert an image with an alpha channel to RGB with a background color

utils
background texture
fal-ai/workflowutils/canny

Detect edges in an image using the Canny edge detector

utils
background texture
fal-ai/sam-hq

SAM HQ is a model for high-quality zero-shot segmentation.

inference
mask
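
As a rough illustration of what the `rgba-to-rgb` utility listed above describes, flattening an alpha channel onto a background color is conventionally done by alpha compositing each pixel. The sketch below is plain Python on pixel tuples; the function names are hypothetical, and nothing here is taken from the endpoint's actual implementation.

```python
from typing import List, Tuple

RGBA = Tuple[int, int, int, int]
RGB = Tuple[int, int, int]


def flatten_pixel(pixel: RGBA, background: RGB) -> RGB:
    """Composite one RGBA pixel (0-255 channels) over an opaque background."""
    r, g, b, a = pixel
    alpha = a / 255.0
    # Standard "over" operator: out = alpha * fg + (1 - alpha) * bg
    return tuple(
        round(alpha * fg + (1.0 - alpha) * bg)
        for fg, bg in zip((r, g, b), background)
    )


def flatten_image(pixels: List[RGBA], background: RGB = (255, 255, 255)) -> List[RGB]:
    """Flatten a list of RGBA pixels; white is an assumed default background."""
    return [flatten_pixel(p, background) for p in pixels]
```

A fully opaque pixel keeps its own color and a fully transparent pixel takes the background color; everything in between is a weighted blend of the two.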

JSON

background texture
fal-ai/ffmpeg-api/metadata

Get encoding metadata from video and audio files using FFmpeg API.

ffmpeg
background texture
fal-ai/ffmpeg-api/waveform

Get waveform data from audio files using FFmpeg API.

ffmpeg

Speech to Text

background texture
fal-ai/whisper

Whisper is a model for speech transcription and translation.

transcription
translation
speech
background texture
fal-ai/wizper

[Experimental] Whisper v3 Large -- but optimized by our inference wizards. Same WER, double the performance!

transcription
speech

Large Language Models

background texture
fal-ai/any-llm

Use any large language model from our selected catalogue (powered by OpenRouter)

chat
claude
gpt
streaming

Vision

background texture
fal-ai/any-llm/vision

Use any vision language model from our selected catalogue (powered by OpenRouter)

multimodal
vision
streaming
background texture
fal-ai/llavav15-13b

Vision

multimodal
vision
background texture
fal-ai/llava-next

Vision

multimodal
vision
background texture
fal-ai/imageutils/nsfw

Predict the probability of an image being NSFW.

filter
safety
utility
background texture
fal-ai/moondream/batched

Answer questions from the images.

multimodal
vision
background texture
fal-ai/florence-2-large/caption

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

captioning
multimodal
vision
background texture
fal-ai/florence-2-large/detailed-caption

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

captioning
multimodal
vision
background texture
fal-ai/florence-2-large/more-detailed-caption

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

captioning
multimodal
vision
background texture
fal-ai/florence-2-large/region-to-category

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
background texture
fal-ai/florence-2-large/region-to-description

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
background texture
fal-ai/florence-2-large/ocr

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

ocr
multimodal
vision
background texture
fal-ai/sa2va/8b/image

Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels

multimodal
vision
background texture
fal-ai/sa2va/8b/video

Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels

multimodal
vision
background texture
fal-ai/sa2va/4b/image

Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels

multimodal
vision
background texture
fal-ai/sa2va/4b/video

Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels

multimodal
vision
background texture
fal-ai/mini-cpm

Multimodal vision-language model for single/multi image understanding

multimodal
vision
background texture
fal-ai/mini-cpm/video

Multimodal vision-language model for video understanding

multimodal
vision
background texture
fal-ai/moondream-next

MoonDreamNext is a multimodal vision-language model for captioning, gaze detection, bbox detection, point detection, and more.

multimodal
vision
background texture
fal-ai/moondream-next/batch

MoonDreamNext Batch is a multimodal vision-language model for batch captioning.

multimodal

Text to Speech

background texture
fal-ai/playai/tts/v3

Blazing-fast text-to-speech. Generate audio with improved emotional tones and extensive multilingual support. Ideal for high-volume processing and efficient workflows.

background texture
fal-ai/playai/tts/dialog

Generate natural-sounding multi-speaker dialogues. Perfect for expressive outputs, storytelling, games, animations, and interactive media.

Request - Inputs
image_url
ic_light_model_background_image_url
image_url
model
operating_resolution
output_format
output_mask
refine_foreground
image
mask_image
model_name
unet_name
variant
prompt
negative_prompt
prompt_weighting
loras
embeddings
controlnets
controlnet_guess_mode
ip_adapter
image_encoder_path
image_encoder_subfolder
image_encoder_weight_name
ic_light_model_url
ic_light_model_background_image_url
ic_light_image_url
seed
image_size
num_inference_steps
guidance_scale
clip_skip
scheduler
timesteps
timesteps.method
timesteps.array
sigmas
sigmas.method
sigmas.array
prediction_type
rescale_betas_snr_zero
image_format
num_images
enable_safety_checker
tile_width
tile_height
tile_stride_width
tile_stride_height
eta
debug_latents
debug_per_pass_latents
images
seed
has_nsfw_concepts
debug_latents
debug_per_pass_latents
image_url
width
height
mode
resampling
scaling_proportions
cropping_position
padding_color
image
image_url
transparent_color
transparent_color.r
transparent_color.g
transparent_color.b
image
sd_loras
images
result
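
The `image_url` and `transparent_color.r` / `.g` / `.b` fields near the end of the list above suggest a nested input object. A minimal sketch of assembling such an input in Python follows. It assumes those fields belong to a single request and that each color channel is an integer in 0-255; the source only lists the field names, so both points, along with the helper name `build_rgba_to_rgb_input`, are assumptions.

```python
from typing import Any, Dict


def build_rgba_to_rgb_input(image_url: str, r: int, g: int, b: int) -> Dict[str, Any]:
    """Assemble an input dict using the field names listed above.

    Assumption: transparent_color is a nested object with integer r/g/b
    channels in 0-255; the source gives field names only, not types.
    """
    if not image_url:
        raise ValueError("image_url must be a non-empty string")
    for name, value in (("r", r), ("g", g), ("b", b)):
        if not 0 <= value <= 255:
            raise ValueError(f"transparent_color.{name} must be in 0-255, got {value}")
    return {
        "image_url": image_url,
        "transparent_color": {"r": r, "g": g, "b": b},
    }
```

For example, `build_rgba_to_rgb_input("https://example.com/cutout.png", 255, 255, 255)` (a placeholder URL) yields a dict whose keys mirror the listed field names. Validating locally like this only catches obvious mistakes before a request is sent; the endpoint's real schema remains the authority.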