Model Gallery
Flux is here!
Discover the latest in text-to-image technology with enhanced multi-subject capabilities, improved image quality, and better spelling accuracy.
Explore Models
AuraFlow
Fully open flow based text to image model
FLUX.1 [dev]
FLUX.1, a 12B parameters text-to-image model with outstanding aesthetics.
FLUX.1 [dev] with LoRAs
Super fast endpoint for the FLUX.1 [dev] model with LoRA support.
FLUX Realism LoRA
FLUX Realism LoRA is a cutting edge model for generating realistic images with the SOTA Flux Model.
FLUX.1 [schnell]
A distilled version of FLUX.1 that operates up to 10 times faster.
FLUX.1 [pro]
The pro version of FLUX.1, served in partnership with BFL
FLUX.1 [dev] with Controlnets and Loras
A general purpose endpoint for the FLUX.1 [dev] model, which can be used with a variety of extensions including any LoRA support.
CogVideoX-5B
Generate videos from prompts using CogVideoX-5B
FLUX.1 [dev] Differential Diffusion
Differential diffusion implementation for FLUX.1 [dev].
Stable Diffusion V3
Run SD3 at the speed of light
Stable Diffusion XL
Run SDXL at the speed of light
Stable Diffusion with LoRAs
Run Any Stable Diffusion model with customizable LoRA weights.
AuraSR
Upscale your images with AuraSR.
Stable Cascade
Stable Cascade: Image generation on a smaller & cheaper latent space.
High Quality Stable Video Diffusion
Generate short video clips from your images using SVD v1.1
Luma Dream Machine
Generate video clips from your prompts using Luma Dream Machine v1.5
Birefnet Background Removal
bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS)
Creative Upscaler
Create creative upscaled images.
Clarity Upscaler
Clarity upscaler for images with high fidelity.
CCSR Upscaler
SOTA Image Upscaler
Stable Diffusion Turbo (v1.5/XL)
Run SDXL at the speed of light
Latent Consistency Models (v1.5/XL)
Run SDXL at the speed of light
Whisper
Whisper is a model for speech transcription and translation.
Wizper (Whisper v3 -- fal.ai edition)
[Experimental] Whisper v3 Large -- but optimized by our inference wizards. Same WER, double the performance!
Stable Diffusion XL Lightning
Run SDXL at the speed of light
Hyper SDXL
Hyper-charge SDXL's performance and creativity.
Playground v2.5
State-of-the-art open-source model in aesthetic quality
AMT Interpolation
Interpolate between video frames
T2V Turbo - Video Crafter
Generate short video clips from your prompts
SD 1.5 Depth ControlNet
SD 1.5 ControlNet
PhotoMaker
Customizing Realistic Human Photos via Stacked ID Embedding
Latent Consistency (SDXL & SDv1.5)
Produce high-quality images with minimal inference steps.
Optimized Latent Consistency (SDv1.5)
Produce high-quality images with minimal inference steps. Optimized for 512x512 input image size.
Fooocus
Default parameters with automated optimizations and quality improvements.
AnimateDiff Video-to-Video Evolved
Re-animate your videos with evolved consistency!
AnimateDiff
Animate your ideas!
AnimateDiff Turbo
Animate your ideas in lightning speed!
Illusion Diffusion
Create illusions conditioned on image.
Midas Depth Estimation
Create depth maps using Midas depth estimation.
Remove Background
Remove the background from an image.
Upscale Images
Upscale images by a given factor.
ControlNet SDXL
Generate Images with ControlNet.
Inpainting sdxl and sd
Inpaint images with SD and SDXL
Animatediff SparseCtrl LCM
Animate Your Drawings with Latent Consistency Models!
PuLID
Tuning-free ID customization.
IP Adapter Face ID
High quality zero-shot personalization
Marigold Depth Estimation
Create depth maps using Marigold depth estimation.
Stable Audio Open
Open source text-to-audio model.
DiffusionEdge
Diffusion based high quality edge detection
TripoSR
State of the art Image to 3D Object generation
Face Retoucher
Automatically retouches faces to smooth skin and remove blemishes.
LLaVA v1.5 13B
Vision
LLaVA v1.6 34B
Vision
NSFW Filter
Predict the probability of an image being NSFW.
Face to Sticker
Create stickers from faces.
Moondream
Answer questions from the images.
Sad Talker
Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Stable Diffusion with LoRAs
Run Any Stable Diffusion model with customizable LoRA weights.
Stable Diffusion XL
Run SDXL at the speed of light
Stable Diffusion XL
Run SDXL at the speed of light
Stable Diffusion with LoRAs
Run Any Stable Diffusion model with customizable LoRA weights.
PixArt-Σ
Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Dreamshaper
Dreamshaper model.
Realistic Vision
Generate realistic images.
Lightning Models
Collection of SDXL Lightning models.
Omni Zero
Any pose, any style, any identity
Virtual Try-On
Image based Virtual Try-On
DWPose Pose Prediction
Predict poses.
SoteDiffusion
Anime finetune of Würstchen V3.
Florence-2 Large
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Live Portrait
Transfer expression from a video to a portrait.
Kolors
Photorealistic Text-to-Image
SDXL ControlNet Union
An efficent SDXL multi-controlnet text-to-image model.
SDXL ControlNet Union
An efficent SDXL multi-controlnet image-to-image model.
SDXL ControlNet Union
An efficent SDXL multi-controlnet inpainting model.
Segment Anything Model 2
SAM 2 is a model for segmenting images and videos in real-time.
MiniCPM-V 2.6
Multimodal vision-language model for single/multi image and video understanding
ControlNeXt SVD
Animate a reference image with a driving video using ControlNeXt.
Image Preprocessors
Various image preprocessing tools for ControlNet and other applications.