
Generate high-quality music and sound effects using Stable Audio 2.5 from Stability AI

Apply Gaussian or Kuwahara blur effects with adjustable radius and sigma parameters

DreamO is an image customization framework designed to support a wide range of tasks while facilitating seamless integration of multiple conditions.

Turn any image into a cute plushie!

MAGI-1 distilled extends videos faster with an exceptional understanding of physical interactions and prompts

A general-purpose endpoint for the FLUX.1 [dev] model, implementing the RF-Inversion pipeline. This can be used to edit a reference image based on a prompt.

One-to-All Animation is a pose-driven video model that animates characters from a single reference image, enabling flexible, alignment-free motion transfer across diverse styles and scenes

Upscales and cleans up the image.

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

Quickly generate high-quality video clips from text and image prompts using PixVerse v4

Generate video with audio from text using LTX-2 Distilled and custom LoRA

Generate video with audio from text using LTX-2 and custom LoRA

Generate video with audio from images using LTX-2 Distilled and custom LoRA

Run any Stable Diffusion model with customizable LoRA weights.

Transform your character's hair into a broccoli style while keeping the original character's likeness

Generate video with audio from audio, text and images using LTX-2.3 Distilled and custom LoRA

HDR surrealistic effect with intense colors

Create cinematic transitions and scene progressions (camera movements, framing changes)

LoRA endpoint for the Chrono Edit model.

Reduce color saturation using different methods (luminance Rec.709, luminance Rec.601, average, lightness) with adjustable factor.
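
The four methods named above correspond to well-known grayscale conversions, so a minimal sketch may help show how they differ. This is an illustration only, not the endpoint's implementation: the function names, the method strings, and the meaning of `factor` (assumed here to be a linear blend where 0.0 leaves the pixel unchanged and 1.0 makes it fully gray) are all assumptions. Pixels are plain RGB tuples with channels in the range 0.0 to 1.0.

```python
# Luma weights from ITU-R BT.709 and BT.601.
LUMA_WEIGHTS = {
    "luminance_rec709": (0.2126, 0.7152, 0.0722),
    "luminance_rec601": (0.299, 0.587, 0.114),
}


def gray_value(r, g, b, method):
    """Return the gray level of one RGB pixel for the chosen method."""
    if method in LUMA_WEIGHTS:
        wr, wg, wb = LUMA_WEIGHTS[method]
        return wr * r + wg * g + wb * b
    if method == "average":
        return (r + g + b) / 3.0
    if method == "lightness":
        # Midpoint between the brightest and darkest channel.
        return (max(r, g, b) + min(r, g, b)) / 2.0
    raise ValueError(f"unknown method: {method}")


def desaturate(pixel, method="luminance_rec709", factor=1.0):
    """Blend a pixel toward its gray level.

    factor is an assumed parameter: 0.0 keeps the original color,
    1.0 yields a fully gray pixel.
    """
    r, g, b = pixel
    gray = gray_value(r, g, b, method)
    return tuple(c + (gray - c) * factor for c in (r, g, b))


if __name__ == "__main__":
    red = (1.0, 0.0, 0.0)
    for name in ("luminance_rec709", "luminance_rec601", "average", "lightness"):
        print(name, desaturate(red, name, factor=1.0))
```

The luminance methods weight green most heavily, so a pure red pixel comes out much darker under Rec.709 or Rec.601 than under the average or lightness methods.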

A specialized FLUX endpoint combining differential diffusion control with LoRA, ControlNet, and IP-Adapter support, enabling precise, region-specific image transformations through customizable change maps.

Transforms images into comic book style

A unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment. This is the lighter 1B variant.

Extend videos with audio using LTX-2 Distilled

Superfast video model based on Wan 2.1 14B by Krea, excelling at real-time video editing.

LoRA trainer for ERNIE-Image, Baidu's powerful 8B-parameter text-to-image model.

The MultiTalk model generates a multi-person conversation video from an image and text inputs. It converts text to speech for each person to produce a realistic conversation scene.

Scribble preprocessor.