Generate high quality music and sound effects using Stable Audio 2.5 from StabilityAI
stable-audio-25/inpaint
audio-to-audio

Generate high quality music and sound effects using Stable Audio 2.5 from StabilityAI

audio
Apply Gaussian or Kuwahara blur effects with adjustable radius and sigma parameters
post-processing/blur
image-to-image

Apply Gaussian or Kuwahara blur effects with adjustable radius and sigma parameters

stylized
transform
DreamO is an image customization framework designed to support a wide range of tasks while facilitating seamless integration of multiple conditions.
dreamo
text-to-image

DreamO is an image customization framework designed to support a wide range of tasks while facilitating seamless integration of multiple conditions.

stylized
realism
Turn any image into a cute plushie!
plushify
image-to-image

Turn any image into a cute plushie!

MAGI-1 distilled extends videos faster with an exceptional understanding of physical interactions and prompts
magi-distilled/extend-video
video-to-video

MAGI-1 distilled extends videos faster with an exceptional understanding of physical interactions and prompts

video-extend
A general purpose endpoint for the FLUX.1 [dev] model, implementing the RF-Inversion pipeline. This can be used to edit a reference image based on a prompt.
flux-general/rf-inversion
image-to-image

A general purpose endpoint for the FLUX.1 [dev] model, implementing the RF-Inversion pipeline. This can be used to edit a reference image based on a prompt.

rf-inversion
editing
lora
One-to-All Animation is a pose driven video model that animates characters from a single reference image, enabling flexible, alignment-free motion transfer across diverse styles and scenes
one-to-all-animation/14b
video-to-video

One-to-All Animation is a pose driven video model that animates characters from a single reference image, enabling flexible, alignment-free motion transfer across diverse styles and scenes

video to video
motion
Upscales and cleans up the image.
chrono-edit-lora-gallery/upscaler
image-to-image

Upscales and cleans up the image.

upscale
details
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
florence-2-large/region-to-segmentation
image-to-image

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
vision
segmentation
Generate fast high quality video clips from text and image prompts using PixVerse v4
pixverse/v4/image-to-video/fast
image-to-video

Generate fast high quality video clips from text and image prompts using PixVerse v4

Generate video with audio from text using LTX-2 Distilled and custom LoRA
ltx-2-19b/distilled/text-to-video/lora
text-to-video

Generate video with audio from text using LTX-2 Distilled and custom LoRA

Generate video with audio from text using LTX-2 and custom LoRA
ltx-2-19b/text-to-video/lora
text-to-video

Generate video with audio from text using LTX-2 and custom LoRA

Generate video with audio from images using LTX-2 Distilled and custom LoRA
ltx-2-19b/distilled/image-to-video/lora
image-to-video

Generate video with audio from images using LTX-2 Distilled and custom LoRA

Run Any Stable Diffusion model with customizable LoRA weights.
lora/inpaint
image-to-image

Run Any Stable Diffusion model with customizable LoRA weights.

diffusion
lora
customization
Transform your character's hair into broccoli style while keeping the original characters likeness
image-editing/broccoli-haircut
image-to-image

Transform your character's hair into broccoli style while keeping the original characters likeness

stylized
transform
Generate video with audio from audio, text and images using LTX-2.3 Distilled and custom LoRA
ltx-2.3-22b/distilled/audio-to-video/lora
audio-to-video

Generate video with audio from audio, text and images using LTX-2.3 Distilled and custom LoRA

HDR surrealistic effect with intense colors
flux-2-lora-gallery/hdr-style
text-to-image

HDR surrealistic effect with intense colors

stylized
transform
Create cinematic transitions and scene progressions (camera movements, framing changes)
qwen-image-edit-plus-lora-gallery/next-scene
image-to-image

Create cinematic transitions and scene progressions (camera movements, framing changes)

stylized
transform
LoRA endpoint for the Chrono Edit model.
chrono-edit-lora
image-to-image

LoRA endpoint for the Chrono Edit model.

image-editing
Reduce color saturation using different methods (luminance Rec.709, luminance Rec.601, average, lightness) with adjustable factor.
post-processing/desaturate
image-to-image

Reduce color saturation using different methods (luminance Rec.709, luminance Rec.601, average, lightness) with adjustable factor.

stylized
transform
A specialized FLUX endpoint combining differential diffusion control with LoRA, ControlNet, and IP-Adapter support, enabling precise, region-specific image transformations through customizable change maps.
flux-general/differential-diffusion
image-to-image

A specialized FLUX endpoint combining differential diffusion control with LoRA, ControlNet, and IP-Adapter support, enabling precise, region-specific image transformations through customizable change maps.

lora
controlnet
ip-adapter
Transforms images into comic book style
flux-2-lora-gallery/digital-comic-art
text-to-image

Transforms images into comic book style

stylized
transform
A unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment. Lighter 1B variant
tada/1b/text-to-speech
audio-to-audio

A unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment. Lighter 1B variant

Extend videos with audio using LTX-2 Distilled
ltx-2-19b/distilled/extend-video
video-to-video

Extend videos with audio using LTX-2 Distilled

Superfast video model based on Wan 2.1 14b by Krea, excelling at real-time video-editing.
krea-wan-14b/video-to-video
video-to-video

Superfast video model based on Wan 2.1 14b by Krea, excelling at real-time video-editing.

LoRA trainer for ERNIE-Image, Baidu's powerful 8B-parameter text-to-image model.
ernie-image-trainer
training

LoRA trainer for ERNIE-Image, Baidu's powerful 8B-parameter text-to-image model.

lora
personalization
trainer
MultiTalk model generates a multi-person conversation video from an image and text inputs. Converts text to speech for each person, generating a realistic conversation scene.
ai-avatar/multi-text
image-to-video

MultiTalk model generates a multi-person conversation video from an image and text inputs. Converts text to speech for each person, generating a realistic conversation scene.

stylized
transform
Scribble preprocessor.
image-preprocessors/scribble
image-to-image

Scribble preprocessor.

preprocess
utility
editing
Showing 1149 to 1176 of 1354 results