Applies sepia vintage effect to images
flux-2-lora-gallery/sepia-vintage
text-to-image

Applies sepia vintage effect to images

stylized
transform
Switti is a scale-wise transformer for fast text-to-image generation that outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being faster than distilled diffusion models.
switti/512
text-to-image

Switti is a scale-wise transformer for fast text-to-image generation that outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being faster than distilled diffusion models.

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
florence-2-large/region-to-description
vision

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks

multimodal
TEED (Temporal Edge Enhancement Detection) preprocessor.
image-preprocessors/teed
image-to-image

TEED (Temporal Edge Enhancement Detection) preprocessor.

preprocess
detection
utility
Apply designs/graphics onto people's shirts
qwen-image-edit-plus-lora-gallery/shirt-design
image-to-image

Apply designs/graphics onto people's shirts

stylized
transform
Edit outfits, objects, faces, or restyle your video - all with maximum detail retention.
decart/lucy-edit/pro
video-to-video

Edit outfits, objects, faces, or restyle your video - all with maximum detail retention.

video-edit
Generate video with audio from images using LTX-2 and custom LoRA
ltx-2-19b/image-to-video/lora
image-to-video

Generate video with audio from images using LTX-2 and custom LoRA

Edit images from your prompts using Luma Photon. Photon is the most creative, personalizable, and intelligent visual models for creatives, bringing a step-function change in the cost of high-quality image generation.
luma-photon/flash/modify
image-to-image

Edit images from your prompts using Luma Photon. Photon is the most creative, personalizable, and intelligent visual models for creatives, bringing a step-function change in the cost of high-quality image generation.

SCAIL is a character animation model that uses 3D consistent pose representations to animate reference images with coherent motion, supporting complex movements.
scail
video-to-video

SCAIL is a character animation model that uses 3D consistent pose representations to animate reference images with coherent motion, supporting complex movements.

Heygen Avatar V3 Model for Digital Twin
heygen/avatar3/digital-twin
text-to-video

Heygen Avatar V3 Model for Digital Twin

FFMPEG Utility for Audio Compression
workflow-utilities/audio-compressor
audio-to-audio

FFMPEG Utility for Audio Compression

Generate short video clips from your images using SVD v1.1 at Lightning Speed
fast-svd-lcm/text-to-video
text-to-video

Generate short video clips from your images using SVD v1.1 at Lightning Speed

lcm
diffusion
turbo
Semantic image alignment measurements
arbiter/image/text
vision

Semantic image alignment measurements

clip-score
Extend video with audio using LTX-2.3 and custom LoRA
ltx-2.3-22b/extend-video/lora
video-to-video

Extend video with audio using LTX-2.3 and custom LoRA

Generates satellite/aerial view style images
flux-2-lora-gallery/satellite-view-style
text-to-image

Generates satellite/aerial view style images

stylized
transform
Switti is a scale-wise transformer for fast text-to-image generation that outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being faster than distilled diffusion models.
switti
text-to-image

Switti is a scale-wise transformer for fast text-to-image generation that outperforms existing T2I AR models and competes with state-of-the-art T2I diffusion models while being faster than distilled diffusion models.

Restyle videos up to 30 min long - maintaining maximum detail quality.
decart/lucy-restyle
video-to-video

Restyle videos up to 30 min long - maintaining maximum detail quality.

video-edit
High-fidelity keypoint-driven video object removal - minimal input, strong temporal consistency. Trained on licensed data for risk-free commercial video editing.
bria/video/erase/keypoints
video-to-video

High-fidelity keypoint-driven video object removal - minimal input, strong temporal consistency. Trained on licensed data for risk-free commercial video editing.

bria
video
erase
Generate video with audio from videos using LTX-2 and custom LoRA
ltx-2-19b/video-to-video/lora
video-to-video

Generate video with audio from videos using LTX-2 and custom LoRA

Fast Text-to-Video endpoint for Krea's Wan 14b model.
krea-wan-14b/text-to-video
text-to-video

Fast Text-to-Video endpoint for Krea's Wan 14b model.

text to video
fast
Reference-free image measurements
arbiter/image
vision

Reference-free image measurements

arniqa
nima
iqa
LongCat-Video-Avatar is an audio-driven video generation model that can generates super-realistic, lip-synchronized long video generation with natural dynamics and consistent identity.
longcat-multi-avatar/image-audio-to-video
audio-to-video

LongCat-Video-Avatar is an audio-driven video generation model that can generates super-realistic, lip-synchronized long video generation with natural dynamics and consistent identity.

image-to-video
Re-animate your videos in lightning speed!
fast-animatediff/turbo/video-to-video
video-to-video

Re-animate your videos in lightning speed!

animation
stylized
turbo
Maya1 is a state-of-the-art speech model by Maya Research for expressive voice generation, built to capture real human emotion and precise voice design.
maya/stream
text-to-speech

Maya1 is a state-of-the-art speech model by Maya Research for expressive voice generation, built to capture real human emotion and precise voice design.

tts
Train LoRAs for the Qwen-Image-Layered model, customize how images are split into layers.
qwen-image-layered-trainer
training

Train LoRAs for the Qwen-Image-Layered model, customize how images are split into layers.

qwen
layer
trainer
An efficent SDXL multi-controlnet inpainting model.
sdxl-controlnet-union/inpainting
image-to-image

An efficent SDXL multi-controlnet inpainting model.

diffusion
controlnet
composition
Generate high quality and fast video clips from text and image prompts using PixVerse v4 fast
pixverse/v4/text-to-video/fast
text-to-video

Generate high quality and fast video clips from text and image prompts using PixVerse v4 fast

Train Ideogram on your photos, your style, your subject, your look, from a small set of reference images to images that feel consistently yours
new
ideogram/custom-models
training

Train Ideogram on your photos, your style, your subject, your look, from a small set of reference images to images that feel consistently yours

stylized
transform
Showing 1205 to 1232 of 1355 results