High-quality avatar videos that feel real, generated from your text
argil/avatars/text-to-video
text-to-video

Reimagine uses a structure reference for generating new images while preserving the structure of an input image, guided by text prompts.
Perfect for transforming sketches, illustrations, or photos into new illustrations. Trained exclusively on licensed data
bria/reimagine/3.2
image-to-image

bria
Animate a reference image with a driving video using ControlNeXt.
controlnext
video-to-video

animation
stylized
Run SDXL at the speed of light
fast-lcm-diffusion/inpainting
image-to-image

lcm
diffusion
turbo
Run SDXL at the speed of light
fast-lightning-sdxl/inpainting
image-to-image

diffusion
lightning
The model provides high-quality image editing capabilities.
flowedit
image-to-image

editing
HunyuanPortrait is a diffusion-based framework for generating lifelike, temporally consistent portrait animations.
hunyuan-portrait
image-to-video

animation
lip sync
Image to Video for the Hunyuan Video model using a custom trained LoRA.
hunyuan-video-img2vid-lora
image-to-video

motion
Add realistic weather effects like snowfall, rain, or fog to your photos while maintaining the scene's mood.
image-editing/weather-effect
image-to-image

stylized
transform
Generate videos from prompts, images, and videos using LTX Video-0.9.7 and custom LoRA
ltx-video-lora/multiconditioning
video-to-video

video
ltx-video
multicondition-to-video
Generate videos from prompts and videos using LTX Video-0.9.5
ltx-video-v095/extend
video-to-video

video
Extend videos using LTX Video-0.9.8 13B Distilled and custom LoRA
ltxv-13b-098-distilled/extend
video-to-video

ltx-video
extend
MAGI-1 distilled is a faster video generation model with exceptional understanding of physical interactions and cinematic prompts
magi-distilled
text-to-video

Add a darkening vignette effect around the edges of the image with adjustable strength
post-processing/vignette
image-to-image

stylized
transform
Use the 6D pose estimation capabilities of PSHuman to generate 3D files from a single image.
pshuman
image-to-3d

image-to-3d
Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels
sa2va/8b/video
vision

multimodal
Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
sadtalker/reference
image-to-video

animation
An open-source, community-driven, native audio turn detection model by Pipecat AI.
smart-turn
speech-to-text

Extend videos with audio using LTX-2 Distilled and custom LoRA
ltx-2-19b/distilled/extend-video/lora
video-to-video

Generate a full portrait from a cropped face photo
qwen-image-edit-2509-lora-gallery/face-to-full-portrait
image-to-image

stylized
transform
Generate video with audio from videos using LTX-2 Distilled and custom LoRA
ltx-2-19b/distilled/video-to-video/lora
video-to-video

Extend video with audio using LTX-2 and custom LoRA
ltx-2-19b/extend-video/lora
video-to-video

Create group photos
qwen-image-edit-2509-lora-gallery/group-photo
image-to-image

stylized
transform
Enhance warped, folded documents with the superior quality of docres for sharper, clearer results.
docres/dewarp
image-to-image

image-enhancement
FLUX.1 Differential Diffusion is a rapid endpoint that enables swift, granular control over image transformations through change maps, delivering fast and precise region-specific modifications while maintaining FLUX.1 [dev]'s high-quality output.
flux-differential-diffusion
image-to-image

transformation
Generate expressive, natural speech with Resemble AI's Chatterbox. Features unique emotion control, instant voice cloning from short audio, and built-in watermarking.
resemble-ai/chatterboxhd/text-to-speech
text-to-speech

Structured prompt generation endpoint for Fibo-Lite, Bria's SOTA open-source model
bria/fibo-lite/generate/structured_prompt/lite
text-to-json

bria
structured-prompting
Bria’s Text-to-Image model, trained exclusively on licensed data for safe and risk-free commercial use. Excels at text rendering and aesthetics.
bria/text-to-image/3.2
text-to-image

image generation