VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.
wan-vace-14b/outpainting
video-to-video

image-to-video
text-to-video
Generate long videos in 720p/30fps from text using LongCat Video
longcat-video/text-to-video/720p
text-to-video

Generate long speech snippets fast using Microsoft's powerful TTS.
vibevoice/0.5b
text-to-speech

vibevoice
fast
Start with a simple text input to create dynamic generations that defy expectations. Anything you dream can come to life with sharp details, impressive character control and cinematic camera moves.
pika/v2.1/text-to-video
text-to-video

editing
effects
animation
Generate images with ControlNet.
fast-sdxl-controlnet-canny/image-to-image
image-to-image

diffusion
controlnet
editing
Generate 3D models from your images using Hunyuan 3D. A native 3D generative model enabling versatile and high-quality 3D asset creation.
hunyuan3d/v2/mini
image-to-3d

stylized
Train custom LoRAs for Wan-2.2 T2V/I2V 480P
wan-22-trainer/i2v-a14b
training

lora
video
Edit images with natural language
hidream-e1-1
image-to-image

Removes harsh shadows and light spots from images, replacing them with soft, even, natural-looking illumination.
qwen-image-edit-2509-lora-gallery/lighting-restoration
image-to-image

stylized
transform
Anime finetune of Würstchen V3.
stable-cascade/sote-diffusion
text-to-image

lcm
stylized
Train styles, people and other subjects at blazing speeds using the FLUX.1 Krea [dev] base model.
flux-krea-trainer
training

lora
personalization
Apply a film grain effect in different styles (modern, analog, kodak, fuji, cinematic, newspaper) with customizable intensity and scale.
post-processing/grain
image-to-image

stylized
transform
Generate videos from prompts using LTX Video-0.9.7 13B and custom LoRA
ltx-video-13b-dev
text-to-video

video
ltx-video
Get waveform data from audio files using the FFmpeg API.
ffmpeg-api/waveform
json

ffmpeg
High-quality text-to-image model by Baidu. Supports English, Chinese, and Japanese prompts with built-in prompt expansion.
ernie-image/lora
text-to-image

Erase unwanted objects, people, or elements from video with a text prompt. High-fidelity output with strong temporal consistency, trained on licensed data for safe commercial use.
bria/video/erase/prompt
video-to-video

bria
video
erase
Hunyuan Video is an open video generation model with high visual quality, motion diversity, text-video alignment, and generation stability. Use this endpoint to generate videos from videos.
hunyuan-video/video-to-video
video-to-video

video to video
motion
MAGI-1 distilled generates videos from images faster, with exceptional understanding of physical interactions and prompts.
magi-distilled/image-to-video
image-to-video

Qwen Image LoRA training
qwen-image-trainer-v2
training

lora
personalization
Convert your assets into Lottie animations using Omnilottie.
omnilottie
json

lottie
Generate video with audio from audio, text and images using LTX-2 and custom LoRA
ltx-2-19b/audio-to-video/lora
audio-to-video

Generate seamlessly tiling photorealistic images from text using Z-Image Turbo and custom LoRA
z-image/turbo/tiling/lora
text-to-image

z-image
turbo
seamless
Generate high-quality video clips quickly from text and image prompts using PixVerse v4.5 fast
pixverse/v4.5/text-to-video/fast
text-to-video

stylized
transform
Ballpoint pen sketch drawing style
flux-2-lora-gallery/ballpoint-pen-sketch
text-to-image

stylized
transform
YuE is a groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into full songs.
yue
text-to-audio

music
Use Hunyuan Part to generate point clouds from your 3D files.
hunyuan-part
3d-to-3d

3d-to-3d
point-cloud
F Lite is a 10B parameter diffusion model created by Fal and Freepik, trained exclusively on copyright-safe and SFW content.
f-lite/standard
text-to-image

Remove existing lighting and apply soft, even illumination
qwen-image-edit-plus-lora-gallery/remove-lighting
image-to-image

stylized
transform