Easily adjust the perspective of any image to different angles.
image-apps-v2/perspective
image-to-image

change-angle
perspective
RunDiffusion Photo Flux is a realism enhancer that brings out textures and skin details, turning your prompts into vivid, lifelike images. Recommended weight: 0.65 to 0.80. Supports resolutions up to 1536x1536.
rundiffusion-fal/rundiffusion-photo-flux
text-to-image

image generation
lora
Place products naturally in a person’s hands for realistic marketing visuals.
image-apps-v2/product-holding
image-to-image

product
marketing
Adjust color temperature, brightness, contrast, saturation, and gamma values for color correction.
post-processing/color-correction
image-to-image

stylized
transform
VACE Fun for Wan 2.2 A14B from Alibaba-PAI
wan-22-vace-fun-a14b/reframe
video-to-video

Qwen Image 2512 LoRA training
qwen-image-2512-trainer
training

lora
personalization
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks.
florence-2-large/dense-region-caption
image-to-image

multimodal
vision
An AI model that transforms input images into new ones based on text prompts, blending reference visuals with your creative directions.
uno
image-to-image

Pixel-Aware Diffusion Model for Realistic Image Super-Resolution and Personalized Stylization
pasd
image-to-image

utility
editing
Clone your voice with the Qwen3-TTS Clone-Voice model, which supports zero-shot cloning, then use the cloned voice with text-to-speech models to generate speech that sounds like you.
qwen-3-tts/clone-voice/0.6b
audio-to-audio

clone-voice
voice-clone
Create seamless transitions between images using PixVerse v3.5.
pixverse/v3.5/transition
image-to-video

Generate video clips that follow the initial image and natural-language description more accurately, with camera movement instructions for shot control.
minimax/video-01-director/image-to-video
image-to-video

motion
transformation
camera-controls
Image-to-image editing with Step1X-Edit v2 from StepFun. Reasoning-enhanced modifications through a thinking–editing–reflection loop with MLLM world knowledge for abstract instruction comprehension.
stepx-edit2
image-to-image

FLUX.1 Krea [dev] is a 12 billion parameter flow transformer that generates high-quality images from text with incredible aesthetics. It is suitable for personal and commercial use.
flux-1/krea/image-to-image
image-to-image

Generate expressive facial performance, natural speech-expression coordination, realistic body motion, and accurate audio-video synchronization with the DaVinci-MagiHuman model.
davinci-magihuman
image-to-video

animation
lip sync
Image colorization and color-grading model. Bring color to black-and-white photos or apply curated color treatments using simple style-based commands.
bria/fibo-edit/colorize
image-to-image

bria
fibo-edit
color
Extends videos with audio using LTX-2
ltx-2/extend-video
video-to-video

Generate character IDs to use with Sora 2 generations.
sora-2/characters
image-to-video

Retouch photos of faces. Remove blemishes and improve the skin.
image-editing/retouch
image-to-image

Generate YouTube thumbnails with custom text
image-editing/youtube-thumbnails
image-to-image

stylized
transform
Quickly generate high-quality video clips from text and image prompts using PixVerse v4.5.
pixverse/v4.5/image-to-video/fast
image-to-video

stylized
transform
Enhance and refine portrait photos with improved clarity and detail.
image-apps-v2/portrait-enhance
image-to-image

image-edit
enhancement
VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.
wan-vace-14b
video-to-video

image-to-video
text-to-video
EchoMimic V3 generates a talking avatar from a picture, an audio clip, and a text prompt.
echomimic-v3
audio-to-video

echomimic
talking-head
High-fidelity mask-based video object removal with strong temporal consistency. Erase unwanted objects, people, or elements while preserving aesthetic quality. Trained on licensed data for risk-free commercial use.
bria/video/erase/mask
video-to-video

bria
video
erase
Generate a video starting from an image as the first frame with Marey, a generative video model trained exclusively on fully licensed data.
moonvalley/marey/i2v
image-to-video

MoonDreamNext Batch is a multimodal vision-language model for batch captioning.
moondream-next/batch
vision

multimodal
Generate high-quality images from text prompts using CogView4. Longer text prompts result in better-quality images.
cogview4
text-to-image

stylized