Rapidly create image variations with Ideogram V2 Turbo Remix. Fast and efficient reimagining of existing images while maintaining creative control through prompt guidance.
ideogram/v2/turbo/remix
image-to-image

Rapidly create image variations with Ideogram V2 Turbo Remix. Fast and efficient reimagining of existing images while maintaining creative control through prompt guidance.

realism
typography
GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.
got-ocr/v2
vision

GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.

optical character recognition
high-res
utility
Generate long videos from images using LongCat Video Distilled
longcat-video/distilled/image-to-video/480p
image-to-video

Generate long videos from images using LongCat Video Distilled

FLUX.1 Krea [dev] Redux is a high-performance endpoint for the FLUX.1 Krea [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.
flux-1/krea/redux
image-to-image

FLUX.1 Krea [dev] Redux is a high-performance endpoint for the FLUX.1 Krea [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

Kling LipSync is a text-to-video model that generates realistic lip movements from text input.
kling-video/lipsync/text-to-video
text-to-video

Kling LipSync is a text-to-video model that generates realistic lip movements from text input.

text to video
lipsync
Leverage the rapid processing capabilities of AI models to enable accurate and efficient real-time speech-to-text transcription.
speech-to-text
speech-to-text

Leverage the rapid processing capabilities of AI models to enable accurate and efficient real-time speech-to-text transcription.

Add immersive sound effects and background music to your videos using PixVerse sound effects  generation
pixverse/sound-effects
video-to-video

Add immersive sound effects and background music to your videos using PixVerse sound effects generation

audio
utility
Finegrain Eraser removes any object selected with a bounding box—along with its shadows, reflections, and lighting artifacts—seamlessly reconstructing the scene with contextually accurate content.
finegrain-eraser/bbox
image-to-image

Finegrain Eraser removes any object selected with a bounding box—along with its shadows, reflections, and lighting artifacts—seamlessly reconstructing the scene with contextually accurate content.

utility
editing
Generate video with audio from reference video, text and images using LTX-2.3
ltx-2.3-22b/reference-video-to-video
video-to-video

Generate video with audio from reference video, text and images using LTX-2.3

VACE Fun for Wan 2.2 A14B from Alibaba-PAI
wan-22-vace-fun-a14b/outpainting
video-to-video

VACE Fun for Wan 2.2 A14B from Alibaba-PAI

FFMPEG Utility to Reverse Videos
workflow-utilities/reverse-video
video-to-video

FFMPEG Utility to Reverse Videos

ZoeDepth preprocessor.
image-preprocessors/zoe
image-to-image

ZoeDepth preprocessor.

depth
preprocess
utility
Pull motion from a reference video and apply it to new subjects or scenes.
moonvalley/marey/motion-transfer
video-to-video

Pull motion from a reference video and apply it to new subjects or scenes.

Change facial expressions in photos to any emotion you desire, from smiles to serious looks.
image-editing/expression-change
image-to-image

Change facial expressions in photos to any emotion you desire, from smiles to serious looks.

stylized
transform
Transform any person into their baby version, while preserving the original pose and expression with childlike features.
image-editing/baby-version
image-to-image

Transform any person into their baby version, while preserving the original pose and expression with childlike features.

stylized
transform
LoRA trainer for Qwen Image Edit 2511
qwen-image-edit-2511-trainer
training

LoRA trainer for Qwen Image Edit 2511

Vidu Q1 Text to Video generates high-quality 1080p videos with exceptional visual quality and motion diversity
vidu/q1/text-to-video
text-to-video

Vidu Q1 Text to Video generates high-quality 1080p videos with exceptional visual quality and motion diversity

stylized
transform
Convert plain text into Fibo-Lite's transparent JSON-structured prompts - Bria's unique controllability layer that no closed model offers. Built for agentic and enterprise workflows.
bria/fibo-lite/generate/structured_prompt
text-to-json

Convert plain text into Fibo-Lite's transparent JSON-structured prompts - Bria's unique controllability layer that no closed model offers. Built for agentic and enterprise workflows.

bria
fibo
structured-prompt
Generate video clips from your prompts using Kling 1.6 (pro)
kling-video/v1.6/pro/effects
text-to-video

Generate video clips from your prompts using Kling 1.6 (pro)

Change facial expressions in photos with realistic results.
image-apps-v2/expression-change
image-to-image

Change facial expressions in photos with realistic results.

face-edit
expression-change
Generate video from audio using LTX-2
ltx-2/audio-to-video
audio-to-video

Generate video from audio using LTX-2

stylized
transform
lipsync
FLUX.1 SRPO [dev] is a 12 billion parameter flow transformer that generates high-quality images from text with incredible aesthetics. It is suitable for personal and commercial use.
flux-1/srpo/image-to-image
image-to-image

FLUX.1 SRPO [dev] is a 12 billion parameter flow transformer that generates high-quality images from text with incredible aesthetics. It is suitable for personal and commercial use.

FFMPEG Untility for Extracting nth Frame
workflow-utilities/extract-nth-frame
image-to-image

FFMPEG Untility for Extracting nth Frame

Extend video with audio using LTX-2
ltx-2-19b/extend-video
video-to-video

Extend video with audio using LTX-2

Generate video with audio from audio, text and images using LTX-2 Distilled
ltx-2.3-22b/distilled/audio-to-video
audio-to-video

Generate video with audio from audio, text and images using LTX-2 Distilled

Generate video with audio from text using LTX-2.3 and custom LoRA
ltx-2.3-22b/text-to-video/lora
text-to-video

Generate video with audio from text using LTX-2.3 and custom LoRA

Use the latest Vidu Q2 Pro models which much more better quality and control on your videos.
vidu/q2/reference-to-video/pro
image-to-video

Use the latest Vidu Q2 Pro models which much more better quality and control on your videos.

Wan 2.2 text to image LoRA trainer. Fine-tune Wan 2.2 for subjects and styles with unprecedented detail.
wan-22-image-trainer
training

Wan 2.2 text to image LoRA trainer. Fine-tune Wan 2.2 for subjects and styles with unprecedented detail.

lora
personalization
Showing 925 to 952 of 1354 results