Image editing with FLUX.2 [flex] from Black Forest Labs. Supports multi-reference editing with customizable inference steps and enhanced text rendering.
flux-2-flex/edit
image-to-image

MiniMax Hailuo-02 Image To Video API (Pro, 1080p): Advanced image-to-video generation model with 1080p resolution
minimax/hailuo-02/pro/image-to-video
image-to-video

Generate realistic lipsync animations from audio using advanced algorithms for high-quality synchronization.
sync-lipsync
video-to-video

animation
lip sync
Generate high quality music and sound effects using Stable Audio 2.5 from StabilityAI
stable-audio-25/text-to-audio
text-to-audio

audio
Veo 3 by Google, the most advanced AI video generation model in the world. With sound on!
veo3
text-to-video

Bria Expand expands images beyond their borders in high quality. Trained exclusively on licensed data for safe and risk-free commercial use. Access the model's source code and weights: https://bria.ai/contact-us
bria/expand
image-to-image

outpainting
Generate realistic lipsync animations from audio using advanced algorithms for high-quality synchronization with the Sync Lipsync 2.0 model.
sync-lipsync/v2
video-to-video

animation
lip sync
Generate realistic videos using Kling O3 from Kling Team!
kling-video/o3/pro/text-to-video
text-to-video

Generate images from text and images using Z-Image Turbo, Tongyi-MAI's super-fast 6B model.
z-image/turbo/image-to-image
image-to-image

turbo
z-image
fast
Kling AI Avatar v2 Pro: The premium endpoint for creating avatar videos with realistic humans, animals, cartoons, or stylized characters
kling-video/ai-avatar/v2/pro
image-to-video

Omnihuman v1.5 is a new and improved version of Omnihuman. It generates video using an image of a human figure paired with an audio file. It produces vivid, high-quality videos where the character’s emotions and movements maintain a strong correlation with the audio.
bytedance/omnihuman/v1.5
image-to-video

lipsync
MiniMax Music 2.6 creates complete tracks with singing, backing music, and detailed arrangements from lyrics and a style description.
minimax-music/v2.6
text-to-audio

stylized
transform
lipsync
Generate speech from text prompts and different voices using the MiniMax Speech-02 HD model, which leverages advanced AI techniques to create high-quality text-to-speech.
minimax/speech-02-hd
text-to-speech

speech
Merge videos with standalone audio files or audio from video files.
ffmpeg-api/merge-audio-video
video-to-video

ffmpeg
Text-to-image generation with LoRA support for FLUX.2 [dev] from Black Forest Labs. Custom style adaptation and fine-tuned model variations.
flux-2/lora
text-to-image

Veo 3.1 Lite balances practical utility with professional capabilities, supporting Text-to-Video and Image-to-Video
veo3.1/lite
text-to-video

stylized
transform
lipsync
Kling Omni 3: Top-tier image-to-image with flawless consistency.
kling-image/o3/image-to-image
image-to-image

Text-to-Image endpoint with LoRA support for Z-Image Turbo, a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
z-image/turbo/lora
text-to-image

z-image
lora
fast
Kling AI Avatar v2 Standard: Endpoint for creating avatar videos with realistic humans, animals, cartoons, or stylized characters
kling-video/ai-avatar/v2/standard
image-to-video

Upscale your videos using SeedVR2 with temporal consistency!
seedvr/upscale/video
video-to-video

upscale
FLUX LoRA Image-to-Image is a high-performance endpoint that transforms existing images using FLUX models, leveraging LoRA adaptations to enable rapid and precise image style transfer, modifications, and artistic variations.
flux-lora/image-to-image
image-to-image

lora
style transfer
ffmpeg endpoint for first, middle and last frame extraction from videos
ffmpeg-api/extract-frame
image-to-image

utility
editing
Endpoint for Qwen's Image Editing model. Has superior text editing capabilities.
qwen-image-edit
image-to-image

image-editing
high-quality-text
Heygen Photo Avatar 4 Model
heygen/avatar4/image-to-video
image-to-video

LTX-2.3 is a high-quality, fast AI video model available in Pro and Fast variants for text-to-video, image-to-video, and audio-to-video.
ltx-2.3/image-to-video
image-to-video

stylized
transform
lipsync
Generate high-quality, realistic lipsync animations from audio while preserving details like natural teeth and unique facial features, using the state-of-the-art Sync Lipsync 2 Pro model.
sync-lipsync/v2/pro
video-to-video

animation
lip sync
high-quality
MMAudio generates synchronized audio given video and/or text inputs. It can be combined with video models to get videos with audio.
mmaudio-v2
video-to-video

ai video
fast
Wan-2.1 is an image-to-video model that generates videos with high visual quality and motion diversity from images.
wan-i2v
image-to-video

image to video
motion