Vace a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.
wan-vace-1-3b
video-to-video

Vace a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

Extend an existing song
sonauto/v2/extend
audio-to-audio

Extend an existing song

music
text-to-music
text-to-audio
Replace sections of an existing audio with newly generated content
sonauto/v2/inpaint
text-to-audio

Replace sections of an existing audio with newly generated content

music
text-to-music
Create full songs in any style
sonauto/v2/text-to-music
text-to-audio

Create full songs in any style

music
text-to-music
Generate videos from reference images using Google's Veo 3.1 Fast
veo3.1/fast/reference-to-video
image-to-video

Generate videos from reference images using Google's Veo 3.1 Fast

Grok Imagine Pro is an advanced AI model from xAI that creates high-quality visuals from text prompts and allows you to edit or analyze existing images.
xai/grok-imagine-image/quality/text-to-image
text-to-image

Grok Imagine Pro is an advanced AI model from xAI that creates high-quality visuals from text prompts and allows you to edit or analyze existing images.

stylized
transform
typography
Grok Imagine Pro is an advanced AI model from xAI that creates high-quality visuals from text prompts and allows you to edit or analyze existing images.
xai/grok-imagine-image/quality/edit
image-to-image

Grok Imagine Pro is an advanced AI model from xAI that creates high-quality visuals from text prompts and allows you to edit or analyze existing images.

stylized
transform
typography
Realtime Try On experience with Decart Lucy 2.1 VTON
new
decart/lucy2-vton/realtime
video-to-video

Realtime Try On experience with Decart Lucy 2.1 VTON

SoulX-FlashHead is a unified 1.3B-parameter framework designed for high-fidelity, infinite-length, and real-time streaming portrait video generation.
flashhead
image-to-video

SoulX-FlashHead is a unified 1.3B-parameter framework designed for high-fidelity, infinite-length, and real-time streaming portrait video generation.

portrait
video
streaming
Audio-driven talking avatar generation powered by the SoulX-FlashTalk 14B model.
flashtalk
audio-to-video

Audio-driven talking avatar generation powered by the SoulX-FlashTalk 14B model.

avatar
talking-head
audio-driven
Lyria 3 is most recent music model from Google
lyria3
text-to-audio

Lyria 3 is most recent music model from Google

audio
music
sfx
Showing 1345 to 1355 of 1355 results