VEED Fabric 1.0 text-to-video API
veed/fabric-1.0/text
text-to-video

lipsync
avatar
VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.
wan-vace-14b/reframe
video-to-video

reframe
Generate 3D models from your images using Hunyuan 3D. A native 3D generative model enabling versatile and high-quality 3D asset creation.
hunyuan3d/v2/mini/turbo
image-to-3d

stylized
Remove background from videos filmed using chromakey, with automatic green spill suppression for clean, professional edges.
veed/video-background-removal/green-screen
video-to-video

Create group photos
qwen-image-edit-plus-lora-gallery/group-photo
image-to-image

stylized
transform
LoRA endpoint for the Qwen Image Edit 2509 model.
qwen-image-edit-2509-lora
image-to-image

image-editing
CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs.
csm-1b
text-to-audio

conversational
text to speech
Generate 3D models from your images using Hunyuan 3D. A native 3D generative model enabling versatile and high-quality 3D asset creation.
hunyuan3d/v2/turbo
image-to-3d

stylized
A high-fidelity capability for erasing unwanted objects, people, or visual elements from videos while maintaining aesthetic quality and temporal consistency.
bria/bria_video_eraser/erase/prompt
video-to-video

bria
erase
OmniGen is a unified image generation model that can generate a wide range of images from multi-modal prompts. It can be used for various tasks such as Image Editing, Personalized Image Generation, Virtual Try-On, Multi Person Generation and more!
omnigen-v1
text-to-image

multimodal
editing
try-on
Generate synced sounds for any video and return the video with its new soundtrack (like MMAudio).
mirelo-ai/sfx-v1/video-to-video
video-to-video

sfx
Vidu Image to Video generates high-quality videos with exceptional visual quality and motion diversity from a single image
vidu/image-to-video
image-to-video

motion
image to video
Bring speech to your text using the Qwen3-TTS Custom-Voice model with pre-trained voices, or use your own voice with the Qwen3-TTS Clone Voice model.
qwen-3-tts/text-to-speech/0.6b
text-to-speech

Endpoint for Qwen's Image Editing Plus model, also known as Qwen-Image-Edit-2509. It has superior text-editing capabilities and multi-image support.
qwen-image-edit-2509
image-to-image

image-editing
high-quality-text
Precise camera position and angle control (rotation, zoom, vertical movement)
qwen-image-edit-2509-lora-gallery/multiple-angles
image-to-image

stylized
transform
Turbo is the model to use when you feel the need for speed. Turn your image into stunning video up to 3x faster, all with high-quality output.
pika/v2/turbo/image-to-video
image-to-video

editing
effects
animation
SAM 2 is a model for segmenting images and videos in real-time.
sam2/video
video-to-video

segmentation
mask
real-time
Kandinsky 5.0 Pro is a diffusion model for fast, high-quality text-to-video generation.
kandinsky5-pro/text-to-video
text-to-video

Audio separation with SAM Audio. Isolate any sound using natural language—professional-grade audio editing made simple for creators, researchers, and accessibility applications.
sam-audio/span-separate
audio-to-audio

sam-audio
Pika Effects are AI-powered video effects designed to modify objects, characters, and environments in a fun, engaging, and visually compelling manner.
pika/v1.5/pikaffects
image-to-video

editing
effects
animation
Fast, low-latency text-to-image model with high-quality output and full JSON-structured controllability. Open-source, trained on licensed data, and optimized for production-scale generation.
bria/fibo-lite/generate
text-to-image

bria
fibo
lite
Generate fast speech from text prompts and different voices using the MiniMax Speech-02 Turbo model, which leverages advanced AI techniques to create high-quality text-to-speech.
minimax/preview/speech-2.5-turbo
text-to-speech

Lyra 2.0 is an image-to-video model that turns a single image into an explorable 3D-style video with camera-controlled motion.
lyra-2/zoom
image-to-video

Reframe entire videos scene-by-scene using Wan VACE 2.1
wan-vace-apps/long-reframe
video-to-video

Apply realistic makeup styles with adjustable intensity.
image-apps-v2/makeup-application
image-to-image

makeup
transform
Photorealistic Image-to-Image
kolors/image-to-image
image-to-image

realism
editing
diffusion
Hunyuan World 1.0 turns a single image into a panorama or a 3D world. It creates realistic scenes from the image, allowing you to explore and view it from different angles.
hunyuan_world
image-to-image

Ray2 Flash Modify is a video generative model capable of restyling or retexturing an entire shot: turning live-action into CG or stylized animation, changing wardrobe, props, or the overall aesthetic, and swapping environments or time periods, giving you control over background, location, and even weather.
luma-dream-machine/ray-2-flash/modify
video-to-video

modify
restyle