Search Page 4

Showing 28 of 1400 results

aura-sr

Upscale your images with AuraSR.

upscaling

high-res

image-to-image

flux-lora-fast-training

Train styles, people and other subjects at blazing speeds.

lora

personalization

training

veo3.1

Veo 3.1 by Google, the most advanced AI video generation model in the world. With sound on!

text-to-video

kling-video/v2.5-turbo/pro/text-to-video

Kling 2.5 Turbo Pro: Top-tier text-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Open source text-to-audio model.

music

text-to-audio

kling-video/o3/pro/reference-to-video

Transform images, elements, and text into consistent, high-quality video scenes, ensuring stable character identity, object details, and environments.

reference-to-video

image-to-video

new

bytedance/seedance-2.0/mini/reference-to-video

Seedance 2.0 Mini is a faster version of Seedance 2.0 that brings great performance and high generation speed at a lower cost.

google/gemini-omni-flash/reference-to-video

Generates video with audio from combined multimodal references. Accepts text, images, audio, and video together as input to guide subject, motion, style, and sound in the output.

Run SDXL at the speed of light

minimax/hailuo-02/standard/image-to-video

MiniMax Hailuo-02 Image-to-Video API (Standard): advanced image-to-video generation at 768p and 512p resolutions.

image-to-video

new

bytedance/seed-audio-1.0

Seed Audio 1.0 is a new audio model from Bytedance that can generate high-quality, natural-sounding audio from text, reference audio, or an image.

text-to-audio

new

google/gemini-omni-flash/edit

Edits generated video across multiple conversational turns while preserving scene coherence. Applies iterative changes through natural-language instructions without regenerating the full sequence from scratch.

elevenlabs/sound-effects/v2

Generate sound effects using ElevenLabs' advanced sound effects model.

sound

text-to-audio

wan/v2.7/image-to-video

Wan 2.7 is the latest generation AI video model, delivering enhanced motion smoothness, superior scene fidelity, and greater visual coherence.

xai/grok-imagine-video/text-to-video

Generate videos with audio from text using Grok Imagine Video.

ffmpeg-api/merge-videos

Use ffmpeg capabilities to merge 2 or more videos.

video-to-video

bytedance/omnihuman/v1.5

Omnihuman v1.5 is a new and improved version of Omnihuman. It generates video using an image of a human figure paired with an audio file. It produces vivid, high-quality videos where the character’s emotions and movements maintain a strong correlation with the audio.

lipsync

image-to-video

cassetteai/music-generator

CassetteAI’s model generates a 30-second sample in under 2 seconds and a full 3-minute track in under 10 seconds. At 44.1 kHz stereo audio, expect a level of professional consistency with no breaks, no squeaks, and no random interruptions in your creations.

music

cassetteai

text-to-audio

pixelcut/background-removal

Pixelcut’s Background Remover enables fast, ultra high-quality removal of backgrounds from images. Perfect for e-commerce and image editing workflows. Powered by advanced AI for clean, perfect cutouts every time.

kling-video/v2.6/standard/motion-control

Transfer movements from a reference video to any character image. Cost-effective mode for motion transfer, perfect for portraits and simple animations.

video-to-video

recraft/upscale/crisp

Enhances a given raster image using the 'crisp upscale' tool, boosting resolution with a focus on refining small details and faces.

upscaling

image-to-image

kling-video/v2.5-turbo/standard/image-to-video

Kling 2.5 Turbo Standard: Top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

stylized

transform

image-to-video

kling-video/v2.1/pro/image-to-video

Kling 2.1 Pro is an advanced endpoint for the Kling 2.1 model, offering professional-grade videos with enhanced visual fidelity, precise camera movements, and dynamic motion control, perfect for cinematic storytelling.

image-to-video

sync-lipsync/v2

Generate realistic lipsync animations from audio with the Sync Lipsync 2.0 model, which uses advanced algorithms for high-quality synchronization.

animation

lip sync

video-to-video

qwen-image-edit-2511-multiple-angles

Generates the same scene from different angles (azimuth/elevation) with Qwen Image Edit 2511 and the Multiple Angles LoRA.

Text-to-image generation with FLUX.2 [dev] from Black Forest Labs. Enhanced realism, crisper text generation, and native editing capabilities—all at turbo speed.

text-to-image

kling-video/v3/pro/motion-control

Transfer movements from a reference video to any character image. Cost-effective mode for motion transfer, perfect for portraits and simple animations.

ideogram/remove-background

Remove backgrounds from existing images with Ideogram's remove background feature. Isolate subjects cleanly for compositing and creative reuse.

image-to-image
