Search Page 8

Showing 28 of 1400 results

Happy Horse 1.1 is Alibaba's #1-ranked video model. This image-to-video endpoint animates a still image into 1080p video with synchronized native audio and multilingual lip-sync

flux-2/turbo/edit

Image-to-image editing with FLUX.2 [dev] from Black Forest Labs. Precise modifications using natural language descriptions and hex color control—all at turbo speed.

image-to-image

Nano banana lite is the efficiency-focused model in the image generation family. Sub-2 second latency with cost-effective generation and editing, fast multi-turn local edits, and 14 supported aspect ratios.

new

google/nano-banana-lite

Nano banana lite is the efficiency-focused model in the image generation family. Sub-2 second latency with cost-effective generation and editing, fast multi-turn local edits, and 14 supported aspect ratios.

text-to-image

flux-lora/image-to-image

FLUX LoRA Image-to-Image is a high-performance endpoint that transforms existing images using FLUX models, leveraging LoRA adaptations to enable rapid and precise image style transfer, modifications, and artistic variations.

LatentSync is a video-to-video model that generates lip sync animations from audio using advanced algorithms for high-quality synchronization.

animation

lip sync

video-to-video

Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

kling-video/v2.1/master/image-to-video

Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

image-to-video

Seedance 2.0 Mini is a faster version of Seedance 2.0 that brings great performance and high generation speed at a lower cost.

new

bytedance/seedance-2.0/mini/text-to-video

Seedance 2.0 Mini is a faster version of Seedance 2.0 that brings great performance and high generation speed at a lower cost.

veed/lipsync

Generate realistic lipsync from any audio using VEED's model.

Get encoding metadata from video and audio files using FFmpeg API.

ffmpeg

json

Kling Omni 3: Top-tier image-to-image with flawless consistency.

kling-image/o3/image-to-image

Kling Omni 3: Top-tier image-to-image with flawless consistency.

image-to-image

Generate high-quality images, posters, and logos with Ideogram's latest V4.0q — producing crisp visuals with accurate text rendering, fine detail, and full creative control for polished, ready-to-use designs FRACTION OF A SECOND.

new

ideogram/v4/instant

Generate high-quality images, posters, and logos with Ideogram's latest V4.0q — producing crisp visuals with accurate text rendering, fine detail, and full creative control for polished, ready-to-use designs FRACTION OF A SECOND.

bytedance/seedance/v1/pro/text-to-video

Seedance 1.0 Pro, a high quality video generation model developed by Bytedance.

text-to-video

Gemini 3.1 Flash Image (a.k.a Nano Banana 2) is Google's new state-of-the-art fast image generation and editing model

gemini-3.1-flash-image-preview

Gemini 3.1 Flash Image (a.k.a Nano Banana 2) is Google's new state-of-the-art fast image generation and editing model

text-to-image

kokoro/american-english

Kokoro is a lightweight text-to-speech model that delivers comparable quality to larger models while being significantly faster and more cost-efficient.

speech

text-to-audio

sync-lipsync/v2/pro

Generate high-quality realistic lipsync animations from audio while preserving unique details like natural teeth and unique facial features using the state-of-the-art Sync Lipsync 2 Pro model.

Wan-2.1 is a image-to-video model that generates high-quality videos with high visual quality and motion diversity from images

Text-to-Image endpoint with LoRA support for Z-Image Turbo, a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.

z-image/turbo/image-to-image

Generate images from text and images using Z-Image Turbo, Tongyi-MAI's super-fast 6B model.

ltx-2.3/image-to-video

LTX-2.3 is a high-quality, fast AI video model available in Pro and Fast variants for text-to-video, image-to-video, and audio-to-video.

flux-pro/kontext/multi

Experimental version of FLUX.1 Kontext [pro] with multi image handling capabilities

image-to-image

Generate realistic videos using Kling O3 from Kling Team!

kling-video/o3/pro/text-to-video

Generate realistic videos using Kling O3 from Kling Team!

text-to-video

FLUX LoRA training optimized for portrait generation, with bright highlights, excellent prompt following and highly detailed results.

flux-lora-portrait-trainer

FLUX LoRA training optimized for portrait generation, with bright highlights, excellent prompt following and highly detailed results.

lora

personalization

training

wan-25-preview/text-to-video

Wan 2.5 text-to-video model.

text-to-video

Transform and edit existing images with text-guided instructions using the WAN 2.7 model for creative image manipulation.

wan/v2.7/edit

Transform and edit existing images with text-guided instructions using the WAN 2.7 model for creative image manipulation.

OmniHuman generates video using an image of a human figure paired with an audio file. It produces vivid, high-quality videos where the character’s emotions and movements maintain a strong correlation with the audio.

Generate high-fidelity images from text with Krea 2 using a style reference image. Apply a reference image to guide the visual style into new generations, with aspect ratio, creativity, and seed controls.

Clone a voice from a sample audio and generate speech from text prompts using the MiniMax model, which leverages advanced AI techniques to create high-quality text-to-speech.

speech

text-to-speech

Image editing with FLUX.2 [flex] from Black Forest Labs. Supports multi-reference editing with customizable inference steps and enhanced text rendering.

flux-2-flex/edit

Image editing with FLUX.2 [flex] from Black Forest Labs. Supports multi-reference editing with customizable inference steps and enhanced text rendering.

image-to-image

Showing 197 to 224 of 1400 results