Search Page 17

Showing 28 of 1395 results

wan/v2.6/image-to-video/flash

Wan 2.6 image-to-video flash model.

image-to-video

qwen-3-tts/clone-voice/1.7b

Clone your voice with the Qwen3-TTS Clone-Voice model, which supports zero-shot cloning, and use the cloned voice with text-to-speech models to generate speech that sounds like you!

clone-voice

voice-clone

audio-to-audio

pixverse/v5.5/effects

Pixverse Effects

image-to-video

ideogram/v4/image-to-image

Ideogram V4.0q Image-to-Image transforms an input image with a text prompt, restyling and reworking the composition while preserving its core structure for prompt-faithful, high-fidelity edits.

Generate 3D human motions from text through the generation interface of Hunyuan Motion!

motion

text-to-3d

instant-character

InstantCharacter creates high-quality, consistent characters from text prompts, supporting diverse poses, styles, and appearances with strong identity control.

personalization

customization

image-to-image

kling-video/v1/pro/ai-avatar

Kling AI Avatar Pro: The premium endpoint for creating avatar videos with realistic humans, animals, cartoons, or stylized characters.

Reimagine existing images with Ideogram V3's remix feature. Create variations and adaptations while preserving core elements and adding new creative directions through prompt guidance.

Generate music from text prompts using the MiniMax model, which leverages advanced AI techniques to create high-quality, diverse musical compositions.

music

text-to-audio

video-upscaler

The video upscaler endpoint uses RealESRGAN on each frame of the input video to upscale the video to a higher resolution.

tripo3d/h3.1/multiview-to-3d

Generate 3D models from multiple view images using Tripo H3.1.

multiview-to-3d

3d-generation

image-to-3d

openrouter/router/openai/v1/embeddings

Generate text embeddings using OpenAI-compatible API. Access embedding models like text-embedding-3-small, text-embedding-3-large (OpenAI), and other embedding models available through OpenRouter. Drop-in replacement for the OpenAI embeddings API. Powered by OpenRouter.

llm

hidream-i1-fast

HiDream-I1 fast is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within 16 steps.

text-to-image

flashvsr/upscale/video

Upscale your videos at high speed using FlashVSR!

upscale

video-to-video

luma/agent/uni-1/v1/edit

Luma Uni-1 Edit reworks a source image from a text instruction, preserving the original composition while applying style changes and following optional reference images to steer the result.

Create high-fidelity video with audio from images with LTX-2 Pro

image-to-video

luma-dream-machine/ray-2

Ray2 is a large-scale video generative model capable of creating realistic visuals with natural, coherent motion.

Create creative upscaled images.

upscaling

image-to-image

ltx-2-19b/image-to-video

Generate video with audio from images using LTX-2

image-to-video

minimax/hailuo-2.3/pro/text-to-video

MiniMax Hailuo-2.3 Text To Video API (Pro, 1080p): Advanced text-to-video generation model with 1080p resolution

text-to-video

ideogram/character/edit

Modify consistent characters while preserving their core identity. Edit poses, expressions, or clothing without losing recognizable character features

character-consistency

image-to-image

flux-control-lora-depth

FLUX Control LoRA Depth is a high-performance endpoint that transfers structure from a control image to the generated image using a depth map.

lora

style transfer

text-to-image

pixverse/v5/text-to-video

Generate high quality video clips from text and image prompts using PixVerse v5

text-to-video

hunyuan-3d/v3.1/rapid/image-to-3d

Rapidly generate 3D models from images using Hunyuan 3D.

hunyuan

image-to-3d

chatterbox/speech-to-speech

Whether you're working on memes, videos, games, or AI agents, Chatterbox brings your content to life. Use the first TTS model from Resemble AI.

speech-to-speech

stable-diffusion-v3-medium

Stable Diffusion 3 Medium (Text to Image) is a Multimodal Diffusion Transformer (MMDiT) model that improves image quality, typography, prompt understanding, and efficiency.

diffusion

style

text-to-image