A versatile endpoint for the FLUX.1 [dev] model that supports multiple AI extensions including LoRA, ControlNet conditioning, and IP-Adapter integration, enabling comprehensive control over image generation through various guidance methods.
flux-general
text-to-image

A versatile endpoint for the FLUX.1 [dev] model that supports multiple AI extensions including LoRA, ControlNet conditioning, and IP-Adapter integration, enabling comprehensive control over image generation through various guidance methods.

lora
controlnet
ip-adapter
MiniMax Hailuo-2.3 Image To Video API (Standard, 768p): Advanced image-to-video generation model with 768p resolution
minimax/hailuo-2.3/standard/image-to-video
image-to-video

MiniMax Hailuo-2.3 Image To Video API (Standard, 768p): Advanced image-to-video generation model with 768p resolution

Transform and edit existing images with text-guided instructions using the WAN 2.7 model for creative image manipulation.
wan/v2.7/edit
image-to-image

Transform and edit existing images with text-guided instructions using the WAN 2.7 model for creative image manipulation.

wan
image-editing
Edit videos using xAI's Grok Imagine
xai/grok-imagine-video/edit-video
video-to-video

Edit videos using xAI's Grok Imagine

video-edit
v2v
grok
MiniMax Hailuo-2.3-Fast Image To Video API (Standard, 768p): Advanced fast image-to-video generation model with 768p resolution
minimax/hailuo-2.3-fast/standard/image-to-video
image-to-video

MiniMax Hailuo-2.3-Fast Image To Video API (Standard, 768p): Advanced fast image-to-video generation model with 768p resolution

Qwen-Image-2.0 is a next-generation foundational unified generation-and-editing model
qwen-image-2/text-to-image
text-to-image

Qwen-Image-2.0 is a next-generation foundational unified generation-and-editing model

realism
typography
Generate high fidelity, studio quality videos of your avatar speaking or singing using the Aurora from Creatify team!
creatify/aurora
image-to-video

Generate high fidelity, studio quality videos of your avatar speaking or singing using the Aurora from Creatify team!

lipsync
Generate realistic videos using Kling O3 from Kling Team!
kling-video/o3/standard/text-to-video
text-to-video

Generate realistic videos using Kling O3 from Kling Team!

OpenAI's latest image generation and editing model: gpt-1-image.
gpt-image-1/edit-image
image-to-image

OpenAI's latest image generation and editing model: gpt-1-image.

Generate high-quality images from text prompts using the WAN 2.7 model with advanced prompt understanding and detailed output.
wan/v2.7/text-to-image
text-to-image

Generate high-quality images from text prompts using the WAN 2.7 model with advanced prompt understanding and detailed output.

wan
image-generation
FLUX.1 Kontext [max] text-to-image is a new premium model brings maximum performance across all aspects – greatly improved prompt adherence.
flux-pro/kontext/max/text-to-image
text-to-image

FLUX.1 Kontext [max] text-to-image is a new premium model brings maximum performance across all aspects – greatly improved prompt adherence.

Image-to-image editing with FLUX.2 [klein] 4B from Black Forest Labs. Precise modifications using natural language descriptions and hex color control.
flux-2/klein/4b/edit
image-to-image

Image-to-image editing with FLUX.2 [klein] 4B from Black Forest Labs. Precise modifications using natural language descriptions and hex color control.

SAM 3D enables precise 3D reconstruction of objects from real images, while accurately reconstructing their geometry and texture.
sam-3/3d-objects
image-to-3d

SAM 3D enables precise 3D reconstruction of objects from real images, while accurately reconstructing their geometry and texture.

3d
object
Qwen-Image-Layered is a model capable of decomposing an image into multiple RGBA layers.
qwen-image-layered
image-to-image

Qwen-Image-Layered is a model capable of decomposing an image into multiple RGBA layers.

qwen
layer
 Recraft V4.1 builds on the design-first foundation of V4 with sharper prompt control and cleaner composition. Tuned for brand systems and editorial work, it delivers production-ready raster images that hold up next to a designer's hand.
new
recraft/v4.1/text-to-image
text-to-image

Recraft V4.1 builds on the design-first foundation of V4 with sharper prompt control and cleaner composition. Tuned for brand systems and editorial work, it delivers production-ready raster images that hold up next to a designer's hand.

stylized
transform
typography
MiniMax Hailuo-02 Text To Video API (Standard, 768p): Advanced video generation model with 768p resolution
minimax/hailuo-02/standard/text-to-video
text-to-video

MiniMax Hailuo-02 Text To Video API (Standard, 768p): Advanced video generation model with 768p resolution

Run SDXL at the speed of light
fast-lightning-sdxl
text-to-image

Run SDXL at the speed of light

diffusion
lightning
real-time
Qwen Image 2512 is an improved version of Qwen Image with better text rendering, finer natural textures, and more realistic human generation.
qwen-image-2512
text-to-image

Qwen Image 2512 is an improved version of Qwen Image with better text rendering, finer natural textures, and more realistic human generation.

qwen
2512
Meshy-6 is the latest model from Meshy. It generates realistic and production ready 3D models.
meshy/v6/image-to-3d
image-to-3d

Meshy-6 is the latest model from Meshy. It generates realistic and production ready 3D models.

Endpoint for Qwen's Image Editing 2511 model with LoRa support.
qwen-image-edit-2511/lora
image-to-image

Endpoint for Qwen's Image Editing 2511 model with LoRa support.

stylized
transform
lora
Clone a voice from a sample audio and generate speech from text prompts using the MiniMax model, which leverages advanced AI techniques to create high-quality text-to-speech.
minimax/voice-clone
text-to-speech

Clone a voice from a sample audio and generate speech from text prompts using the MiniMax model, which leverages advanced AI techniques to create high-quality text-to-speech.

speech
MiniMax Hailuo-02 Text To Video API (Pro, 1080p): Advanced video generation model with 1080p resolution
minimax/hailuo-02/pro/text-to-video
text-to-video

MiniMax Hailuo-02 Text To Video API (Pro, 1080p): Advanced video generation model with 1080p resolution

Wan 2.6 image-to-image model.
wan/v2.6/image-to-image
image-to-image

Wan 2.6 image-to-image model.

Wan 2.7 is the latest generation AI video model, delivering enhanced motion smoothness, superior scene fidelity, and greater visual coherence.
wan/v2.7/text-to-video
text-to-video

Wan 2.7 is the latest generation AI video model, delivering enhanced motion smoothness, superior scene fidelity, and greater visual coherence.

stylized
transform
lipsync
Edit videos using Kling O3 from Kling Team!
kling-video/o3/standard/video-to-video/edit
video-to-video

Edit videos using Kling O3 from Kling Team!

Get encoding metadata from video and audio files using FFmpeg API.
ffmpeg-api/metadata
json

Get encoding metadata from video and audio files using FFmpeg API.

ffmpeg
Kling O3 Omni generates new shots guided by an input reference video, preserving cinematic language such as motion, and camera style to produce seamless scene continuity.
kling-video/o3/pro/video-to-video/reference
video-to-video

Kling O3 Omni generates new shots guided by an input reference video, preserving cinematic language such as motion, and camera style to produce seamless scene continuity.

Kling LipSync is an audio-to-video model that generates realistic lip movements from audio input.
kling-video/lipsync/audio-to-video
text-to-video

Kling LipSync is an audio-to-video model that generates realistic lip movements from audio input.

audio to video
lipsync
Showing 225 to 252 of 1354 results