The reframe endpoint intelligently adjusts an image's aspect ratio while preserving the main subject's position, composition, pose, and perspective
image-editing/reframe
image-to-image

The reframe endpoint intelligently adjusts an image's aspect ratio while preserving the main subject's position, composition, pose, and perspective

stylized
transform
Text to Speech Endpoint for Inworld's TTS-1.5 Max.
inworld-tts
text-to-speech

Text to Speech Endpoint for Inworld's TTS-1.5 Max.

inworld
tts
Wan 2.6 image-to-video flash model.
wan/v2.6/image-to-video/flash
image-to-video

Wan 2.6 image-to-video flash model.

MMAudio generates synchronized audio given text inputs. It can generate sounds described by a prompt.
mmaudio-v2/text-to-audio
text-to-audio

MMAudio generates synchronized audio given text inputs. It can generate sounds described by a prompt.

audio
fast
Run any video-capable LLM with fal. Analyze, summarize, and understand video files using Gemini (Google) models. Supports mp4, mpeg, mov, webm, and YouTube links. Powered by OpenRouter.
openrouter/router/video
video-to-text

Run any video-capable LLM with fal. Analyze, summarize, and understand video files using Gemini (Google) models. Supports mp4, mpeg, mov, webm, and YouTube links. Powered by OpenRouter.

LoRA inference endpoint for Qwen Image 2512, an improved version of Qwen Image with better text rendering, finer natural textures, and more realistic human generation.
qwen-image-2512/lora
text-to-image

LoRA inference endpoint for Qwen Image 2512, an improved version of Qwen Image with better text rendering, finer natural textures, and more realistic human generation.

qwen
2512
lora
Generate video clips from your images using Kling 1.5 (pro)
kling-video/v1.5/pro/image-to-video
image-to-video

Generate video clips from your images using Kling 1.5 (pro)

ImagineArt 1.5 Pro is an advanced text-to-image model that creates ultra-high-fidelity 4K visuals with lifelike realism, refined aesthetics, and powerful creative output suited for professional use.
imagineart/imagineart-1.5-pro-preview/text-to-image
text-to-image

ImagineArt 1.5 Pro is an advanced text-to-image model that creates ultra-high-fidelity 4K visuals with lifelike realism, refined aesthetics, and powerful creative output suited for professional use.

visuals
imagineart
realism
Moondream 3 is a vision language model that brings frontier-level visual reasoning with native object detection, pointing, and OCR capabilities to real-world applications requiring fast, inexpensive inference at scale.
moondream3-preview/caption
vision

Moondream 3 is a vision language model that brings frontier-level visual reasoning with native object detection, pointing, and OCR capabilities to real-world applications requiring fast, inexpensive inference at scale.

vision
Fast LoRA trainer for Z-Image-Turbo, a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
z-image-turbo-trainer-v2
training

Fast LoRA trainer for Z-Image-Turbo, a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.

lora
personalization
trainer
Isolate audio tracks using ElevenLabs advanced audio isolation technology.
elevenlabs/audio-isolation
audio-to-audio

Isolate audio tracks using ElevenLabs advanced audio isolation technology.

audio
Image-to-image editing with LoRA support for FLUX.2 [klein] 4B Base from Black Forest Labs. Specialized style transfer and domain-specific modifications.
flux-2/klein/4b/base/edit/lora
image-to-image

Image-to-image editing with LoRA support for FLUX.2 [klein] 4B Base from Black Forest Labs. Specialized style transfer and domain-specific modifications.

Restore old or damaged photos by fixing colors, scratches, and resolution.
image-apps-v2/photo-restoration
image-to-image

Restore old or damaged photos by fixing colors, scratches, and resolution.

photo-restoration
image-enhance
Wan 2.5 image-to-image model.
wan-25-preview/image-to-image
image-to-image

Wan 2.5 image-to-image model.

Generate natural, clear speeches using Index TTS 2.0 from IndexTeam
index-tts-2/text-to-speech
text-to-speech

Generate natural, clear speeches using Index TTS 2.0 from IndexTeam

Reimagine existing images with Ideogram V3's remix feature. Create variations and adaptations while preserving core elements and adding new creative directions through prompt guidance.
ideogram/v3/remix
image-to-image

Reimagine existing images with Ideogram V3's remix feature. Create variations and adaptations while preserving core elements and adding new creative directions through prompt guidance.

realism
typography
Generate high quality video clips from text and image prompts using PixVerse v4.5
pixverse/v4.5/image-to-video
image-to-video

Generate high quality video clips from text and image prompts using PixVerse v4.5

stylized
transform
FLUX1.1 [pro] Redux is a high-performance endpoint for the FLUX1.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.
flux-pro/v1.1/redux
image-to-image

FLUX1.1 [pro] Redux is a high-performance endpoint for the FLUX1.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
Extend videos with xAI's Grok Imagine video model
xai/grok-imagine-video/extend-video
video-to-video

Extend videos with xAI's Grok Imagine video model

video-edit
v2v
grok
Generate video clips from your images using MiniMax Video model
minimax/video-01-live/image-to-video
image-to-video

Generate video clips from your images using MiniMax Video model

motion
transformation
Unified image generation with HiDream-O1-Image. Create, edit, and personalize high-resolution images up to 2K—single native model handles text-to-image, editing, and custom subjects without external components.
new
hidream-o1-image/dev/edit
image-to-image

Unified image generation with HiDream-O1-Image. Create, edit, and personalize high-resolution images up to 2K—single native model handles text-to-image, editing, and custom subjects without external components.

MiniMax Hailuo-2.3 Text To Video API (Pro, 1080p): Advanced text-to-video generation model with 1080p resolution
minimax/hailuo-2.3/pro/text-to-video
text-to-video

MiniMax Hailuo-2.3 Text To Video API (Pro, 1080p): Advanced text-to-video generation model with 1080p resolution

Generate audio from input videos using Kling
kling-video/video-to-audio
video-to-audio

Generate audio from input videos using Kling

LoRA endpoint for the Qwen Image Edit Plus model.
qwen-image-edit-plus-lora
image-to-image

LoRA endpoint for the Qwen Image Edit Plus model.

image-editing
Recraft V4 was developed with designers to bring true visual taste to AI image generation. Built for brand systems and production-ready workflows, it goes beyond prompt accuracy — delivering stronger composition, refined lighting, realistic materials, and a cohesive aesthetic. The result is imagery shaped by professional design judgment, ready for immediate real-world use without additional post-processing.
recraft/v4/pro/text-to-vector
text-to-image

Recraft V4 was developed with designers to bring true visual taste to AI image generation. Built for brand systems and production-ready workflows, it goes beyond prompt accuracy — delivering stronger composition, refined lighting, realistic materials, and a cohesive aesthetic. The result is imagery shaped by professional design judgment, ready for immediate real-world use without additional post-processing.

text-to-vector
FLUX Control LoRA Canny is a high-performance endpoint that uses a control image to transfer structure to the generated image, using a Canny edge map.
flux-control-lora-canny
text-to-image

FLUX Control LoRA Canny is a high-performance endpoint that uses a control image to transfer structure to the generated image, using a Canny edge map.

lora
style transfer
Text-to-image generation with LoRA support for FLUX.2 [klein] 9B Base from Black Forest Labs. Custom style adaptation and fine-tuned model variations.
flux-2/klein/9b/base/lora
text-to-image

Text-to-image generation with LoRA support for FLUX.2 [klein] 9B Base from Black Forest Labs. Custom style adaptation and fine-tuned model variations.

Generate video clips from your images using MiniMax Video model
minimax/video-01/image-to-video
image-to-video

Generate video clips from your images using MiniMax Video model

motion
transformation
Showing 449 to 476 of 1354 results