
The reframe endpoint intelligently adjusts an image's aspect ratio while preserving the main subject's position, composition, pose, and perspective

Text to Speech Endpoint for Inworld's TTS-1.5 Max.

Wan 2.6 image-to-video flash model.

MMAudio generates synchronized audio given text inputs. It can generate sounds described by a prompt.

Run any video-capable LLM with fal. Analyze, summarize, and understand video files using Gemini (Google) models. Supports mp4, mpeg, mov, webm, and YouTube links. Powered by OpenRouter.

LoRA inference endpoint for Qwen Image 2512, an improved version of Qwen Image with better text rendering, finer natural textures, and more realistic human generation.

Generate video clips from your images using Kling 1.5 (pro)

ImagineArt 1.5 Pro is an advanced text-to-image model that creates ultra-high-fidelity 4K visuals with lifelike realism, refined aesthetics, and powerful creative output suited for professional use.

Moondream 3 is a vision language model that brings frontier-level visual reasoning with native object detection, pointing, and OCR capabilities to real-world applications requiring fast, inexpensive inference at scale.

Fast LoRA trainer for Z-Image-Turbo, a super fast text-to-image model of 6B parameters developed by Tongyi-MAI.
Isolate audio tracks using ElevenLabs advanced audio isolation technology.
![Image-to-image editing with LoRA support for FLUX.2 [klein] 4B Base from Black Forest Labs. Specialized style transfer and domain-specific modifications.](https://refinery.fal.media/url/https%3A%2F%2Fv3b.fal.media%2Ffiles%2Fb%2F0a8b09b3%2Fck_nRVKlUom4-4_5qfG7t_117a3ccf9f9541aeb83e5ffa75564e6d.jpg/tr:w-1920,q-80/ck_nRVKlUom4-4_5qfG7t_117a3ccf9f9541aeb83e5ffa75564e6d.webp)
Image-to-image editing with LoRA support for FLUX.2 [klein] 4B Base from Black Forest Labs. Specialized style transfer and domain-specific modifications.

Restore old or damaged photos by fixing colors, scratches, and resolution.

Wan 2.5 image-to-image model.

Generate natural, clear speeches using Index TTS 2.0 from IndexTeam

Reimagine existing images with Ideogram V3's remix feature. Create variations and adaptations while preserving core elements and adding new creative directions through prompt guidance.

Generate high quality video clips from text and image prompts using PixVerse v4.5
FLUX1.1 [pro] Redux is a high-performance endpoint for the FLUX1.1 [pro] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

Extend videos with xAI's Grok Imagine video model

Generate video clips from your images using MiniMax Video model

Unified image generation with HiDream-O1-Image. Create, edit, and personalize high-resolution images up to 2K—single native model handles text-to-image, editing, and custom subjects without external components.

MiniMax Hailuo-2.3 Text To Video API (Pro, 1080p): Advanced text-to-video generation model with 1080p resolution

Generate audio from input videos using Kling

LoRA endpoint for the Qwen Image Edit Plus model.

Recraft V4 was developed with designers to bring true visual taste to AI image generation. Built for brand systems and production-ready workflows, it goes beyond prompt accuracy — delivering stronger composition, refined lighting, realistic materials, and a cohesive aesthetic. The result is imagery shaped by professional design judgment, ready for immediate real-world use without additional post-processing.

FLUX Control LoRA Canny is a high-performance endpoint that uses a control image to transfer structure to the generated image, using a Canny edge map.
![Text-to-image generation with LoRA support for FLUX.2 [klein] 9B Base from Black Forest Labs. Custom style adaptation and fine-tuned model variations.](https://refinery.fal.media/url/https%3A%2F%2Fv3b.fal.media%2Ffiles%2Fb%2F0a8b09ab%2F3my7lbot7weIdE03-d5xc_2da235d3c4d14924b2c7a03f47e1bd65.jpg/tr:w-1920,q-80/3my7lbot7weIdE03-d5xc_2da235d3c4d14924b2c7a03f47e1bd65.webp)
Text-to-image generation with LoRA support for FLUX.2 [klein] 9B Base from Black Forest Labs. Custom style adaptation and fine-tuned model variations.

Generate video clips from your images using MiniMax Video model