
VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

Generate long videos in 720p/30fps from text using LongCat Video

Generate long speech snippets fast using Microsoft's powerful TTS.

Start with a simple text input to create dynamic generations that defy expectations. Anything you dream can come to life with sharp details, impressive character control and cinematic camera moves.

Generate Images with ControlNet.

Generate 3D models from your images using Hunyuan 3D. A native 3D generative model enabling versatile and high-quality 3D asset creation.

Train custom LoRAs for Wan-2.2 T2V/I2V 480P

Edit images with natural language

Removes harsh shadows and light spots from images, replacing them with soft, even, natural-looking illumination.

Anime finetune of Würstchen V3.
![Train styles, people and other subjects at blazing speeds using the FLUX.1 Krea [dev] base model.](https://refinery.fal.media/url/https%3A%2F%2Fv3.fal.media%2Ffiles%2Frabbit%2FuKINGMekBEYrVNUULujts_RVU-Kvlhsr5rEwqG7Uc-s_56e80afe7c1243d5a2f5eed5868ae63d.jpg/tr:w-1920,q-80/uKINGMekBEYrVNUULujts_RVU-Kvlhsr5rEwqG7Uc-s_56e80afe7c1243d5a2f5eed5868ae63d.webp)
Train styles, people and other subjects at blazing speeds using the FLUX.1 Krea [dev] base model.

Apply film grain effect with different styles (modern, analog, kodak, fuji, cinematic, newspaper) and customizable intensity and scale

Generate videos from prompts using LTX Video-0.9.7 13B and custom LoRA

Get waveform data from audio files using FFmpeg API.

High-quality text-to-image model by Baidu. Supports English, Chinese, and Japanese prompts with built-in prompt expansion.

Erase unwanted objects, people, or elements from video with a text prompt. High-fidelity output with strong temporal consistency, trained on licensed data for safe commercial use.

Hunyuan Video is an Open video generation model with high visual quality, motion diversity, text-video alignment, and generation stability. Use this endpoint to generate videos from videos.

MAGI-1 distilled generates videos faster from images with exceptional understanding of physical interactions and prompting

Qwen Image LoRA training

Convert your assets into lottie using Omnilottie.

Generate video with audio from audio, text and images using LTX-2 and custom LoRA

Generate seamlessly tiling photorealistic images from text using Z-Image Turbo and custom LoRA

Generate high quality and fast video clips from text and image prompts using PixVerse v4.5 fast

Ballpoint pen sketch drawing style

YuE is a groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into full songs.

Use the capabilities of hunyuan part to generate point clouds from your 3D files.

F Lite is a 10B parameter diffusion model created by Fal and Freepik, trained exclusively on copyright-safe and SFW content.

Remove existing lighting and apply soft, even illumination