
Rapidly create image variations with Ideogram V2 Turbo Remix. Fast and efficient reimagining of existing images while maintaining creative control through prompt guidance.

GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.

Generate long videos from images using LongCat Video Distilled
![FLUX.1 Krea [dev] Redux is a high-performance endpoint for the FLUX.1 Krea [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.](https://refinery.fal.media/url/https%3A%2F%2Fstorage.googleapis.com%2Ffal_cdn%2Ffal%2FUpscale.jpg/tr:w-1920,q-80/Upscale.webp)
FLUX.1 Krea [dev] Redux is a high-performance endpoint for the FLUX.1 Krea [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

Kling LipSync is a text-to-video model that generates realistic lip movements from text input.

Leverage the rapid processing capabilities of AI models to enable accurate and efficient real-time speech-to-text transcription.

Add immersive sound effects and background music to your videos using PixVerse sound effects generation

Finegrain Eraser removes any object selected with a bounding box—along with its shadows, reflections, and lighting artifacts—seamlessly reconstructing the scene with contextually accurate content.

Generate video with audio from reference video, text and images using LTX-2.3

VACE Fun for Wan 2.2 A14B from Alibaba-PAI

FFMPEG Utility to Reverse Videos

ZoeDepth preprocessor.

Pull motion from a reference video and apply it to new subjects or scenes.

Change facial expressions in photos to any emotion you desire, from smiles to serious looks.

Transform any person into their baby version, while preserving the original pose and expression with childlike features.

LoRA trainer for Qwen Image Edit 2511

Vidu Q1 Text to Video generates high-quality 1080p videos with exceptional visual quality and motion diversity

Convert plain text into Fibo-Lite's transparent JSON-structured prompts - Bria's unique controllability layer that no closed model offers. Built for agentic and enterprise workflows.

Generate video clips from your prompts using Kling 1.6 (pro)

Change facial expressions in photos with realistic results.

Generate video from audio using LTX-2
![FLUX.1 SRPO [dev] is a 12 billion parameter flow transformer that generates high-quality images from text with incredible aesthetics. It is suitable for personal and commercial use.](https://refinery.fal.media/url/https%3A%2F%2Ffal.media%2Ffiles%2Fpanda%2FLw4P1PGZPkZkAPI3u_Mxt_709597e8d0024e10ab25dfdf31963d0a.jpg/tr:w-1920,q-80/Lw4P1PGZPkZkAPI3u_Mxt_709597e8d0024e10ab25dfdf31963d0a.webp)
FLUX.1 SRPO [dev] is a 12 billion parameter flow transformer that generates high-quality images from text with incredible aesthetics. It is suitable for personal and commercial use.

FFMPEG Untility for Extracting nth Frame

Extend video with audio using LTX-2

Generate video with audio from audio, text and images using LTX-2 Distilled

Generate video with audio from text using LTX-2.3 and custom LoRA

Use the latest Vidu Q2 Pro models which much more better quality and control on your videos.

Wan 2.2 text to image LoRA trainer. Fine-tune Wan 2.2 for subjects and styles with unprecedented detail.