
Place your subject in any scene you imagine, from enchanted forests to urban settings, with professional composition and lighting

SDXL with an alpha channel.

Generate video with audio from text using LTX-2.3 Distilled

Place a person’s photo into iconic cities worldwide.
![Use Qwen 3 Guard [8B] to detect and classify text as safe or harmful, delivering precise and reliable safety categorization.](https://refinery.fal.media/url/https%3A%2F%2Fv3b.fal.media%2Ffiles%2Fb%2Fkangaroo%2FmbftwyiNyy5hXuWPI8jdr_b9e8889f91994e318b188ac9db9089aa.jpg/tr:w-1920,q-80/mbftwyiNyy5hXuWPI8jdr_b9e8889f91994e318b188ac9db9089aa.webp)
Use Qwen 3 Guard [8B] to detect and classify text as safe or harmful, delivering precise and reliable safety categorization.

Create your imagined 3D models with just text. Production-ready, export-ready professional assets with realistic lighting and materials in minutes.

Transform images into 3D cartoon artwork using an AI model that applies cartoon stylization while preserving the original image's composition and details.

Train Ideogram on your photos, your style, your subject, your look, from a small set of reference images to images that feel consistently yours

Transform existing images with Ideogram V2's editing capabilities. Modify, adjust, and refine images while maintaining high fidelity and realistic outputs with precise prompt control.

A unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment.

Framepack is an efficient Image-to-video model that autoregressively generates videos.

Bria's Text-to-Image model with perfect harmony of latency and quality. Trained exclusively on licensed data for safe and risk-free commercial use. Available also as source code and weights. For access to weights: https://bria.ai/contact-us
![Fine-tune FLUX.2 [dev] from Black Forest Labs with custom datasets. Create specialized LoRA adaptations for specific editing tasks.](https://refinery.fal.media/url/https%3A%2F%2Fv3b.fal.media%2Ffiles%2Fb%2Frabbit%2FQQxycBXjY75hch-HBAQKZ_4af8ba3ddb9d457ba5fc51fcd428e720.jpg/tr:w-1920,q-80/QQxycBXjY75hch-HBAQKZ_4af8ba3ddb9d457ba5fc51fcd428e720.webp)
Fine-tune FLUX.2 [dev] from Black Forest Labs with custom datasets. Create specialized LoRA adaptations for specific editing tasks.

MultiShotMaster is a controllable multi-shot narrative video generation framework that supports text-driven inter-shot consistency, variable shot counts and shot durations, customized subject with motion control, and background-driven customized scene.

Generate full portrait from a cropped face photo

Bria's Text-to-Image model, trained exclusively on licensed data for safe and risk-free commercial use. Available also as source code and weights. For access to weights: https://bria.ai/contact-us

Generate video from text and images using NVIDIA's 2B Cosmos Post-Trained Model

Enhance muffled 16 kHz speech audio into crystal-clear 48 kHz, with denoising for particularly bad inputs.

Fooocus extreme speed mode as a standalone app.

Interpolate images with RIFE - Real-Time Intermediate Flow Estimation

ffmpeg utility to interleave videos

Fix low resolution images with fast speed and quality of thera.

Create variations of existing images with Ideogram V2A Remix while maintaining creative control through prompt guidance.

Generate video with audio from videos using LTX-2.3

A highly efficient Mandarin Chinese text-to-speech model that captures natural tones and prosody.

Generate videos from prompts and images using LTX Video-0.9.7 13B and custom LoRA

An expressive and natural French text-to-speech model for both European and Canadian French.

Generate a video from a text prompt with Marey, a generative video model trained exclusively on fully licensed data.