
Generate music from text prompts using the MiniMax Music 2.0 model, which leverages advanced AI techniques to create high-quality, diverse musical compositions.

Google's famous original image generation and editing model, a.k.a Nano Banana

Generate video clips from your images using Kling 1.6 (pro)

Wan 2.7 is the latest generation AI video model, delivering enhanced motion smoothness, superior scene fidelity, and greater visual coherence.

Pixverse's latest V6 Model

Transform images, elements, and text into consistent, high-quality video scenes, ensuring stable character identity, object details, and environments.

Remove backgrounds from existing images with Ideogram's remove background feature. Isolate subjects cleanly for compositing and creative reuse.

Generate 1080p video with synchronized native audio from a text prompt. Aspect ratios: 16:9, 9:16, 1:1, 4:3, 3:4. Duration: 3–15s.

Generate Videos from images using Google's Veo 3.1

Generate videos with audio with Seedance 1.5

Endpoint for Qwen's Image Editing 2511 model.

Transfer movements from a reference video to any character image. Pro mode delivers higher quality output, ideal for complex dance moves and gestures.
![Image-to-image editing with FLUX.2 [klein] 9B from Black Forest Labs. Precise modifications using natural language descriptions and hex color control.](https://refinery.fal.media/url/https%3A%2F%2Fv3b.fal.media%2Ffiles%2Fb%2F0a8a7f50%2FX8ffS5h55gcigsNZoNC7O_52e6b383ac214d2abe0a2e023f03de88.jpg/tr:w-1920,q-80/X8ffS5h55gcigsNZoNC7O_52e6b383ac214d2abe0a2e023f03de88.webp)
Image-to-image editing with FLUX.2 [klein] 9B from Black Forest Labs. Precise modifications using natural language descriptions and hex color control.

Google’s highest quality image generation model

Generate high quality, realistic music with fine controls using Elevenlabs Music!

Converts a given raster image to SVG format using Recraft model.

Open source text-to-audio model.

Image-to-video endpoint for Sora 2 Pro, OpenAI's state-of-the-art video model capable of creating richly detailed, dynamic clips with audio from natural language or images.

Wan 2.5 image-to-video model.

Transfer movements from a reference video to any character image. Cost-effective mode for motion transfer, perfect for portraits and simple animations.

Compose videos from multiple media sources using FFmpeg API.

Frontier image editing model.
![Image-to-image editing with FLUX.2 [dev] from Black Forest Labs. Precise modifications using natural language descriptions and hex color control—in a flash.](https://refinery.fal.media/url/https%3A%2F%2Fv3b.fal.media%2Ffiles%2Fb%2F0a871484%2FfjLSktGKoWIGQWm-GRaUM_87cd94bbbff7400b830e73b8f6f075d4.jpg/tr:w-1920,q-80/fjLSktGKoWIGQWm-GRaUM_87cd94bbbff7400b830e73b8f6f075d4.webp)
Image-to-image editing with FLUX.2 [dev] from Black Forest Labs. Precise modifications using natural language descriptions and hex color control—in a flash.

sync-3 most powerful lipsync model yet, featuring native visual intelligence for professional-quality video.

Now with a 50% price drop. Generate videos from your image prompts using Veo 3 fast.

Generate videos using multiple reference images with xAI's Grok Imagine video model
![FLUX.2 [max] delivers state-of-the-art image generation and advanced image editing with exceptional realism, precision, and consistency.](https://refinery.fal.media/url/https%3A%2F%2Fv3b.fal.media%2Ffiles%2Fb%2F0a868a0f%2FzL7LNUIqnPPhZNy_PtHJq_330f66115240460788092cb9523b6aba.jpg/tr:w-1920,q-80/zL7LNUIqnPPhZNy_PtHJq_330f66115240460788092cb9523b6aba.webp)
FLUX.2 [max] delivers state-of-the-art image generation and advanced image editing with exceptional realism, precision, and consistency.
![The FLUX.1 Kontext [pro] text-to-image delivers state-of-the-art image generation results with unprecedented prompt following, photorealistic rendering, and flawless typography.](https://refinery.fal.media/url/https%3A%2F%2Ffal.media%2Ffiles%2Fzebra%2FVOrzt92hNVLX9m9jB-7-4_deea28b6b45344d4aa4eb3be14b3478e.jpg/tr:w-1920,q-80/VOrzt92hNVLX9m9jB-7-4_deea28b6b45344d4aa4eb3be14b3478e.webp)
The FLUX.1 Kontext [pro] text-to-image delivers state-of-the-art image generation results with unprecedented prompt following, photorealistic rendering, and flawless typography.