Model Gallery

See all available model APIs provided by fal.ai
Text to Video

Veo 3

Veo 3 by Google, the most advanced AI video generation model in the world. Now available at fal with sound on!

Kling 2.1 Master

Kling 2.1 Master: The premium endpoint for Kling 2.1, designed for top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Category

All Models

Explore all available models provided by fal.ai

background texture
fal-ai/wan-pro/image-to-video
image-to-video

Wan-2.1 Pro is a premium image-to-video model that generates high-quality 1080p videos at 30fps with up to 6 seconds duration, delivering exceptional visual quality and motion diversity from images

image to video
motion
background texture
fal-ai/wan-i2v
image-to-video

Wan-2.1 is a image-to-video model that generates high-quality videos with high visual quality and motion diversity from images

image to video
motion
background texture
fal-ai/kling-video/v1.6/pro/image-to-video
image-to-video

Generate video clips from your images using Kling 1.6 (pro)

background texture
fal-ai/flux-lora-fast-training
training

Train styles, people and other subjects at blazing speeds.

lora
personalization
background texture
fal-ai/playai/tts/dialog
text-to-audio

Generate natural-sounding multi-speaker dialogues, and audio. Perfect for expressive outputs, storytelling, games, animations, and interactive media.

audio
background texture
fal-ai/flux-pro/v1.1-ultra
text-to-image

FLUX1.1 [pro] ultra is the newest version of FLUX1.1 [pro], maintaining professional-grade image quality while delivering up to 2K resolution with improved photo realism.

high-res
realism
background texture
fal-ai/recraft/v3/text-to-image
text-to-image

Recraft V3 is a text-to-image model with the ability to generate long texts, vector art, images in brand style, and much more. As of today, it is SOTA in image generation, proven by Hugging Face's industry-leading Text-to-Image Benchmark by Artificial Analysis.

vector
typography
style
background texture
fal-ai/minimax/video-01/image-to-video
image-to-video

Generate video clips from your images using MiniMax Video model

motion
transformation
background texture
fal-ai/minimax/hailuo-02/standard/text-to-video
text-to-video

MiniMax Hailuo-02 Text To Video API (Standard, 768p): Advanced video generation model with 768p resolution

new
background texture
fal-ai/kling-video/v2.1/standard/image-to-video
image-to-video

Kling 2.1 Standard is a cost-efficient endpoint for the Kling 2.1 model, delivering high-quality image-to-video generation

new
background texture
fal-ai/tavus/hummingbird-lipsync/v0
video-to-video

Generate lip sync using Tavus' state-of-the-art model for high-quality synchronization.

background texture
fal-ai/kling-video/v2/master/text-to-video
text-to-video

Generate video clips from your prompts using Kling 2.0 Master

background texture
fal-ai/hidream-i1-full
text-to-image

HiDream-I1 full is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.

background texture
fal-ai/hidream-i1-dev
text-to-image

HiDream-I1 dev is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.

background texture
fal-ai/hidream-i1-fast
text-to-image

HiDream-I1 fast is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within 16 steps.

background texture
fal-ai/flux/dev
text-to-image

FLUX.1 [dev] is a 12 billion parameter flow transformer that generates high-quality images from text. It is suitable for personal and commercial use.

background texture
fal-ai/mmaudio-v2
video-to-video

MMAudio generates synchronized audio given video and/or text inputs. It can be combined with video models to get videos with audio.

ai video
fast
background texture
fal-ai/ideogram/v2
text-to-image

Generate high-quality images, posters, and logos with Ideogram V2. Features exceptional typography handling and realistic outputs optimized for commercial and creative use.

realism
typography
background texture
fal-ai/flux-lora-portrait-trainer
training

FLUX LoRA training optimized for portrait generation, with bright highlights, excellent prompt following and highly detailed results.

lora
personalization
background texture
fal-ai/stable-diffusion-v35-large
text-to-image

Stable Diffusion 3.5 Large is a Multimodal Diffusion Transformer (MMDiT) text-to-image model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency.

diffusion
typography
style
background texture
fal-ai/flux-lora/inpainting
text-to-image

Super fast endpoint for the FLUX.1 [dev] inpainting model with LoRA support, enabling rapid and high-quality image inpaingting using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora
personalization
background texture
fal-ai/flux-general
text-to-image

A versatile endpoint for the FLUX.1 [dev] model that supports multiple AI extensions including LoRA, ControlNet conditioning, and IP-Adapter integration, enabling comprehensive control over image generation through various guidance methods.

lora
controlnet
ip-adapter
background texture
fal-ai/flux-lora
text-to-image

Super fast endpoint for the FLUX.1 [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.

lora
personalization
background texture
fal-ai/flux/dev/image-to-image
image-to-image

FLUX.1 Image-to-Image is a high-performance endpoint for the FLUX.1 [dev] model that enables rapid transformation of existing images, delivering high-quality style transfers and image modifications with the core FLUX capabilities.

style transfer
background texture
fal-ai/aura-sr
image-to-image

Upscale your images with AuraSR.

upscaling
high-res
background texture
fal-ai/clarity-upscaler
image-to-image

Clarity upscaler for upscaling images with high very fidelity.

upscaling
background texture
fal-ai/wan-vace-14b/reframe
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/wan-vace-14b/outpainting
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/wan-vace-14b/inpainting
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/wan-vace-14b/pose
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video
background texture
fal-ai/wan-vace-14b/depth
video-to-video

VACE is a video generation model that uses a source image, mask, and video to create prompted videos with controllable sources.

new
image-to-video
video-to-video
text-to-video