Generate realistic images.
realistic-vision
text-to-image

realism
diffusion
Run any audio-capable LLM with fal. Process audio files (transcription, analysis, understanding) using Gemini (Google) models. Supports wav, mp3, aiff, aac, ogg, flac, m4a. Powered by OpenRouter.
openrouter/router/audio
unknown

Kandinsky 5.0 Pro is a diffusion model for fast, high-quality image-to-video generation.
kandinsky5-pro/image-to-video
image-to-video

A high-quality British English text-to-speech model offering natural and expressive voice synthesis.
kokoro/british-english
text-to-audio

speech
Text-to-Image endpoint for Qwen-Image-Max. Qwen Image Max improves upon the Qwen Image Plus series by enhancing the realism and naturalness of images.
qwen-image-max/text-to-image
text-to-image

qwen-image
max
Replace or dub audio on an existing video with fast audio-only lip-sync.
heygen/v3/lipsync/speed
video-to-video

stylized
transform
lipsync
Wan 2.2's 14B model generates high-resolution, photorealistic images with strong prompt understanding and fine-grained visual detail.
wan/v2.2-a14b/text-to-image
text-to-image

Create high-quality images with accurate text rendering and rich knowledge details. Supports editing, style transfer, and maintaining consistent characters across multiple images.
glm-image
text-to-image

Generate video clips from your prompts using Kling 1.5 (pro)
kling-video/v1.5/pro/text-to-video
text-to-video

LTX-2.3 is a high-quality, fast AI video model available in Pro and Fast variants for text-to-video, image-to-video, and audio-to-video.
ltx-2.3/retake-video
video-to-video

stylized
transform
lipsync
Create illusions conditioned on image.
illusion-diffusion
text-to-image

composition
stylized
SAM 3.1 comes with Object Multiplex, a shared-memory approach for joint multi-object tracking that delivers faster speeds with a larger number of objects tracked.
sam-3-1/video
video-to-video

segmentation
mask
real-time
Generate high-quality video clips from text and image prompts using PixVerse v4.5.
pixverse/v4.5/text-to-video
text-to-video

stylized
transform
Generate video with audio from images using LTX-2.3
ltx-2.3-22b/image-to-video
image-to-video

Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
florence-2-large/referring-expression-segmentation
image-to-image

multimodal
vision
segmentation
Super fast text-to-image endpoint for the FLUX.1 Kontext [dev] model with LoRA support, enabling rapid and high-quality image generation using pre-trained LoRA adaptations for personalization, specific styles, brand identities, and product-specific outputs.
flux-kontext-lora/text-to-image
text-to-image

A blazing fast FLUX dev LoRA trainer for subjects and styles.
turbo-flux-trainer
training

Apply artistic styles like impressionism, cubism, or surrealism to your images.
image-apps-v2/style-transfer
image-to-image

style-transfer
Image editing endpoint for Qwen-Image-Max. Qwen Image Max improves upon the Qwen Image Plus series by enhancing the realism and naturalness of images.
qwen-image-max/edit
image-to-image

qwen-image
max
Modify a face to look younger or older while keeping identity realistic.
image-apps-v2/age-modify
image-to-image

age-transformation
face-editing
Sana Sprint is a text-to-image model capable of generating 4K images with exceptional speed.
sana/sprint
text-to-image

text to image
4k
high-speed
Latest object-erasing model from Black Forest Labs. Remove undesired objects and text from images.
new
flux-pro/v1/erase
image-to-image

utility
editing
FLUX1.1 [pro] ultra fine-tuned is the newest version of FLUX1.1 [pro] with a fine-tuned LoRA, maintaining professional-grade image quality while delivering up to 2K resolution with improved photo realism.
flux-pro/v1.1-ultra-finetuned
text-to-image

high-res
realism
Text-to-image generation with FLUX.2 [klein] 9B from Black Forest Labs and custom LoRA.
flux-2/klein/9b/lora
text-to-image

Generate images from your prompts using Luma Photon Flash. Photon Flash is the most creative, personalizable, and intelligent visual model for creatives, bringing a step-function change in the cost of high-quality image generation.
luma-photon/flash
text-to-image

Generate film-grade videos from text prompts with native audio, up to 1080p and 15 seconds, using PixVerse C1.
pixverse/c1/text-to-video
text-to-video

video-generation
pixverse
cinematic
Nucleus-Image is a text-to-image generation model built on a sparse mixture-of-experts (MoE) diffusion transformer architecture.
nucleus-image
text-to-image

stylized
transform
typography
Fine-tune FLUX.2 [dev] from Black Forest Labs with custom datasets. Create specialized LoRA adaptations for specific styles and domains.
flux-2-trainer-v2
training
