Fast, reliable, and cost-efficient.

fal adapts to your usage, ensuring you only pay for the computing power you consume. It's cost-effective scalability at its best.

GPU Pricing

Deploy your own app on our competitively priced GPU fleet.

Competitive pricing for custom deployments. Get H100s from as low as $1.99/hr. Contact support@fal.ai to get started.

GPU     VRAM     Price per hour*   Price per second*
H100    80GB     $1.89/h           $0.0005/s
H200    141GB    $2.10/h           $0.0006/s
A100    40GB     $0.99/h           $0.0003/s
B200    184GB    contact us        contact us

*starting at
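
As a rough illustration of how the hourly and per-second figures relate, the sketch below divides each hourly "starting at" rate by 3600 and multiplies by a runtime in seconds. It assumes billing is strictly linear in seconds with no minimum charge or rounding, which this page does not state; the function names are made up for the example.

```python
# Hourly "starting at" rates copied from the table above (USD per hour).
# B200 is omitted because its price is listed as "contact us".
HOURLY_RATES_USD = {"H100": 1.89, "H200": 2.10, "A100": 0.99}

SECONDS_PER_HOUR = 3600


def per_second_rate(gpu: str) -> float:
    """Hourly rate divided by 3600 (hypothetical helper, not a fal API)."""
    return HOURLY_RATES_USD[gpu] / SECONDS_PER_HOUR


def estimate_cost(gpu: str, seconds: float) -> float:
    """Estimated cost in USD, ASSUMING purely linear per-second billing."""
    return per_second_rate(gpu) * seconds


if __name__ == "__main__":
    for gpu in HOURLY_RATES_USD:
        # Rounded to 4 decimals to compare with the per-second column.
        print(gpu, round(per_second_rate(gpu), 4))
    # Example: a 90-second job on an A100 at the starting rate.
    print(round(estimate_cost("A100", 90), 4))
```

Rounded to four decimals, the computed rates match the per-second column, which suggests that column is simply the hourly price divided by 3600. Actual invoices may differ from this estimate.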

Output-Based Pricing

Models deployed by us are priced by the output they generate, such as videos, images, or tokens.

Note: Some models not shown here may use GPU-based pricing even if they generate media.

Video Models

Video models are billed by output unit — per second or per video — depending on the model. The values shown represent output-based pricing.

Model                  Unit            Price     Output per $1
Hunyuan Video          video           $0.4      3 videos
Kling 1.6 Pro Video    video second    $0.095    11 video seconds
Kling 2 Master Video   video second    $0.28     4 video seconds
Alibaba Wan Video      video           $0.4      3 videos
MiniMax Video Live     video           $0.5      2 videos

* For a fair comparison, we've normalized these values to show approximate output per $1.

* Based on an estimated average video of 5 seconds at 720p. Actual output may vary based on model, resolution, and prompt complexity.
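
The "Output per $1" normalization can be sketched as one dollar divided by the unit price, rounded to a whole number. The rounding rule used here (round half up) is an assumption: it reproduces the figures in the table but is not stated on this page. The second helper applies the 5-second average clip length from the note above to the per-second models; both function names are hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

# Unit prices copied from the video table above (USD per unit).
VIDEO_PRICES = {
    "Hunyuan Video": ("video", "0.4"),
    "Kling 1.6 Pro Video": ("video second", "0.095"),
    "Kling 2 Master Video": ("video second", "0.28"),
    "Alibaba Wan Video": ("video", "0.4"),
    "MiniMax Video Live": ("video", "0.5"),
}


def output_per_dollar(price: str) -> int:
    """Units obtainable for $1, ASSUMING round-half-up to a whole unit."""
    units = Decimal("1") / Decimal(price)
    return int(units.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


def clip_cost(price_per_second: str, seconds: int = 5) -> Decimal:
    """Cost of one clip on a per-second model (5 s is the page's stated average)."""
    return Decimal(price_per_second) * seconds


if __name__ == "__main__":
    for model, (unit, price) in VIDEO_PRICES.items():
        print(f"{model}: {output_per_dollar(price)} {unit}s per $1")
    # A 5-second clip on a per-second model, for comparison with per-video prices.
    print(clip_cost("0.095"))
```

Decimal arithmetic is used so that a price like $0.4 (exactly 2.5 videos per dollar) rounds predictably rather than depending on floating-point representation.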

Image Models

Image models are billed by either image count or output size in megapixels (MP). All pricing here is normalized to 1MP to allow simple comparison.

Model                         Unit         Price     Output per $1
FLUX.1 [dev]                  megapixel    $0.025    40 megapixels
FLUX.1 [schnell]              megapixel    $0.003    333 megapixels
FLUX.1 [pro]                  megapixel    $0.05     20 megapixels
Stable Diffusion 3 - Medium   image        $0.035    29 images

Output is based on 1MP images. Higher resolutions will be priced proportionally. All models listed here follow output-based pricing. Some other models may use GPU-based pricing depending on architecture.
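
To make "priced proportionally" concrete, here is a minimal sketch that scales a per-megapixel price by image area. It assumes 1 MP means 1,000,000 pixels and that no rounding or minimum applies; the page does not define either, so treat the result as an estimate. The function name is hypothetical.

```python
# Per-megapixel prices copied from the image table above (USD per MP).
PRICE_PER_MP_USD = {
    "FLUX.1 [dev]": 0.025,
    "FLUX.1 [schnell]": 0.003,
    "FLUX.1 [pro]": 0.05,
}

PIXELS_PER_MEGAPIXEL = 1_000_000  # ASSUMPTION: decimal megapixels


def image_cost(model: str, width: int, height: int) -> float:
    """Estimated cost in USD, scaling the 1MP price by image area."""
    megapixels = (width * height) / PIXELS_PER_MEGAPIXEL
    return PRICE_PER_MP_USD[model] * megapixels


if __name__ == "__main__":
    # A 1000x1000 image is exactly 1 MP under the assumption above.
    print(image_cost("FLUX.1 [dev]", 1000, 1000))
    # Doubling the pixel count doubles the estimated price.
    print(image_cost("FLUX.1 [dev]", 2000, 1000))
```

Models billed per image, such as Stable Diffusion 3 - Medium in the table, are left out because their price does not depend on area in this scheme.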