Generative media platform for developers_

Build the next generation of creativity with fal.
Lightning-fast inference. No cold starts. Pay only for what you use.


fal Inference Engine™
is the fastest way
to run diffusion models


Run diffusion models up to 50% faster and more cost-effectively. Enable new user experiences built on our real-time infrastructure.

Where developer experience
meets the fastest AI

Painless real-time WebSocket
inference infrastructure

Blazing fast
fal Inference Engine™

0.2s

Ready for
private deployments

World-class
developer experience

import * as fal from "@fal-ai/serverless-client";

// Submit a request to the fal-ai/fast-sdxl model and wait for the result.
const result = await fal.subscribe("fal-ai/fast-sdxl", {
  input: {
    prompt: "photo of a cat wearing a kimono"
  },
  logs: true,
  // Stream log messages while the request is in progress.
  onQueueUpdate: (update) => {
    if (update.status === "IN_PROGRESS") {
      update.logs.map((log) => log.message).forEach(console.log);
    }
  },
});

Fast, reliable, cheap. Choose 3.

fal.ai adapts to your usage, ensuring you only pay for the computing power you consume. It's cost-effective scalability at its best.

Choose a budget
$20

GPU: A100
VRAM: 40GB
CPUs: 10
CPU Memory: 4GB
Price per second: $0.00111/s

SDXL with defaults
With $20, run this model with 20 inference steps approximately 10,296 times.
That's about $0.00194 per inference.
~1.75s inference time

SDXL Lightning
With $20, run this model with 4 inference steps approximately 47,415 times.
That's about $0.00042 per inference.
~0.38s inference time

Whisper v3
With $20, run this model with a 10 minute audio clip approximately 3,677 times.
That's about $0.00544 per inference.
~4.9s inference time
GPU: A6000
VRAM: 48GB
CPUs: 14
CPU Memory: 100GB
Price per second: $0.000575/s

GPU: A10G
VRAM: 24GB
CPUs: 8
CPU Memory: 32GB
Price per second: $0.00053/s