PhotoMaker

fal-ai/photomaker
Inference
Commercial use

1. Calling the API

Install the client

The client provides a convenient way to interact with the model API.

npm install --save @fal-ai/serverless-client

Set up your API key

Set FAL_KEY as an environment variable in your runtime.

export FAL_KEY="YOUR_API_KEY"
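A small fail-fast check can surface a missing key before any request is sent. The following is a minimal sketch for Node.js; `requireFalKey` is a hypothetical helper introduced here for illustration and is not part of the client.

```javascript
// Hypothetical helper (not part of @fal-ai/serverless-client):
// returns the key from the given environment object, or throws a clear error.
function requireFalKey(env = process.env) {
  const key = env.FAL_KEY;
  if (typeof key !== "string" || key.trim() === "") {
    throw new Error("FAL_KEY is not set; export it before starting the app.");
  }
  return key.trim();
}
```

Calling `requireFalKey()` once at startup turns a confusing authentication failure later on into an immediate, readable error.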

Submit a request

The client handles the API submit protocol for you: it tracks the request status updates and returns the result once the request completes.

import * as fal from "@fal-ai/serverless-client";

const result = await fal.subscribe("fal-ai/photomaker", {
  input: {
    image_archive_url: "https://storage.googleapis.com/falserverless/model_tests/photomaker/elon.zip",
    prompt: "instagram photo, portrait photo of a man img, colorful, perfect face, natural skin, hard shadows, film grain"
  },
  logs: true,
  onQueueUpdate: (update) => {
    if (update.status === "IN_PROGRESS") {
      // Wrap console.log so forEach does not also print the index and the array.
      update.logs.map((log) => log.message).forEach((message) => console.log(message));
    }
  },
});
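The queue callback can be kept small by moving the log extraction into a function of its own. This is a sketch under the assumption that an update has the shape used in the request above (`status` plus an array of `logs` with a `message` field); `collectLogMessages` is a hypothetical name, and the guard for a missing `logs` array is a defensive assumption rather than documented behavior.

```javascript
// Hypothetical helper: extract log messages from a queue update object.
// Assumes the shape { status, logs: [{ message }] }.
function collectLogMessages(update) {
  if (!update || update.status !== "IN_PROGRESS") return [];
  // Defensive assumption: tolerate updates that carry no logs array.
  const logs = Array.isArray(update.logs) ? update.logs : [];
  return logs.map((log) => log.message);
}

// Possible use inside the subscribe options:
//   onQueueUpdate: (update) =>
//     collectLogMessages(update).forEach((message) => console.log(message)),
```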

2. Authentication

The API uses an API key for authentication. We recommend setting the FAL_KEY environment variable in your runtime whenever possible.

API Key

If your app runs in an environment where you cannot set environment variables, you can set the API key manually in the client configuration.

import * as fal from "@fal-ai/serverless-client";

fal.config({
  credentials: "YOUR_FAL_KEY"
});

3. Files

Some attributes in the API accept file URLs as input. In those cases, you can pass your own URL or a Base64 data URI.

Data URI (base64)

You can pass a Base64 data URI as a file input. The API will handle the file decoding for you. Keep in mind that for large files this alternative, although convenient, can slow down the request.
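As an illustration of what such a value looks like, here is a minimal sketch that builds a data URI from raw bytes. It assumes a Node.js runtime (it relies on the global `Buffer`); `toDataUri` is a hypothetical helper name, not a client function.

```javascript
// Hypothetical helper: build a Base64 data URI from raw bytes.
// Assumes Node.js, where Buffer is available as a global.
function toDataUri(bytes, mimeType) {
  const base64 = Buffer.from(bytes).toString("base64");
  return `data:${mimeType};base64,${base64}`;
}

// Example: encode a short text payload.
const exampleUri = toDataUri(Buffer.from("Hello, World!"), "text/plain");
console.log(exampleUri);
```

Base64 encodes every 3 bytes as 4 characters, so the payload grows by roughly one third. That overhead is one reason large files are better uploaded or hosted than inlined.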

Hosted files (URL)

You can also pass your own URLs as long as they are publicly accessible. Be aware that some hosts might block cross-site requests, apply rate limits, or treat the request as bot traffic.
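A cheap syntactic check can catch obviously unusable values (relative paths, data URIs passed by mistake) before a request is submitted. This sketch uses the standard `URL` constructor; `isHttpUrl` is a hypothetical helper, and it cannot tell whether the host is publicly reachable or will block the request.

```javascript
// Hypothetical pre-flight check: is the value an absolute http(s) URL?
// This is purely syntactic; it does not contact the host.
function isHttpUrl(value) {
  try {
    const url = new URL(value);
    return url.protocol === "https:" || url.protocol === "http:";
  } catch {
    // new URL() throws on strings that are not absolute URLs.
    return false;
  }
}
```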

Uploading files

We provide file storage that lets you upload files and use them in your requests. Upload a file with the client API, then pass the returned URL in your request.

import * as fal from "@fal-ai/serverless-client";

// Upload a file (you can get a file reference from an input element or a drag-and-drop event)
const file = new File(["Hello, World!"], "hello.txt", { type: "text/plain" });
const url = await fal.storage.upload(file);

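// Use the URL in your request. Inputs go under `input`, and this model
// expects the archive URL in `image_archive_url` together with a `prompt`.
const result = await fal.subscribe("fal-ai/photomaker", {
  input: {
    image_archive_url: url,
    prompt: "portrait photo of a man img",
  },
});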

Read more about file handling in our file upload guide.

4. Schema

Input

image_archive_url (string, required)

The URL of the image archive containing the images you want to use.

prompt (string, required)

The prompt to use for generating the image. Be as descriptive as possible for best results.

base_pipeline (BasePipelineEnum)

The base pipeline to use for generating the image. Default value: "photomaker"

Possible values: "photomaker", "photomaker-style"

initial_image_url (string)

Optional initial image for img2img.

initial_image_strength (float)

How much noise to add to the latent image: 0 for no noise, 1 for maximum noise. Default value: 0.5

style (StyleEnum)

Default value: "Photographic"

Possible values: "(No style)", "Cinematic", "Disney Character", "Digital Art", "Photographic", "Fantasy art", "Neonpunk", "Enhance", "Comic book", "Lowpoly", "Line art"

negative_prompt (string)

The negative prompt to use. Use it to describe details that you don't want in the image. These could be colors, objects, scenery, or even small details (e.g. moustache, blurry, low resolution). Default value: ""

num_inference_steps (integer)

The number of inference steps. More steps can increase the amount of detail in your image. Default value: 50

style_strength (integer)

Default value: 20

num_images (integer)

Number of images to generate in one request. Note that the higher the batch size, the longer it will take to generate the images. Default value: 1

guidance_scale (float)

The CFG (Classifier Free Guidance) scale controls how closely the model follows your prompt when generating the image. Default value: 5

seed (integer)

The same seed and the same prompt given to the same version of Stable Diffusion will output the same image every time.

{
  "image_archive_url": "https://storage.googleapis.com/falserverless/model_tests/photomaker/elon.zip",
  "prompt": "instagram photo, portrait photo of a man img, colorful, perfect face, natural skin, hard shadows, film grain",
  "base_pipeline": "photomaker",
  "initial_image_strength": 0.5,
  "style": "Photographic",
  "negative_prompt": "nsfw, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry",
  "num_inference_steps": 50,
  "style_strength": 20,
  "num_images": 1,
  "guidance_scale": 5,
  "seed": 42
}
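The constraints listed above can be checked on the client before a request is submitted. The following is a rough sketch, not an official validator: `validatePhotoMakerInput` is a hypothetical helper, the enum values are copied from the schema, and the 0-to-1 range for `initial_image_strength` is an assumption inferred from its description.

```javascript
// Allowed enum values, copied from the schema above.
const BASE_PIPELINES = ["photomaker", "photomaker-style"];
const STYLES = [
  "(No style)", "Cinematic", "Disney Character", "Digital Art", "Photographic",
  "Fantasy art", "Neonpunk", "Enhance", "Comic book", "Lowpoly", "Line art",
];

// Hypothetical helper: returns a list of problems, empty when none are found.
function validatePhotoMakerInput(input) {
  const errors = [];
  // Required string fields.
  for (const field of ["image_archive_url", "prompt"]) {
    if (typeof input[field] !== "string" || input[field] === "") {
      errors.push(`${field} is required`);
    }
  }
  if (input.base_pipeline !== undefined && !BASE_PIPELINES.includes(input.base_pipeline)) {
    errors.push("base_pipeline is not a known value");
  }
  if (input.style !== undefined && !STYLES.includes(input.style)) {
    errors.push("style is not a known value");
  }
  // Assumption: the 0-to-1 range is inferred from the field description.
  const strength = input.initial_image_strength;
  if (strength !== undefined && !(typeof strength === "number" && strength >= 0 && strength <= 1)) {
    errors.push("initial_image_strength must be between 0 and 1");
  }
  // Optional integer fields.
  for (const field of ["num_inference_steps", "style_strength", "num_images", "seed"]) {
    if (input[field] !== undefined && !Number.isInteger(input[field])) {
      errors.push(`${field} must be an integer`);
    }
  }
  return errors;
}
```

Returning a list rather than throwing on the first problem lets a form or CLI report every issue at once. The server remains the authority on what it accepts.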

Output

images (list<Image>, required)

seed (integer, required)

{
  "images": [
    {
      "file_size": 1785567,
      "height": 1024,
      "file_name": "87374b9db2b74f5792839b19d9b29a9a.png",
      "content_type": "image/png",
      "url": "https://storage.googleapis.com/falserverless/model_tests/photomaker/elon-output.png",
      "width": 1024
    }
  ]
}
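Reading the generated images out of a response might look like the sketch below. It assumes the value resolved by `fal.subscribe` has the shape shown above; `summarizeOutput` is a hypothetical helper introduced for illustration.

```javascript
// Hypothetical helper: pull the image URLs and the seed out of a result.
// Assumes the result has the documented shape: { images: [{ url, ... }], seed }.
function summarizeOutput(result) {
  const images = Array.isArray(result.images) ? result.images : [];
  return {
    urls: images.map((image) => image.url),
    // May be undefined if the response carries no seed.
    seed: result.seed,
  };
}
```

If the response includes a `seed`, passing it back as the `seed` input together with the same prompt is the documented way to aim for the same image again.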