fal-ai/z-image/turbo/image-to-image

Generate images from text and images using Z-Image Turbo, Tongyi-MAI's super-fast 6B model.


Input

Prompt*

A young Asian woman with long, vibrant purple hair stands on a sunlit sandy beach, posing confidently with her left hand resting on her hip. She gazes directly at the camera with a neutral expression. A sleek black ribbon bow is tied neatly on the right side of her head, just above her ear. She wears a flowing white cotton dress with a fitted bodice and a flared skirt that reaches mid-calf, slightly lifted by a gentle sea breeze. The beach behind her features fine, pale golden sand with subtle footprints, leading to calm turquoise waves under a clear blue sky with soft, wispy clouds. The lighting is natural daylight, casting soft shadows to her left, indicating late afternoon sun. The horizon line is visible in the background, with a faint silhouette of distant dunes. Her skin tone is fair with a natural glow, and her facial features are delicately defined. The composition is centered on her figure, framed from mid-thigh up, with shallow depth of field blurring the distant waves slightly.

Image URL*

Provide a URL to the source image. Accepted file types: jpg, jpeg, png, webp, gif, avif

Strength

Result

{
  "images": [
    {
      "url": "https://storage.googleapis.com/falserverless/example_outputs/z-image-turbo-i2i-output.png",
      "height": 1728,
      "width": 992,
      "content_type": "image/png"
    }
  ],
  "prompt": ""
}
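
As a minimal sketch of consuming a response shaped like the sample above, the following pulls out each image's URL, dimensions, and content type. The helper name `summarize_images` is hypothetical and not part of any fal SDK; it relies only on the fields visible in the sample, so a real response may carry additional keys.

```python
import json
from typing import Dict, List

# Sample response copied from the result shown above.
SAMPLE_RESPONSE = """
{
  "images": [
    {
      "url": "https://storage.googleapis.com/falserverless/example_outputs/z-image-turbo-i2i-output.png",
      "height": 1728,
      "width": 992,
      "content_type": "image/png"
    }
  ],
  "prompt": ""
}
"""


def summarize_images(response: Dict) -> List[Dict]:
    """Hypothetical helper: collect url, size, and content type per image."""
    summaries = []
    for image in response.get("images", []):
        summaries.append(
            {
                "url": image["url"],
                "width": image["width"],
                "height": image["height"],
                "content_type": image["content_type"],
            }
        )
    return summaries


if __name__ == "__main__":
    for item in summarize_images(json.loads(SAMPLE_RESPONSE)):
        print(item)
```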

Your request will cost $0.005 per megapixel.

Z-Image Turbo | [image-to-image]

Tongyi-MAI's Z-Image Turbo delivers image-to-image generation at $0.005 per megapixel using a 6-billion-parameter architecture. The model favors inference optimization over raw parameter count, processing transformations in 8 steps or fewer while maintaining commercial-grade output quality. It is built for developers who need cost-effective image modification at scale without giving up control over the transformation process.

Built for: Product variant generation | Style transfer workflows | Rapid prototyping iterations

Performance That Scales

At $0.005 per megapixel, Z-Image Turbo is positioned as 10-15x more cost-effective than premium image generation alternatives, while keeping the number of inference steps adjustable.

Metric	Result	Context
Inference Steps	1-8 steps	Configurable via API, default 8 steps balances quality and speed
Cost per Megapixel	$0.005	200 megapixels per $1.00 on fal
Batch Size	Up to 4 images	Per request via `num_images` parameter
Strength Range	0.0-1.0	Default 0.6, lower values preserve more source structure
Related Endpoints	Z-Image Turbo LoRA	LoRA variant for custom style training
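
The per-megapixel pricing in the table can be turned into a rough cost estimate. The sketch below assumes billing is linear in output pixel count with no rounding or minimum charge, which this page does not confirm; the function name `estimate_cost_usd` is hypothetical.

```python
PRICE_PER_MEGAPIXEL_USD = 0.005  # price quoted on this page


def estimate_cost_usd(width: int, height: int, num_images: int = 1) -> float:
    """Estimate request cost, assuming linear per-megapixel billing (unconfirmed)."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    if num_images < 1:
        raise ValueError("num_images must be at least 1")
    megapixels = width * height / 1_000_000
    return megapixels * num_images * PRICE_PER_MEGAPIXEL_USD


if __name__ == "__main__":
    # The 992 x 1728 example output is about 1.71 megapixels.
    print(f"{estimate_cost_usd(992, 1728):.6f}")
    # A batch of four 1000 x 1000 images is 4 megapixels in total.
    print(f"{estimate_cost_usd(1000, 1000, num_images=4):.6f}")
```

Under these assumptions the 992 x 1728 example output comes to a little under one cent.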

Image Transformation With Strength Control

Z-Image Turbo uses a diffusion-based architecture optimized for image-to-image conditioning, where you provide both a reference image and a text prompt to guide the transformation. Unlike pure text-to-image models that start from noise, this approach preserves structural elements from your input while applying the changes you specify.

What this means for you:

Adjustable transformation intensity: Control how much the output diverges from your source image via the strength parameter (0-1 range), letting you dial in anything from subtle refinements to dramatic reimaginings
Multi-image batch processing: Generate up to 4 variations per request, useful for A/B testing different prompt variations or exploring creative options without separate API calls
Flexible resolution handling: Auto-sizing adapts to your input dimensions, with support for custom image sizes to match your workflow requirements
Accelerated inference options: Three acceleration levels (none, regular, high) let you trade generation time for cost based on your use case: prototype fast, then refine at full quality
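
The controls listed above can be gathered into a single request payload. The sketch below validates values against the ranges stated on this page before building the dictionary. Only `num_images` and `output_format` are parameter names confirmed by this page; `prompt`, `image_url`, `strength`, `num_inference_steps`, and `acceleration` are assumed names, as are the lowercase format strings, so check them against the endpoint schema before use. The function `build_arguments` is hypothetical.

```python
from typing import Any, Dict, Optional

ACCELERATION_LEVELS = ("none", "regular", "high")  # levels named on this page
OUTPUT_FORMATS = ("jpeg", "png", "webp")  # lowercase spellings are an assumption


def build_arguments(
    prompt: str,
    image_url: str,
    strength: float = 0.6,          # default stated on this page
    num_images: int = 1,
    num_inference_steps: int = 8,   # default stated on this page
    acceleration: Optional[str] = None,
    output_format: Optional[str] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: validate inputs and build a request payload."""
    if not prompt:
        raise ValueError("prompt is required")
    if not image_url:
        raise ValueError("image_url is required")
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0.0 and 1.0")
    if not 1 <= num_images <= 4:
        raise ValueError("num_images must be between 1 and 4")
    if not 1 <= num_inference_steps <= 8:
        raise ValueError("num_inference_steps must be between 1 and 8")

    arguments: Dict[str, Any] = {
        "prompt": prompt,
        "image_url": image_url,          # assumed parameter name
        "strength": strength,            # assumed parameter name
        "num_images": num_images,
        "num_inference_steps": num_inference_steps,  # assumed parameter name
    }
    # Optional settings are left out when unset so the service default applies.
    if acceleration is not None:
        if acceleration not in ACCELERATION_LEVELS:
            raise ValueError(f"acceleration must be one of {ACCELERATION_LEVELS}")
        arguments["acceleration"] = acceleration  # assumed parameter name
    if output_format is not None:
        if output_format not in OUTPUT_FORMATS:
            raise ValueError(f"output_format must be one of {OUTPUT_FORMATS}")
        arguments["output_format"] = output_format
    return arguments


if __name__ == "__main__":
    # A low strength keeps more of the source structure; four variations per call.
    print(build_arguments(
        prompt="Same scene at golden hour",
        image_url="https://example.com/source.png",  # placeholder URL
        strength=0.3,
        num_images=4,
        acceleration="high",
    ))
```

The resulting dictionary could then be passed to whichever fal client you use; that call is omitted here because it needs credentials and a network connection.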

Technical Specifications

Spec	Details
Architecture	Z-Image Turbo
Input Formats	Image URL (JPEG, PNG, WebP, GIF, AVIF) + text prompt
Output Formats	JPEG, PNG, WebP (configurable via `output_format`)
Model Size	6B parameters
License	Commercial use enabled

How It Stacks Up

Z-Image Turbo LoRA – Z-Image Turbo provides the base transformation engine at $0.005/megapixel, while the LoRA variant adds custom style training capabilities for specialized visual treatments. The LoRA endpoint trades base model simplicity for fine-tuned control when you need consistent brand aesthetics or specific artistic styles across multiple generations.

FASHN Virtual Try-On V1.5 – Z-Image Turbo handles general-purpose image transformation with flexible prompt control for $0.005/megapixel. FASHN specializes in garment placement and fit visualization for e-commerce workflows where product accuracy matters more than creative flexibility.

Image Editing endpoints (Age Progression, Wojak Style) – Z-Image Turbo offers broad transformation flexibility through natural language prompts, while specialized editing endpoints provide single-function transformations optimized for specific use cases. Choose Z-Image Turbo when you need multi-purpose image modification without switching between task-specific models.