Z-Image Turbo Image to Image

fal-ai/z-image/turbo/image-to-image
Generate images from text and images using Z-Image Turbo, Tongyi-MAI's super-fast 6B model.
Inference
Commercial use
Schema

Input

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

Your request will cost $0.005 per megapixel.

Logs

Z-Image Turbo | [image-to-image]

Tongyi-MAI's Z-Image Turbo delivers image-to-image generation at $0.005 per megapixel through a 6-billion parameter architecture. Trading raw parameter count for inference optimization, this model processes transformations in 8 steps or fewer while maintaining commercial-grade output quality. Built for developers who need cost-effective image modification at scale without sacrificing control over the transformation process.

Built for: Product variant generation | Style transfer workflows | Rapid prototyping iterations


Performance That Scales

At $0.005 per megapixel, Z-Image Turbo positions 10-15x more cost-effectively than premium image generation alternatives while maintaining the flexibility of adjustable inference steps.

MetricResultContext
Inference Steps1-8 stepsConfigurable via API, default 8 steps balances quality and speed
Cost per Megapixel$0.005200 megapixels per $1.00 on fal
Batch SizeUp to 4 imagesPer request via `num_images` parameter
Strength Range0.0-1.0Default 0.6, lower values preserve more source structure
Related EndpointsZ-Image Turbo LoRALoRA variant for custom style training

Image Transformation With Strength Control

Z-Image Turbo uses a diffusion-based architecture optimized for image-to-image conditioning, where you provide both a reference image and a text prompt to guide the transformation. Unlike pure text-to-image models that start from noise, this approach preserves structural elements from your input while applying the changes you specify.

What this means for you:

  • Adjustable transformation intensity: Control how much the output diverges from your source image via the strength parameter (0-1 range), letting you dial in anything from subtle refinements to dramatic reimaginings
  • Multi-image batch processing: Generate up to 4 variations per request, useful for A/B testing different prompt variations or exploring creative options without separate API calls
  • Flexible resolution handling: Auto-sizing adapts to your input dimensions, with support for custom image sizes to match your workflow requirements
  • Accelerated inference options: Three acceleration levels (none, regular, high) let you trade generation time for cost based on your use case, prototype fast, then refine at full quality

Technical Specifications

SpecDetails
ArchitectureZ-Image Turbo
Input FormatsImage URL (JPEG, PNG, WebP, GIF, AVIF) + text prompt
Output FormatsJPEG, PNG, WebP (configurable via `output_format`)
Model Size6B parameters
LicenseCommercial use enabled

API Documentation | Quickstart Guide | Pricing


How It Stacks Up

Z-Image Turbo LoRA – Z-Image Turbo provides the base transformation engine at $0.005/megapixel, while the LoRA variant adds custom style training capabilities for specialized visual treatments. The LoRA endpoint trades base model simplicity for fine-tuned control when you need consistent brand aesthetics or specific artistic styles across multiple generations.

FASHN Virtual Try-On V1.5 – Z-Image Turbo handles general-purpose image transformation with flexible prompt control for $0.005/megapixel. FASHN specializes in garment placement and fit visualization for e-commerce workflows where product accuracy matters more than creative flexibility.

Image Editing endpoints (Age Progression, Wojak Style) – Z-Image Turbo offers broad transformation flexibility through natural language prompts, while specialized editing endpoints provide single-function transformations optimized for specific use cases. Choose Z-Image Turbo when you need multi-purpose image modification without switching between task-specific models.