Z-Image Turbo: Fast Text-to-Image + LoRA Generations

Z-Image Turbo LoRA [text-to-image]

Tongyi-MAI's Z-Image Turbo with LoRA support is a 6B-parameter text-to-image model priced at $0.0085 per megapixel on fal, with a default of 8 inference steps. Apply up to 3 custom LoRA weights at inference time without retraining, enabling style consistency and brand adaptation at approximately 118 one-megapixel generations per dollar.

Built for: Custom style application | Brand-consistent generation | Character consistency workflows | Rapid design iteration

LoRA Flexibility Without Training Overhead

Z-Image Turbo LoRA adds inference-time style customization to the base model's speed advantages. Apply pre-trained LoRA adapters directly through the API, combining multiple style influences in a single generation call without touching the underlying model weights.

What this means for you:

Apply up to 3 LoRA weights simultaneously: Combine custom styles, character adapters, and brand guidelines in a single generation request through the `loras` parameter
Batch generation at scale: Generate up to 4 images per request with configurable inference steps (1-8 range), optimizing the speed-quality tradeoff for your use case
Acceleration options: Choose between "none", "regular", or "high" acceleration modes to balance generation speed against output fidelity
Production-ready safety: Built-in safety checker (enabled by default) filters NSFW content automatically. Prompt expansion is a separate, optional add-on for enhanced detail at +$0.0025 per request
Flexible output formats: Generate images in JPEG, PNG, or WebP with landscape (4:3), portrait (3:4), or square (1:1) aspect ratios
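The limits listed above can be checked on the client before a request is sent. The following is a minimal sketch, not the official client: only the `loras` parameter name comes from this page. The other field names (`num_images`, `num_inference_steps`, `acceleration`, `output_format`, `enable_safety_checker`), the `path`/`scale` shape of each LoRA entry, and the example URLs are assumptions made for illustration and should be checked against the API documentation.

```python
from typing import Any, Dict, List

# Limits taken from the feature list above.
MAX_LORAS = 3
MAX_IMAGES = 4
MIN_STEPS, MAX_STEPS = 1, 8
ACCELERATION_MODES = ("none", "regular", "high")
OUTPUT_FORMATS = ("jpeg", "png", "webp")


def build_request(
    prompt: str,
    loras: List[Dict[str, Any]],
    num_images: int = 1,
    num_inference_steps: int = 8,        # page states 8 steps as the default
    acceleration: str = "none",          # sketch default, not necessarily the API's
    output_format: str = "png",          # sketch default, not necessarily the API's
    enable_safety_checker: bool = True,  # page states the checker is on by default
) -> Dict[str, Any]:
    """Validate the documented limits and return a request payload dict.

    Field names other than `loras` are assumed, not confirmed by the page.
    """
    if len(loras) > MAX_LORAS:
        raise ValueError(f"at most {MAX_LORAS} LoRA weights, got {len(loras)}")
    if not 1 <= num_images <= MAX_IMAGES:
        raise ValueError(f"num_images must be 1-{MAX_IMAGES}, got {num_images}")
    if not MIN_STEPS <= num_inference_steps <= MAX_STEPS:
        raise ValueError(
            f"num_inference_steps must be {MIN_STEPS}-{MAX_STEPS}, "
            f"got {num_inference_steps}"
        )
    if acceleration not in ACCELERATION_MODES:
        raise ValueError(f"acceleration must be one of {ACCELERATION_MODES}")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"output_format must be one of {OUTPUT_FORMATS}")
    return {
        "prompt": prompt,
        "loras": list(loras),
        "num_images": num_images,
        "num_inference_steps": num_inference_steps,
        "acceleration": acceleration,
        "output_format": output_format,
        "enable_safety_checker": enable_safety_checker,
    }


# Two hypothetical LoRA adapters combined in one request for four variants.
payload = build_request(
    "studio photo of a ceramic mug, brand style",
    loras=[
        {"path": "https://example.com/brand-style.safetensors", "scale": 0.8},
        {"path": "https://example.com/character.safetensors", "scale": 0.6},
    ],
    num_images=4,
    acceleration="regular",
)
```

A payload like this would typically be passed as the request arguments to the `fal-ai/z-image/turbo/lora` endpoint through fal's client library. Validating locally only catches out-of-range values early; the API remains the source of truth for accepted fields.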

Performance That Scales

Z-Image Turbo LoRA costs $0.0085/MP, which is $0.0035/MP more than the base model's $0.005/MP, in exchange for custom style application at inference time.

Metric	Result	Context
Cost per Megapixel	$0.0085	~118 one-megapixel generations per $1.00 on fal
Inference Steps	1-8 configurable	Default 8 steps balances speed and quality
Batch Capability	Up to 4 images	Single API call generates multiple variants
LoRA Support	Up to 3 concurrent weights	Apply custom styles without base model retraining
Base Model Cost	$0.005/MP	LoRA adds $0.0035/MP for style flexibility
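The pricing figures in the table can be reproduced with a little arithmetic. The sketch below assumes that one megapixel means 1,000,000 pixels, that billing scales linearly with pixel count without rounding, and that the prompt expansion fee is charged once per request. The page does not spell out any of these, so treat the results as estimates. The function names are illustrative.

```python
LORA_RATE_PER_MP = 0.0085      # Z-Image Turbo LoRA, from the table above
BASE_RATE_PER_MP = 0.005       # base Z-Image Turbo, from the table above
PROMPT_EXPANSION_FEE = 0.0025  # optional, per request


def megapixels(width: int, height: int) -> float:
    """Image size in megapixels, assuming 1 MP = 1,000,000 pixels."""
    return width * height / 1_000_000


def request_cost(
    width: int,
    height: int,
    num_images: int = 1,
    prompt_expansion: bool = False,
    rate_per_mp: float = LORA_RATE_PER_MP,
) -> float:
    """Estimated cost in USD of one request under the linear-billing assumption."""
    cost = megapixels(width, height) * rate_per_mp * num_images
    if prompt_expansion:
        cost += PROMPT_EXPANSION_FEE
    return cost


def generations_per_dollar(
    width: int, height: int, rate_per_mp: float = LORA_RATE_PER_MP
) -> float:
    """How many single images of the given size one dollar buys."""
    return 1.0 / (megapixels(width, height) * rate_per_mp)


# A 1000x1000 image is exactly 1 MP under the assumption above.
per_dollar = generations_per_dollar(1000, 1000)                # about 117.6
lora_overhead = LORA_RATE_PER_MP - BASE_RATE_PER_MP            # about 0.0035
batch_cost = request_cost(1000, 1000, num_images=4,
                          prompt_expansion=True)               # about 0.0365
```

The "~118 generations per $1.00" figure corresponds to images of exactly one megapixel. Under the same assumptions, larger images such as 1024x1024 (about 1.05 MP) yield slightly fewer generations per dollar.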

Technical Specifications

Spec	Details
Architecture	Z-Image Turbo (6B parameters) with LoRA inference
Input Formats	Text prompts, optional LoRA weights (up to 3)
Output Formats	JPEG, PNG, WebP
Image Sizes	Landscape 4:3, Portrait 3:4, Square 1:1
Acceleration Modes	None, Regular, High
Training	Z-Image Trainer for custom LoRA creation
License	Commercial use permitted

API Documentation | Quickstart Guide

How It Stacks Up

Z-Image Turbo – The base Z-Image Turbo endpoint runs at $0.005/MP for maximum cost efficiency when style customization isn't required. The LoRA variant adds $0.0035/MP overhead to enable custom style application, character consistency, and brand adaptation at inference time.

AuraFlow – Z-Image Turbo LoRA prioritizes cost efficiency at $0.0085/MP with 8-step inference and runtime style customization. AuraFlow emphasizes open-source flexibility and longer inference paths for applications requiring maximum creative control and community-driven development.

FLUX.2 [dev] LoRA – Z-Image Turbo LoRA delivers comparable style customization at $0.0085/MP versus FLUX.2 [dev] LoRA's $0.021/MP, making it roughly 2.5x cheaper per megapixel (0.021 / 0.0085 ≈ 2.47) for high-volume workflows. FLUX.2 [dev] LoRA offers higher resolution outputs and more sophisticated prompt interpretation for applications where output quality justifies the premium.
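To see what the per-megapixel rates quoted in this comparison mean at volume, the sketch below compares them for a fixed workload. The dictionary keys are labels chosen for this example, not endpoint IDs, and the calculation assumes one-megapixel images billed linearly. AuraFlow is omitted because no rate is quoted for it here.

```python
# Per-megapixel rates quoted in this comparison (USD).
RATES_PER_MP = {
    "z-image-turbo": 0.005,
    "z-image-turbo-lora": 0.0085,
    "flux-2-dev-lora": 0.021,
}


def volume_cost(model: str, images: int, megapixels_each: float = 1.0) -> float:
    """Estimated USD cost of generating `images` images with the given model."""
    return RATES_PER_MP[model] * images * megapixels_each


def cost_ratio(expensive: str, cheap: str) -> float:
    """How many times more the first model costs per megapixel than the second."""
    return RATES_PER_MP[expensive] / RATES_PER_MP[cheap]


ratio = cost_ratio("flux-2-dev-lora", "z-image-turbo-lora")  # about 2.47

# Estimated spend for 10,000 one-megapixel images with each model.
for name in RATES_PER_MP:
    print(f"{name}: ${volume_cost(name, 10_000):,.2f}")
```

At 10,000 one-megapixel images this works out to about $50 for the base model, $85 for the LoRA variant, and $210 for FLUX.2 [dev] LoRA, under the stated assumptions.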

Endpoint ID: fal-ai/z-image/turbo/lora