Z-Image Turbo Text to Image
Z-Image Turbo LoRA [text-to-image]
Tongyi-MAI's Z-Image Turbo with LoRA support delivers 6B-parameter text-to-image generation at $0.0085 per megapixel with 8-step inference. Apply up to 3 custom LoRA weights at inference time without retraining, enabling style consistency and brand adaptation at approximately 118 one-megapixel generations per dollar on fal.
Built for: Custom style application | Brand-consistent generation | Character consistency workflows | Rapid design iteration
LoRA Flexibility Without Training Overhead
Z-Image Turbo LoRA adds inference-time style customization to the base model's speed advantages. Apply pre-trained LoRA adapters directly through the API, combining multiple style influences in a single generation call without touching the underlying model weights.
What this means for you:
- Apply up to 3 LoRA weights simultaneously: Combine custom styles, character adapters, and brand guidelines in a single generation request through the `loras` parameter
- Batch generation at scale: Generate up to 4 images per request with configurable inference steps (1-8 range), optimizing the speed-quality tradeoff for your use case
- Acceleration options: Choose between "none", "regular", or "high" acceleration modes to balance generation speed against output fidelity
- Production-ready safety: Built-in safety checker (enabled by default) filters NSFW content automatically
- Optional prompt expansion: Expand prompts for enhanced detail at +$0.0025 per request
- Flexible output formats: Generate images in JPEG, PNG, or WebP with landscape (4:3), portrait (3:4), or square (1:1) aspect ratios
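The limits listed above can be checked client-side before a request is sent. The following is a minimal sketch, not the endpoint's actual schema: only the `loras` parameter name appears on this page. The keys `num_images`, `num_inference_steps`, `acceleration`, and `output_format`, the `path`/`scale` fields inside each LoRA entry, the default values, and the example URL are hypothetical placeholders chosen for illustration.

```python
# Limits taken from the feature list above.
MAX_LORAS = 3
MAX_IMAGES = 4
MIN_STEPS, MAX_STEPS = 1, 8
ACCELERATION_MODES = {"none", "regular", "high"}
OUTPUT_FORMATS = {"jpeg", "png", "webp"}


def build_arguments(prompt, loras=(), num_images=1, steps=8,
                    acceleration="none", output_format="png"):
    """Validate the documented limits and return a request payload dict.

    Only the `loras` key is named on this page; every other key is a
    hypothetical placeholder and may differ from the real schema.
    """
    loras = list(loras)
    if not prompt:
        raise ValueError("prompt must be a non-empty string")
    if len(loras) > MAX_LORAS:
        raise ValueError(f"at most {MAX_LORAS} LoRA weights per request")
    if not 1 <= num_images <= MAX_IMAGES:
        raise ValueError(f"num_images must be between 1 and {MAX_IMAGES}")
    if not MIN_STEPS <= steps <= MAX_STEPS:
        raise ValueError(f"steps must be between {MIN_STEPS} and {MAX_STEPS}")
    if acceleration not in ACCELERATION_MODES:
        raise ValueError(f"acceleration must be one of {sorted(ACCELERATION_MODES)}")
    if output_format not in OUTPUT_FORMATS:
        raise ValueError(f"output_format must be one of {sorted(OUTPUT_FORMATS)}")
    return {
        "prompt": prompt,
        "loras": loras,
        "num_images": num_images,
        "num_inference_steps": steps,
        "acceleration": acceleration,
        "output_format": output_format,
    }


# Example: one (hypothetical) brand-style LoRA, four variants in one request.
arguments = build_arguments(
    "a product photo of a ceramic mug, studio lighting",
    loras=[{"path": "https://example.com/brand-style.safetensors", "scale": 1.0}],
    num_images=4,
)
```

Validating locally like this fails fast on out-of-range values; the resulting dictionary would still need to be mapped onto the real request schema from the API documentation before being sent.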
Performance That Scales
Z-Image Turbo LoRA adds $0.0035/MP to base model pricing while enabling custom style application at inference time.
| Metric | Result | Context |
|---|---|---|
| Cost per Megapixel | $0.0085 | ~118 one-megapixel generations per $1.00 on fal |
| Inference Steps | 1-8 configurable | Default 8 steps balances speed and quality |
| Batch Capability | Up to 4 images | Single API call generates multiple variants |
| LoRA Support | Up to 3 concurrent weights | Apply custom styles without base model retraining |
| Base Model Cost | $0.005/MP | LoRA adds $0.0035/MP for style flexibility |
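The figures in the table can be reproduced with a few lines of arithmetic. This is a rough sketch under stated assumptions: billing is taken to be linear in pixel count with 1 MP = 1,000,000 pixels and no rounding, and the prompt expansion fee is applied once per request. The function names are illustrative and the actual billing rules may differ.

```python
LORA_PRICE_PER_MP = 0.0085     # Z-Image Turbo LoRA, USD per megapixel
BASE_PRICE_PER_MP = 0.005      # base Z-Image Turbo, USD per megapixel
PROMPT_EXPANSION_FEE = 0.0025  # USD per request, optional


def megapixels(width, height):
    """Image size in megapixels, assuming 1 MP = 1,000,000 pixels."""
    return width * height / 1_000_000


def request_cost(width, height, num_images=1,
                 price_per_mp=LORA_PRICE_PER_MP, prompt_expansion=False):
    """Estimated USD cost of one request (assumes linear, unrounded billing)."""
    cost = megapixels(width, height) * num_images * price_per_mp
    if prompt_expansion:
        cost += PROMPT_EXPANSION_FEE
    return cost


def generations_per_dollar(width, height, price_per_mp=LORA_PRICE_PER_MP):
    """How many single-image generations of this size one dollar buys."""
    return 1.0 / (megapixels(width, height) * price_per_mp)


# A 1000x1000 image is exactly 1 MP, which gives the ~118 figure in the table.
per_dollar = generations_per_dollar(1000, 1000)

# LoRA overhead relative to the base endpoint, in USD per megapixel.
lora_overhead = LORA_PRICE_PER_MP - BASE_PRICE_PER_MP
```

Under these assumptions, the "~118 generations" figure holds only for one-megapixel images; larger outputs buy proportionally fewer generations per dollar.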
Technical Specifications
| Spec | Details |
|---|---|
| Architecture | Z-Image Turbo (6B parameters) with LoRA inference |
| Input Formats | Text prompts, optional LoRA weights (up to 3) |
| Output Formats | JPEG, PNG, WebP |
| Image Sizes | Landscape 4:3, Portrait 3:4, Square 1:1 |
| Acceleration Modes | None, Regular, High |
| Training | Z-Image Trainer for custom LoRA creation |
| License | Commercial use permitted |
How It Stacks Up
Z-Image Turbo – The base Z-Image Turbo endpoint runs at $0.005/MP for maximum cost efficiency when style customization isn't required. The LoRA variant adds $0.0035/MP overhead to enable custom style application, character consistency, and brand adaptation at inference time.
AuraFlow – Z-Image Turbo LoRA prioritizes cost efficiency at $0.0085/MP with 8-step inference and runtime style customization. AuraFlow emphasizes open-source flexibility and longer inference paths for applications requiring maximum creative control and community-driven development.
FLUX.2 [dev] LoRA – Z-Image Turbo LoRA delivers comparable style customization at $0.0085/MP versus FLUX.2 [dev] LoRA's $0.021/MP, making it roughly 2.5x cheaper per megapixel for high-volume workflows. FLUX.2 [dev] LoRA offers higher resolution outputs and more sophisticated prompt interpretation for applications where output quality justifies the premium.
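The price comparison above can be checked directly. The sketch below uses only the per-megapixel prices quoted in this section (AuraFlow is omitted because no price is given for it) and assumes cost scales linearly with megapixels; the 10,000 MP monthly volume is a hypothetical workload, not a figure from this page.

```python
# USD per megapixel, as quoted in this section.
PRICES_PER_MP = {
    "Z-Image Turbo": 0.005,
    "Z-Image Turbo LoRA": 0.0085,
    "FLUX.2 [dev] LoRA": 0.021,
}

REFERENCE = "Z-Image Turbo LoRA"


def price_ratio(model, reference=REFERENCE):
    """Price of `model` relative to `reference` (>1 means more expensive)."""
    return PRICES_PER_MP[model] / PRICES_PER_MP[reference]


def volume_cost(model, total_megapixels):
    """Estimated USD cost for a given total volume, assuming linear pricing."""
    return PRICES_PER_MP[model] * total_megapixels


# FLUX.2 [dev] LoRA costs about 2.47x as much per megapixel.
flux_ratio = price_ratio("FLUX.2 [dev] LoRA")

# Hypothetical workload: 10,000 megapixels per month.
monthly = {name: volume_cost(name, 10_000) for name in PRICES_PER_MP}
for name, cost in monthly.items():
    print(f"{name}: ${cost:.2f}")
```

The exact ratio is about 2.47, which the comparison rounds to 2.5x.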
