
Fooocus Text to Image

fal-ai/fooocus
Default parameters with automated optimizations and quality improvements.

Fooocus | [text-to-image]

Fooocus delivers automated SDXL optimization with built-in quality enhancements. Its intelligent defaults target developers who need production-ready outputs without extensive prompt engineering, and it suits rapid iteration cycles where consistent quality matters more than granular control.

Use Cases: Marketing Asset Generation | Product Visualization | Concept Art Development


Performance

During the preview period Fooocus is priced at $0 per image, which removes cost as a barrier for high-volume image generation workflows, while its automated optimization layers maintain SDXL-level quality.

| Metric | Result | Context |
| --- | --- | --- |
| Resolution | Up to 1024x1024 | Configurable aspect ratios with 8-pixel alignment requirement |
| Batch Generation | 1-4 images per request | Parallel generation within single API call |
| Cost per Image | $0 | Preview pricing, production rates TBD |
| Performance Modes | 4 speed tiers | Extreme Speed, Lightning, Speed, Quality presets |
| Related Endpoints | Image Prompt, Upscale or Vary | Reference-guided generation and resolution enhancement variants |
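
The alignment, batch, and preset limits in the table above can be checked client-side before a request is sent. The following is a minimal sketch under stated assumptions: `snap_to_multiple_of_8` and `check_request_limits` are hypothetical helper names, not part of any fal library, and rounding a dimension down (instead of rejecting it) is only one possible policy.

```python
# Preset names as listed in the Performance table.
PERFORMANCE_MODES = ("Extreme Speed", "Lightning", "Speed", "Quality")


def snap_to_multiple_of_8(value: int) -> int:
    """Round a pixel dimension down to the nearest multiple of 8."""
    if value < 8:
        raise ValueError(f"dimension must be at least 8 pixels, got {value}")
    return value - (value % 8)


def check_request_limits(width: int, height: int, num_images: int, performance: str) -> None:
    """Raise ValueError if a request breaks one of the documented limits."""
    if width % 8 or height % 8:
        raise ValueError(f"{width}x{height} is not aligned to 8 pixels")
    if not 1 <= num_images <= 4:
        raise ValueError(f"num_images must be between 1 and 4, got {num_images}")
    if performance not in PERFORMANCE_MODES:
        raise ValueError(f"unknown performance mode: {performance!r}")


if __name__ == "__main__":
    width = snap_to_multiple_of_8(1023)
    height = snap_to_multiple_of_8(768)
    check_request_limits(width, height, num_images=2, performance="Speed")
    print(f"validated request size: {width}x{height}")
```

Validating locally keeps malformed requests from consuming an API call; the service remains the authority on what it actually accepts.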

Automated Optimization Without Parameter Overhead

Fooocus wraps the SDXL architecture with pre-configured enhancement layers, eliminating the parameter tuning typically required for production-quality outputs. Where standard SDXL implementations require manual CFG scale adjustment, negative prompt crafting, and style preset selection, Fooocus ships with battle-tested defaults.

What this means for you:

  • Intelligent Style Layering: Fooocus Enhance + V2 + Sharp styles stack automatically, delivering detail preservation without manual LoRA weight balancing

  • Control Image Integration: Four control modes (ImagePrompt, PyraCanny, CPDS, FaceSwap) are selected via a single `control_type` parameter, with no separate preprocessing pipelines required

  • LoRA Merging: Up to 5 LoRA models combine in a single request with weight control, replacing multi-step generation workflows

  • Performance Scaling: Four speed presets (Extreme Speed through Quality) trade inference time for detail density based on use case priority
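
The options listed above could be assembled into a single request payload along the following lines. This is a sketch, not the documented schema: only `control_type` is named on this page, so the other field names (`prompt`, `performance`, `loras`, `path`, `scale`, `control_image_url`) and the helper `build_fooocus_arguments` are assumptions made for illustration and should be checked against the endpoint's API reference.

```python
from __future__ import annotations

from typing import Any

CONTROL_TYPES = ("ImagePrompt", "PyraCanny", "CPDS", "FaceSwap")
MAX_LORAS = 5  # "Up to 5 LoRA models" per the list above


def build_fooocus_arguments(
    prompt: str,
    performance: str = "Speed",
    loras: list[tuple[str, float]] | None = None,
    control_type: str | None = None,
    control_image_url: str | None = None,
) -> dict[str, Any]:
    """Build a request payload; field names other than control_type are assumed."""
    loras = loras or []
    if len(loras) > MAX_LORAS:
        raise ValueError(f"at most {MAX_LORAS} LoRAs per request, got {len(loras)}")

    arguments: dict[str, Any] = {"prompt": prompt, "performance": performance}
    if loras:
        # Each LoRA is a (URL or path, weight) pair.
        arguments["loras"] = [{"path": path, "scale": scale} for path, scale in loras]

    if control_type is not None:
        if control_type not in CONTROL_TYPES:
            raise ValueError(f"unknown control_type: {control_type!r}")
        if not control_image_url:
            raise ValueError("a control image URL is required when control_type is set")
        arguments["control_type"] = control_type
        arguments["control_image_url"] = control_image_url
    return arguments


if __name__ == "__main__":
    print(build_fooocus_arguments(
        "studio photo of a ceramic mug",
        performance="Quality",
        loras=[("https://example.com/style.safetensors", 0.6)],
    ))
```

Style stacking is deliberately left out of the payload, since the page describes it as applied automatically.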


Technical Specifications

| Spec | Details |
| --- | --- |
| Architecture | Stable Diffusion XL |
| Input Formats | Text prompts, control images (URL), LoRA models (up to 5) |
| Output Formats | PNG, JPEG, WebP |
| Resolution Range | Custom dimensions (8-pixel multiples) with 1024x1024 default |
| License | Commercial use enabled |



How It Stacks Up

Fooocus Image Prompt – The Image Prompt variant extends base Fooocus with reference image conditioning for style transfer workflows. Base Fooocus handles pure text-to-image generation where reference images aren't required, eliminating the image preprocessing step.

Fooocus Upscale or Vary – The Upscale or Vary endpoint trades generation flexibility for resolution enhancement and variation control. Base Fooocus remains ideal for initial generation where upscaling isn't part of the immediate workflow, reducing API call overhead.
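
The choice between the three endpoints can be expressed as a small routing helper. This is an illustrative sketch: `fal-ai/fooocus` is the ID shown on this page, while the two variant IDs and the helper name `choose_endpoint` are assumptions that should be confirmed against the variants' own pages.

```python
BASE_ENDPOINT = "fal-ai/fooocus"  # ID shown on this page

# Assumed IDs for the related endpoints; verify before use.
IMAGE_PROMPT_ENDPOINT = "fal-ai/fooocus/image-prompt"
UPSCALE_OR_VARY_ENDPOINT = "fal-ai/fooocus/upscale-or-vary"


def choose_endpoint(has_reference_image: bool = False, has_existing_output: bool = False) -> str:
    """Pick an endpoint based on what the caller already has in hand."""
    if has_existing_output:
        # Upscaling or varying starts from an already generated image.
        return UPSCALE_OR_VARY_ENDPOINT
    if has_reference_image:
        # Reference-guided generation conditions on a supplied image.
        return IMAGE_PROMPT_ENDPOINT
    # Pure text-to-image needs no image input at all.
    return BASE_ENDPOINT


if __name__ == "__main__":
    print(choose_endpoint())
    print(choose_endpoint(has_reference_image=True))
```

Defaulting to the base endpoint reflects the comparison above: it is the path with no image preprocessing and no extra API calls.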