Run the latest models all in one Sandbox 🏖️

Birefnet Background Removal Image to Image

fal-ai/birefnet
bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS)
Inference
Commercial use

Input

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

Your request will cost $0 per compute second.

Logs

BiRefNet | [image-to-image]

BiRefNet's bilateral reference framework delivers high-resolution dichotomous image segmentation with precision mask generation. Trading traditional single-pass segmentation for a dual-reference architecture, it achieves cleaner edge detection and handles complex foreground-background separation. Purpose-built for production workflows requiring pixel-perfect transparency extraction from product photos, portraits, and complex scenes.

Use Cases: E-commerce Product Photography | Portrait Editing | Design Asset Preparation


Performance

BiRefNet operates at production-ready speeds with three specialized model variants optimized for different accuracy-speed tradeoffs, processing images up to 2048x2048 resolution with optional mask output for downstream compositing workflows.

MetricResultContext
Operating ResolutionUp to 2048x20484 megapixels max for high-fidelity edge detection
Model Variants3 specialized modelsLight (fast), Heavy (accurate), Portrait (optimized)
Cost per Inference$0 per compute secondPay only for actual processing time
Output FormatsPNG, WebP, GIFTransparency-preserving formats with optional mask export
Related EndpointsBiRefNet v2Enhanced accuracy variant for demanding segmentation tasks

Precision Segmentation Architecture

BiRefNet's bilateral reference framework processes images through parallel pathways, one analyzing global context, the other focusing on local detail, then synthesizes both for edge-accurate mask generation. This contrasts with standard single-encoder approaches that struggle with fine details like hair strands or transparent objects.

What this means for you:

  • Three-tier model selection: Choose Light (BiRefNet-DIS_ep580) for speed, Heavy (BiRefNet-massive-epoch_240) for complex scenes, or Portrait (BiRefNet-portrait-TR_P3M_10k-epoch_120) for face-optimized segmentation based on your accuracy requirements

  • Scalable resolution processing: Operate at 1024x1024 for standard workflows or 2048x2048 (4MP) for high-resolution source images requiring maximum edge fidelity

  • Optional foreground refinement: Enable `refine_foreground` to apply mask-guided enhancement that preserves subject detail while ensuring clean transparency

  • Dual output capability: Export both the background-removed image and the raw segmentation mask for manual compositing or downstream processing pipelines


Technical Specifications

SpecDetails
ArchitectureBiRefNet Bilateral Reference Framework
Input FormatsJPEG, PNG, WebP, GIF, AVIF via URL
Output FormatsPNG (default), WebP, GIF with alpha channel
Operating Resolutions1024x1024, 2048x2048
LicenseCommercial use permitted

API Documentation | Quickstart Guide | Enterprise Pricing


How It Stacks Up

BiRefNet v2 – BiRefNet v1 provides the core bilateral reference architecture with proven segmentation accuracy for general use cases. BiRefNet v2 builds on this foundation with enhanced edge detection refinement for challenging scenarios like fine hair detail or semi-transparent objects where the original model may struggle.