Longcat Image Image to Image

fal-ai/longcat-image/edit
LongCat image Edit is a 6B parameter image editing model excelling at multilingual text rendering, photorealism and deployment efficiency.
Inference
Commercial use

Input

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

Your request will cost $0.15 per megapixel.

Logs

Longcat Image | [image-to-image]

Longcat Image Edit is a 6B parameter image editing model delivering natural language transformations at $0.15 per megapixel. Trading raw generation speed for context-aware editing precision, it handles multilingual text rendering and photorealistic modifications without requiring masks or layers. Built for developers who need semantic understanding in their editing pipeline, not just pixel manipulation.

Built for: Natural Language Photo Edits | Multilingual Text Overlays | Context-Aware Image Transformations


Performance That Scales

At $0.15 per megapixel, Longcat Image positions between budget batch processors and premium editing APIs, prioritizing semantic understanding over raw throughput.

MetricResultContext
Parameter Count6BMid-size architecture balancing quality and inference cost
Inference Steps1-50 (default 28)Configurable quality/speed tradeoff via API parameters
Cost per Edit$0.15/megapixelScales with output resolution, not complexity
Batch Capacity1-4 imagesParallel processing for workflow efficiency
Acceleration Modesnone/regular/highHardware optimization tiers for latency-sensitive applications

Instruction-First Editing Without Masking

Longcat Image processes natural language editing commands directly against reference images, eliminating the traditional mask-layer-apply workflow. The 6B parameter architecture interprets spatial relationships and contextual intent, where "add elegant cursive text with lightning streaks at the top" executes as a single atomic operation.

What this means for you:

  • Multilingual text rendering: Handles non-Latin scripts and complex typography without font management overhead
  • Photorealistic integration: Edits respect depth, perspective, and lighting from the source image automatically
  • Flexible acceleration: Three hardware tiers (none/regular/high) let you trade latency for cost based on workload priority
  • Output format control: JPEG/PNG/WebP export with configurable safety filtering for production pipelines

Technical Specifications

SpecDetails
ArchitectureLongcat Image Edit 6B
Input FormatsPNG, JPEG, WebP, GIF, AVIF via URL
Output FormatsJPEG, PNG, WebP
Resolution RangeVariable (priced per megapixel)
LicenseCommercial use permitted

API Documentation | Quickstart Guide | Pricing


How It Stacks Up

FASHN Virtual Try-On V1.5 Image to Image – Longcat Image prioritizes general-purpose editing through natural language commands at $0.15/megapixel. FASHN specializes in garment-specific transformations with body-aware fitting for fashion and e-commerce workflows where clothing visualization accuracy justifies specialized architecture.