
Nano Banana Pro Image to Image

fal-ai/nano-banana-pro/edit
Nano Banana Pro (a.k.a. Nano Banana 2) is Google's new state-of-the-art image generation and editing model.


Your request will cost $0.15 per image, so $1.00 covers 6 full runs of this model (6 × $0.15 = $0.90). 4K outputs are charged at double the standard rate. If web search is used, an additional $0.015 is charged. Note: Pricing may change in the future.
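
The pricing rules above reduce to simple arithmetic. The following is a minimal estimator sketch, assuming the web-search surcharge applies once per request (the note does not say whether it is per request or per image). The function and constant names are illustrative and are not part of any fal API.

```python
from decimal import Decimal

# Figures taken from the pricing note; all names here are illustrative.
BASE_PRICE_PER_IMAGE = Decimal("0.15")
FOUR_K_MULTIPLIER = Decimal("2")          # 4K outputs are charged at double rate
WEB_SEARCH_SURCHARGE = Decimal("0.015")   # assumption: charged once per request


def estimate_cost(num_images: int, resolution: str = "1K",
                  web_search: bool = False) -> Decimal:
    """Return the estimated charge in USD for a single request."""
    if num_images < 1:
        raise ValueError("num_images must be at least 1")
    per_image = BASE_PRICE_PER_IMAGE
    if resolution == "4K":
        per_image *= FOUR_K_MULTIPLIER
    total = per_image * num_images
    if web_search:
        total += WEB_SEARCH_SURCHARGE
    return total
```

Under these assumptions, `estimate_cost(4, "4K")` evaluates to `Decimal("1.20")`. Decimal arithmetic is used so that small per-image prices do not pick up floating-point rounding noise.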


Nano Banana 2 [image-editing]

Google's Gemini 3 Pro Image architecture transforms existing visuals at $0.15 per edit, bringing advanced reasoning and multimodal understanding to pixel-level manipulation. It trades raw speed for semantic awareness and studio-quality output. Complex editing instructions such as "make the sunset more dramatic while preserving the original mood" need no masks and no layers: natural language alone directs precise transformations, with enhanced text rendering and character consistency.

Built for: Product iteration workflows | Creative asset refinement | Context-aware photo editing | Multi-image composition

Semantic Editing Without Masks

Nano Banana 2 (aka Nano Banana Pro and officially Gemini 3 Pro Image) applies Google's advanced reasoning foundation model to image editing, understanding relationships between objects, lighting, and composition rather than treating pixels as isolated data points. This represents a significant upgrade from the original Nano Banana (Gemini 2.5 Flash Image), with enhanced capabilities for complex compositions and text rendering.

What this means for you:

  • Natural language precision: "Change the car color to midnight blue while maintaining reflections" executes without manual selection. The model understands what "the car" means in context and preserves scene coherence.
  • Composition-aware transforms: Edits respect depth, perspective, and lighting automatically. No manual masking of shadows or reflections is required.
  • Batch processing capability: Generate up to 4 variations simultaneously to explore creative directions.
  • Reference image support: Provide up to 14 reference images in a single request for style guidance or target aesthetics. The model interprets visual intent alongside text instructions.
  • Character consistency: Maintain resemblance and consistency for up to 5 people across edits.
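
The limits in the list above (a required prompt, 1 to 14 input images, 1 to 4 variations) can be checked before a request is sent. The following is a minimal sketch of such a check. The payload keys `prompt`, `image_urls`, and `num_images` are assumed field names chosen for illustration, since this page does not name the parameters; confirm them against the API documentation before use.

```python
from typing import Any

MAX_INPUT_IMAGES = 14   # "up to 14 images" per the feature list
MAX_VARIATIONS = 4      # "up to 4 variations" per the feature list


def build_edit_request(prompt: str, image_urls: list[str],
                       num_images: int = 1) -> dict[str, Any]:
    """Validate inputs and return a request payload as a plain dict.

    The dictionary keys are assumptions, not confirmed API field names.
    """
    if not prompt.strip():
        raise ValueError("a text prompt is required")
    if not 1 <= len(image_urls) <= MAX_INPUT_IMAGES:
        raise ValueError(f"provide between 1 and {MAX_INPUT_IMAGES} image URLs")
    if not 1 <= num_images <= MAX_VARIATIONS:
        raise ValueError(f"num_images must be between 1 and {MAX_VARIATIONS}")
    return {
        "prompt": prompt.strip(),
        "image_urls": list(image_urls),
        "num_images": num_images,
    }
```

The returned dictionary could then be handed to whichever HTTP or SDK client you use. Failing early on an oversized batch avoids paying for a request that the service would reject or truncate.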

Performance Optimized for Quality

Built on Google's Gemini 3 Pro architecture, prioritizing quality and reasoning depth over raw speed.

Metric | Result | Context
Generation Philosophy | Quality-first | Prioritizes complex compositions and accuracy over speed metrics
Cost per Image | $0.15 | ~7 edits per $1.00 on fal.ai; 4K outputs charged at 2x rate
Resolution Options | 1K, 2K, 4K | Configurable via API; higher resolutions increase token usage
Batch Size | 1-4 images | Via parameter
Architecture | Gemini 3 Pro Image | Multimodal foundation model with advanced reasoning

Note: Generation times are not publicly benchmarked; the model is optimized for quality rather than speed.


Technical Specifications

Spec | Details
Architecture | Gemini 3 Pro Image (Nano Banana 2)
Model Identifier |
Input Formats | Image URLs (required) + text prompt (required)
Output Formats | PNG, JPEG, WebP (configurable)
Resolution Options | 1K (1024px), 2K (2048px), 4K (higher cost)
Aspect Ratios | Auto, 21:9, 16:9, 3:2, 4:3, 5:4, 1:1, 4:5, 3:4, 2:3, 9:16
Multi-Image Support | Combine up to 14 images in single composition
Character Consistency | Maintains resemblance for up to 5 people
Watermarking | SynthID digital watermarking on all outputs
License | Commercial use permitted
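
The output options in the specifications above form small closed sets, so they are easy to validate client-side. The following is a minimal sketch, assuming lowercase format names, an `auto` aspect-ratio value, and the key names `aspect_ratio`, `output_format`, and `resolution`. These spellings and the default values are assumptions for illustration, not confirmed API details.

```python
# Allowed values transcribed from the specifications; spellings are assumed.
ASPECT_RATIOS = {"auto", "21:9", "16:9", "3:2", "4:3", "5:4",
                 "1:1", "4:5", "3:4", "2:3", "9:16"}
OUTPUT_FORMATS = {"png", "jpeg", "webp"}
RESOLUTIONS = {"1K", "2K", "4K"}


def normalize_output_settings(aspect_ratio: str = "auto",
                              output_format: str = "png",
                              resolution: str = "1K") -> dict[str, str]:
    """Normalize casing and reject values outside the documented sets."""
    ratio = aspect_ratio.strip().lower()
    fmt = output_format.strip().lower()
    res = resolution.strip().upper()
    if ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio!r}")
    if fmt not in OUTPUT_FORMATS:
        raise ValueError(f"unsupported output format: {output_format!r}")
    if res not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution!r}")
    return {"aspect_ratio": ratio, "output_format": fmt, "resolution": res}
```

For instance, `normalize_output_settings("16:9", "WebP", "2k")` returns the three settings with normalized casing, while a ratio such as `"7:5"` raises `ValueError`.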



How It Stacks Up

Compare Nano Banana 2 with:

FLUX.1 [dev] - Nano Banana 2 prioritizes semantic understanding through Gemini 3 Pro's reasoning architecture, making it ideal for complex transformations described in natural language without manual masking. FLUX.1 [dev] emphasizes maximum resolution control and fine detail preservation for precision editing workflows requiring technical control.

Stable Diffusion 3.5 - Nano Banana 2 leverages Google's production-scale multimodal training and advanced reasoning for context-aware edits that understand object relationships, composition, and maintain character consistency. Stable Diffusion 3.5 offers open-source flexibility for custom fine-tuning and local deployment in specialized editing pipelines.

DALL-E 3 - Nano Banana 2 provides superior text rendering capabilities and multi-image composition (up to 14 images) with professional creative controls. DALL-E 3 prioritizes safety filtering and artistic coherence for consumer-facing creative applications with stricter content guidelines.

Original Nano Banana (Gemini 2.5 Flash Image) - Nano Banana 2 trades speed for quality, offering enhanced reasoning, superior text rendering, better character consistency, professional-grade creative controls, and advanced composition capabilities at higher cost ($0.15 vs $0.039). Original Nano Banana remains available for rapid iterations and simple edits.
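
To make the price gap in the last comparison concrete, the following sketch compares batch costs using the two per-image prices quoted above. The dictionary keys are illustrative labels, not model identifiers, and the calculation assumes standard-rate outputs with no surcharges.

```python
from decimal import Decimal

# Per-image prices quoted in the comparison; keys are illustrative labels.
PRICE_PER_IMAGE = {
    "nano-banana-2": Decimal("0.15"),
    "nano-banana-original": Decimal("0.039"),
}


def batch_cost(model: str, num_images: int) -> Decimal:
    """Return the cost in USD of num_images at the model's standard rate."""
    return PRICE_PER_IMAGE[model] * num_images


def price_ratio() -> Decimal:
    """Return how many times more Nano Banana 2 costs, to two decimals."""
    ratio = PRICE_PER_IMAGE["nano-banana-2"] / PRICE_PER_IMAGE["nano-banana-original"]
    return ratio.quantize(Decimal("0.01"))
```

At these prices, 100 edits cost $15.00 with Nano Banana 2 versus $3.90 with the original, a ratio of roughly 3.85 to 1.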