Gemini 3 Pro Image Preview Image to Image
Input
Hint: Drag and drop files from your computer, images from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL.
Customize your input with more control.
Result
What would you like to do next?
Your request will cost $0.15 per image. For $1.00, you can run this model 7 times. 4K outputs will be charged at double the standard rate. Note: Pricing may change in the future.
Logs
Nano Banana Pro Edit (preview) | [image-to-image]
Google's Nano Banana Pro Edit transforms existing visuals through natural language commands at $0.15 per edit, processing up to 2 reference images simultaneously. Trading speed for semantic understanding, this model interprets complex editing instructions without requiring masks or layers. You describe the change, it understands context and executes across multiple inputs. Ideal for teams needing sophisticated edits through conversational prompts rather than technical tooling.
Use Cases: Multi-image context editing | Natural language photo manipulation | Reference-guided transformations
Performance
At $0.15 per edit, Nano Banana Pro Edit sits at the premium end of fal's image editing spectrum, trading cost efficiency for advanced reasoning capabilities that interpret nuanced editing instructions across multiple reference images.
| Metric | Result | Context |
|---|---|---|
| Resolution Range | 1K to 4K | 4K outputs charged at 2x standard rate ($0.30) |
| Multi-Image Support | Up to 2 reference images | Enables context-aware edits across multiple visual inputs |
| Cost per Edit | $0.15 | 7 edits per $1.00 on fal |
| Output Formats | JPEG, PNG, WebP | Flexible export for web and production workflows |
| Related Endpoints | Gemini 2.5 Flash Image Edit, Gemini Flash Edit Multi | Speed-optimized vs multi-image variants |
Context-Aware Editing Without Technical Overhead
Unlike traditional image editors requiring masks, layers, or precise selection tools, Nano Banana Pro Edit interprets natural language instructions and applies them contextually across your provided reference images. The model leverages Gemini 3 Pro's reasoning architecture to understand spatial relationships, object boundaries, and semantic intent.
What this means for you:
-
Multi-image reasoning: Process up to 2 reference images in a single request—the model understands relationships between inputs and applies edits coherently across the visual context
-
Natural language precision: Describe complex transformations conversationally ("make a photo of the man driving the car down the california coastline") without technical syntax or masking workflows
-
Resolution flexibility: Generate edits from 1K for rapid iteration up to 4K (2048x2048, up to 4 megapixels) for production-ready outputs, with transparent 2x pricing at higher resolutions
-
Optional web context: Enable real-time web search during generation to incorporate current visual references or styling trends directly into edits
Technical Specifications
| Spec | Details |
|---|---|
| Architecture | Gemini 3 Pro Image |
| Input Formats | JPEG, PNG, WebP via URL (2 images max) |
| Output Formats | JPEG, PNG, WebP |
| Resolution Options | 1K, 2K, 4K (up to 2048x2048) |
| License | Commercial use via fal partnership |
API Documentation | Quickstart Guide | Enterprise Pricing
How It Stacks Up
Gemini 2.5 Flash Image Edit ($0.039) – Nano Banana Pro Edit trades speed and cost efficiency (4x more expensive at $0.15 vs $0.039) for enhanced semantic reasoning and multi-image context understanding. Gemini 2.5 Flash remains ideal for high-volume simple edits where iteration speed and cost per edit matter more than complex instruction interpretation.
Gemini Flash Edit Multi ($0.039) – Nano Banana Pro Edit prioritizes advanced reasoning through Gemini 3's architecture for nuanced natural language instructions at $0.15 per edit. Flash Edit Multi offers comparable multi-image support at $0.039 (4x more cost-effective) for workflows prioritizing throughput over instruction complexity.


