Lumina Image 2 Text to Image

fal-ai/lumina-image/v2
Lumina-Image-2.0 is a 2 billion parameter flow-based diffusion transformer that features improved image quality, typography, complex prompt understanding, and resource efficiency.

Your request will cost $0.075 per megapixel.

Lumina Image 2.0 - Advanced Flow-Based Text-to-Image Generation

Transform your creative vision into stunning images with Lumina Image 2.0, a powerful 2 billion parameter flow-based diffusion transformer built for developers who need reliable, high-quality results at scale.

Overview

Lumina Image 2.0 is a text-to-image generation model with improved image quality, typography, complex prompt understanding, and resource efficiency. Whether you're building a creative tool or a content generation platform, or enhancing your application's visual features, Lumina Image provides a foundation through its unified framework approach.

Key Features

Transform natural language descriptions into detailed, high-quality images with advanced controls:

  • 2 billion parameter flow-based diffusion transformer architecture
  • Enhanced typography and text rendering capabilities
  • Commercial usage rights included (Apache 2.0 license)
  • Support for resolutions up to 1024x1024
  • RESTful API with comprehensive SDKs
  • Streaming support for real-time generation feedback
Getting Started

Getting up and running with Lumina Image takes just a few minutes. Here's how to begin:

  1. Install the SDK for your preferred language:

For JavaScript/TypeScript:


For Python:


  2. Configure your authentication:
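
A minimal sketch of this step in Python, assuming the client library reads your API key from an environment variable named FAL_KEY (treat that name as an assumption and confirm it in the API documentation). The helper require_fal_key is hypothetical; it only checks that a key is present before any request is made.

```python
import os


def require_fal_key(env=None):
    """Return the API key from the environment, or raise a clear error.

    FAL_KEY is assumed to be the variable the client library reads.
    `env` can be any mapping, which makes the check easy to test.
    """
    env = os.environ if env is None else env
    key = env.get("FAL_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "FAL_KEY is not set. Export it in your shell, for example: "
            "export FAL_KEY=your-api-key"
        )
    return key
```

Calling require_fal_key() once at startup lets a missing key fail fast instead of surfacing later as an authentication error.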

  3. Generate your first image:
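
A first call might look like the sketch below. It assumes the Python client is the fal-client package (pip install fal-client), that its subscribe function submits a request and waits for the result, and that the response contains an "images" list whose entries carry a "url". The helper names generate_image and first_image_url are illustrative only.

```python
MODEL_ID = "fal-ai/lumina-image/v2"


def generate_image(prompt, client=None):
    """Submit a prompt and return the raw response dictionary.

    `client` is any object with a `subscribe(app_id, arguments=...)` method.
    When omitted, the `fal_client` package is imported lazily, so it must be
    installed and the API key must be configured in the environment.
    """
    if client is None:
        import fal_client as client  # assumed: pip install fal-client
    return client.subscribe(MODEL_ID, arguments={"prompt": prompt})


def first_image_url(response):
    """Extract the first image URL, assuming {"images": [{"url": ...}]}."""
    return response["images"][0]["url"]
```

With a key configured, first_image_url(generate_image("a lighthouse at dusk, oil painting")) would return a link to the generated image, provided the response has the assumed shape.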

Integration Examples

Here's a practical example of integrating Lumina Image into a web application:
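
One way this could look is a small, framework-agnostic request handler in Python. Everything here is a sketch: handle_generate is a hypothetical name, the request body is assumed to be JSON with a "prompt" field, and generate stands in for whatever function calls the model and returns an image URL.

```python
import json


def handle_generate(body, generate):
    """Handle a POST body like {"prompt": "..."} and return (status, payload).

    `generate` is any callable that takes a prompt and returns an image URL;
    in a real app it would wrap the call to the model endpoint.
    """
    try:
        data = json.loads(body)
    except (TypeError, ValueError):
        return 400, {"error": "Request body must be valid JSON."}
    prompt = data.get("prompt") if isinstance(data, dict) else None
    if not isinstance(prompt, str) or not prompt.strip():
        return 400, {"error": "A non-empty 'prompt' string is required."}
    try:
        url = generate(prompt.strip())
    except Exception as exc:  # report upstream failures as a gateway error
        return 502, {"error": f"Image generation failed: {exc}"}
    return 200, {"image_url": url}
```

Because the handler returns a plain status code and dictionary, it can be plugged into most web frameworks with a thin adapter, and keeping the API key on the server side avoids exposing it to the browser.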


API Parameters
  • Text description of the image to generate (required)
  • Output image height (default: 1024)
  • Output image width (default: 1024)
  • Strength of prompt adherence (default: 4.0)
  • Number of denoising steps (default: 50)
  • CFG truncation ratio (default: 0.25)
  • Enable CFG normalization (default: true)
  • Random seed for reproducible results
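
The settings above can be collected in one place and validated before a request is sent. In the sketch below the field names (prompt, height, width, guidance_scale, num_inference_steps, cfg_trunc_ratio, cfg_normalization, seed) are assumptions chosen to match the descriptions, so check them against the API reference; the defaults are the ones listed above, and the 1024 limit comes from the maximum supported resolution.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LuminaParams:
    # Field names are assumed; defaults follow the parameter list.
    prompt: str
    height: int = 1024
    width: int = 1024
    guidance_scale: float = 4.0
    num_inference_steps: int = 50
    cfg_trunc_ratio: float = 0.25
    cfg_normalization: bool = True
    seed: Optional[int] = None

    def to_arguments(self):
        """Validate the values and return a request dictionary."""
        if not self.prompt.strip():
            raise ValueError("prompt is required")
        if not (0 < self.height <= 1024 and 0 < self.width <= 1024):
            raise ValueError("height and width must be between 1 and 1024")
        if self.num_inference_steps < 1:
            raise ValueError("num_inference_steps must be at least 1")
        args = {
            "prompt": self.prompt,
            "height": self.height,
            "width": self.width,
            "guidance_scale": self.guidance_scale,
            "num_inference_steps": self.num_inference_steps,
            "cfg_trunc_ratio": self.cfg_trunc_ratio,
            "cfg_normalization": self.cfg_normalization,
        }
        if self.seed is not None:  # omit the seed to let the service pick one
            args["seed"] = self.seed
        return args
```

For example, LuminaParams(prompt="a red kite over a wheat field", seed=7).to_arguments() yields a dictionary with the defaults filled in and a fixed seed.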
Best Practices

Maximize the quality of your generated images by following these guidelines:

  • Write clear, detailed prompts that specify both content and style
  • Include artistic references when seeking specific visual outcomes
  • Use the Gemma-2-2B text encoder's capabilities for complex prompt understanding
  • Implement proper error handling and retry logic
  • Consider using streaming for better user experience
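
The retry guideline above can be sketched as a small helper with exponential backoff. call_with_retry is a hypothetical name, and retrying on every exception is a simplification; in practice you would likely retry only on transient failures such as timeouts or rate limits.

```python
import time


def call_with_retry(fn, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call fn() and retry on any exception with exponential backoff.

    Waits base_delay, 2*base_delay, 4*base_delay, ... between attempts.
    The last exception is re-raised once all attempts are used.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            if attempt == attempts - 1:
                raise
            sleep(base_delay * (2 ** attempt))
```

Wrapping a generation call as call_with_retry(lambda: generate(prompt)) keeps the retry policy separate from the request code, where generate is whatever function you use to call the model.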
Advanced Usage

For more complex use cases, Lumina Image supports advanced parameters:
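
As one illustration, the sketch below builds a set of requests that differ only in their seed, which makes it easy to compare variations of the same prompt reproducibly. The key names (prompt, guidance_scale, num_inference_steps, seed) are assumptions, and seed_sweep is a hypothetical helper.

```python
def seed_sweep(prompt, seeds, **overrides):
    """Build one argument dictionary per seed, sharing all other settings.

    Key names are assumed; `overrides` replaces any of the base settings.
    """
    base = {"prompt": prompt, "guidance_scale": 4.0, "num_inference_steps": 50}
    base.update(overrides)
    return [dict(base, seed=s) for s in seeds]
```

For instance, seed_sweep("a koi pond at dawn", [1, 2, 3], guidance_scale=6.0) returns three dictionaries that share a stronger prompt adherence setting and differ only in the seed.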


Technical Architecture

Core Components:

  • Model: 2B parameter Flow-based Large Diffusion Transformer (Flag-DiT)
  • Text Encoder: Gemma-2-2B for enhanced prompt understanding
  • VAE: FLUX-VAE-16CH variational autoencoder
  • Framework: Unified architecture treating text and image tokens jointly
Pricing
  • Cost: $0.075 per megapixel
  • Transparent usage-based pricing
  • No minimum commitments
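
To budget ahead of time, the per-megapixel rate can be turned into a rough estimate. This sketch assumes one megapixel means 1,000,000 pixels and that no rounding is applied; actual billing may round differently.

```python
PRICE_PER_MEGAPIXEL_USD = 0.075


def estimate_cost_usd(width, height, num_images=1):
    """Estimate cost, assuming 1 megapixel = 1,000,000 pixels, unrounded."""
    megapixels = (width * height) / 1_000_000
    return megapixels * PRICE_PER_MEGAPIXEL_USD * num_images
```

Under these assumptions a single 1024x1024 image is 1,048,576 pixels, or about 1.05 megapixels, which comes to roughly $0.079.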
Queue Management

For asynchronous processing:
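
A common pattern is to submit a request, keep its ID, and poll until it finishes. The helper below is a generic polling sketch with a timeout; wait_for_result is a hypothetical name, and the commented wiring to the fal_client package reflects an assumed API that you should verify against the client documentation.

```python
import time


def wait_for_result(is_done, fetch_result, interval=1.0, timeout=300.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Poll is_done() until it returns True, then return fetch_result().

    Raises TimeoutError if the request has not finished within `timeout`
    seconds. `sleep` and `clock` are injectable to simplify testing.
    """
    deadline = clock() + timeout
    while not is_done():
        if clock() >= deadline:
            raise TimeoutError("request did not complete in time")
        sleep(interval)
    return fetch_result()


# Possible wiring with the fal_client package (assumed API, not run here):
#   app = "fal-ai/lumina-image/v2"
#   handle = fal_client.submit(app, arguments={"prompt": "..."})
#   rid = handle.request_id
#   done = lambda: isinstance(fal_client.status(app, rid), fal_client.Completed)
#   result = wait_for_result(done, lambda: fal_client.result(app, rid))
```

Storing the request ID also lets a separate worker or a later page load pick up the result, so the user does not have to keep a connection open.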


Error Handling

Implement robust error handling to ensure a smooth user experience:
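
One possible shape for this is a wrapper that never raises and instead returns a small result object your UI can render. The names GenerationOutcome and safe_generate are hypothetical, and the response shape ({"images": [{"url": ...}]}) is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GenerationOutcome:
    ok: bool
    image_url: Optional[str] = None
    error: Optional[str] = None


def safe_generate(generate, prompt):
    """Run generate(prompt) and convert failures into a GenerationOutcome."""
    if not prompt or not prompt.strip():
        return GenerationOutcome(ok=False, error="Prompt must not be empty.")
    try:
        response = generate(prompt)
    except Exception as exc:  # network, auth, or service errors
        return GenerationOutcome(ok=False, error=f"Generation failed: {exc}")
    try:
        url = response["images"][0]["url"]  # assumed response shape
    except (KeyError, IndexError, TypeError):
        return GenerationOutcome(ok=False, error="Unexpected response format.")
    return GenerationOutcome(ok=True, image_url=url)
```

Separating the request failure from the parsing failure makes it easier to tell a service outage apart from a change in the response format.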


Technical Support

Our documentation is continuously updated with new examples and best practices. For additional support:

  • Visit our comprehensive API documentation
  • Join our developer community
  • Contact our technical support team
  • Monitor system status at status.fal.ai
Getting Started Today
  1. Sign up for a fal.ai account
  2. Generate your API key
  3. Install the SDK
  4. Make your first API call

Start building with Lumina Image 2.0 today and bring your creative vision to life through the power of advanced AI-generated imagery.