Lumina Image 2 Text to Image
Input
Customize your input with more control.
Logs
Readme
Lumina Image 2.0 - Advanced Flow-Based Text-to-Image Generation
Transform your creative vision into stunning images with Lumina Image 2.0, a powerful 2 billion parameter flow-based diffusion transformer built for developers who need reliable, high-quality results at scale.
Overview
Lumina Image 2.0 is a state-of-the-art text-to-image generation model that features improved performance in image quality, typography, complex prompt understanding, and resource-efficiency. Whether you're building a creative tool, content generation platform, or enhancing your application's visual features, Lumina Image provides the foundation you need with its unified framework approach.
Key Features
Transform natural language descriptions into detailed, high-quality images with advanced controls:
- 2 billion parameter flow-based diffusion transformer architecture
- Enhanced typography and text rendering capabilities
- Commercial usage rights included (Apache 2.0 license)
- Support for resolutions up to 1024x1024
- RESTful API with comprehensive SDKs
- Streaming support for real-time generation feedback
Getting Started
Getting up and running with Lumina Image takes just a few minutes. Here's how to begin:
- Install the SDK for your preferred language:
For JavaScript/TypeScript:
For Python:
- Configure your authentication:
- Generate your first image:
Integration Examples
Here's a practical example of integrating Lumina Image into a web application:
API Parameters
- (required): Text description of the image to generate
- : Output image height (default: 1024)
- : Output image width (default: 1024)
- : Strength of prompt adherence (default: 4.0)
- : Number of denoising steps (default: 50)
- : CFG truncation ratio (default: 0.25)
- : Enable CFG normalization (default: true)
- : Random seed for reproducible results
Best Practices
Maximize the quality of your generated images by following these guidelines:
- Write clear, detailed prompts that specify both content and style
- Include artistic references when seeking specific visual outcomes
- Use the Gemma-2-2B text encoder's capabilities for complex prompt understanding
- Implement proper error handling and retry logic
- Consider using streaming for better user experience
Advanced Usage
For more complex use cases, Lumina Image supports advanced parameters:
Technical Architecture
Core Components:
- Model: 2B parameter Flow-based Large Diffusion Transformer (Flag-DiT)
- Text Encoder: Gemma-2-2B for enhanced prompt understanding
- VAE: FLUX-VAE-16CH variational autoencoder
- Framework: Unified architecture treating text and image tokens jointly
Pricing
- Cost: $0.075 per megapixel
- Transparent usage-based pricing
- No minimum commitments
Queue Management
For asynchronous processing:
Error Handling
Implement robust error handling to ensure a smooth user experience:
Technical Support
Our documentation is continuously updated with new examples and best practices. For additional support:
- Visit our comprehensive API documentation
- Join our developer community
- Contact our technical support team
- Monitor system status at status.fal.ai
Getting Started Today
- Sign up for a fal.ai account
- Generate your API key
- Install the SDK
- Make your first API call
Start building with Lumina Image 2.0 today and bring your creative vision to life through the power of advanced AI-generated imagery.