fal-ai/qwen-image-2/pro/text-to-image
Qwen-Image-2.0 is a next-generation foundational unified generation-and-editing model
Inference
Commercial use
Partner
Input
Additional Settings
Customize your input with more control.
Qwen Image 2.0 Pro β Text to Image
Qwen Image 2.0 Pro is the highest-fidelity text-to-image endpoint from Alibaba's Qwen team. It generates images natively at up to 2048 x 2048 pixels with professional typography rendering, detailed compositions, and support for infographics, posters, and comics. Ranked #1 on AI Arena for text-to-image generation.
When to Use Pro vs Standard
Use the Pro endpoint for final production assets where detail, text accuracy, and composition quality matter most. For rapid iteration and prototyping, the standard endpoint runs faster at lower cost ($0.035/image vs $0.075/image).
Key Parameters
| Parameter | Default | Range | Notes |
|---|---|---|---|
| prompt | β | up to 1,000 tokens | Front-load your main subject for best results |
| negative_prompt | β | string | Elements to exclude (e.g. "blurry, watermark") |
| image_size | square | enum or custom | `square`, `square_hd`, `landscape_4_3`, `landscape_16_9`, `portrait_4_3`, `portrait_16_9` |
| guidance_scale | 5 | 0β20 | 5β7 for production; 8β10 for strict text rendering |
| num_inference_steps | 28 | 1β50 | 30+ recommended for Pro-quality output |
| seed | random | integer | Fix for reproducible results |
| num_images | 1 | 1β4 | Generate multiple variations per request |
| output_format | png | png / jpeg / webp | Choose based on your delivery needs |
Quick Start
pythonimport fal_client result = fal_client.subscribe( "fal-ai/qwen-image-2/pro/text-to-image", arguments={ "prompt": "Minimalist movie poster for 'ECHO STATION', bold sans-serif title at top, lone astronaut in alien desert, two moons, muted teal and burnt orange palette", "image_size": "portrait_4_3", "num_inference_steps": 35, "guidance_scale": 7, } ) print(result["images"][0]["url"])
javascriptimport { fal } from "@fal-ai/client"; const result = await fal.subscribe("fal-ai/qwen-image-2/pro/text-to-image", { input: { prompt: "Minimalist movie poster for 'ECHO STATION', bold sans-serif title at top, lone astronaut in alien desert, two moons, muted teal and burnt orange palette", image_size: "portrait_4_3", num_inference_steps: 35, guidance_scale: 7, }, }); console.log(result.data.images[0].url);
Prompting Tips
- Structure prompts as: Subject, style, details, composition, lighting. The model weights information by position, so put the most important elements first.
- For text-heavy images (posters, infographics, slides): increase
`guidance_scale`to 7β10 and`num_inference_steps`to 35+ for sharper text rendering. - Use negative prompts to suppress common artifacts:
`"blurry, low quality, distorted, deformed, oversaturated, watermark"`. - Specify aspect ratio to match your output context β
`landscape_16_9`for video thumbnails,`portrait_4_3`for mobile,`square_hd`for maximum detail.
Best Use Cases
- Infographics and data visualizations β the model can compose complex multi-section layouts with accurate text placement
- Movie posters and print design β professional typography with correct perspective on surfaces
- Product photography β editorial-quality renders with controlled lighting and composition
- Comics and storyboards β multi-panel layouts with consistent character rendering
- Social media assets β branded content with text overlays rendered directly in the image
Related Endpoints
- Text to Image (Standard) β faster, lower cost for iteration
- Image Editing (Pro) β edit existing images with natural language
- API Reference β full parameter documentation
