Alibaba logo

Alibaba

Creators of the Happy Horse, Qwen and Wan model families - open-weight image and video generation from Alibaba's Tongyi Lab

Operational
image-to-image
video-to-video
text-to-image
image-to-video

Models on fal

Happy Horse

4 variants available
Alibaba logo
alibaba/happy-horse/video-edit
Alibaba logo
alibaba/happy-horse/reference-to-video
Alibaba logo
alibaba/happy-horse/image-to-video
Alibaba logo
alibaba/happy-horse/text-to-video

Wan 2.7

8 variants available
Alibaba logo
fal-ai/wan/v2.7/pro/edit
Alibaba logo
fal-ai/wan/v2.7/edit
Alibaba logo
fal-ai/wan/v2.7/text-to-image
Alibaba logo
fal-ai/wan/v2.7/pro/text-to-image
Alibaba logo
fal-ai/wan/v2.7/text-to-video

Z-Image

14 variants available

Qwen Image

6 variants available

Wan Trainer

5 variants available

Wan 2.6

7 variants available

Qwen 3

5 variants available

Qwen

39 variants available

Wan

17 variants available

Wan 2.5

4 variants available

Wan 2.2

23 variants available

Wan 2.1

1 variant available

About

Alibaba's generative models come out of Tongyi Lab, the AI research division of Alibaba Cloud. The lab develops the Qwen family, the brand Alibaba has unified its large-model work under, alongside the Wan video models, originally launched as Tongyi Wanxiang. Many are released open-weight under permissive licenses, making them a popular foundation for developers and the open-source community.

On the visual side, Qwen-Image is a text-to-image foundation model known for state-of-the-art complex text rendering, while Qwen-Image-Edit brings precise, instruction-driven image editing.

In video, the Wan series spans text-to-video and image-to-video generation, with Wan 2.5 introducing native synchronized audio and later versions adding reference-to-video, multi-shot control, and flexible aspect ratios.

Notable model families on fal:

  • Happy Horse: text- and image-to-video, with native audio generation
  • Qwen-Image: text-to-image with leading text rendering
  • Qwen-Image-Edit: instruction-based image editing
  • Wan 2.1 / 2.2 / 2.5 / 2.6: text- and image-to-video, some with native audio
  • Wan-Animate: character animation driven by reference video

Try it in fal Sandbox

A breathtaking ancient monastery floating among massive clouds thousands of feet above the earth. Golden sunrise light pierces through the atmosphere, illuminating towering stone structures connected by elegant bridges suspended in the sky. Monks in flowing robes walk along the pathways while giant birds glide through the clouds below. Epic fantasy architecture, cinematic composition, volumetric lighting, god rays, immense scale, ultra-detailed stonework, dreamlike atmosphere, photorealistic rendering, masterpiece environment design, Unreal Engine 5 quality, 8K.

Alibaba logo
fal-ai/z-image/base
Alibaba logo
fal-ai/qwen-image-2512
+1

A classic red Ferrari parked along a winding cliffside road on Italy's Amalfi Coast during golden hour. The Mediterranean Sea sparkles beneath dramatic coastal cliffs while colorful villages cling to the mountainside in the distance. Warm sunlight reflects off the polished bodywork. Luxury travel meets automotive photography, cinematic composition, rich Mediterranean colors, photorealistic details, Vogue travel editorial aesthetic, ultra-sharp focus, depth and atmosphere, award-winning commercial photography, 8K masterpiece.

Alibaba logo
fal-ai/wan/v2.7/text-to-image
Alibaba logo
fal-ai/z-image/base
+1

fal Learn

LUX vs. Qwen Image: What's The Difference?

How to Use Qwen Image 2: Practical Tips for Better Images

Qwen Image 2512 Prompt Guide

Qwen Image Edit 2511 Developer Guide

Qwen Image Layered Developer Guide

Qwen Image Layered Trainer Developer Guide

Wan 2.6 Prompt Guide: Mastering Three Generation Modes

Wan 2.6 Developer Guide: Next-Generation Video Generation

fal Academy

Happy Horse 1.0 Takes the Lead! | API Tutorial

Introducing Wan 2.5 - Native Audio Generation!