Wan-2.1 Text-to-Video

fal-ai/wan-t2v
Wan-2.1 is a text-to-video model that generates videos with high visual quality and motion diversity from text prompts.
Inference
Commercial use

A generation takes approximately 1 minute.

Each video generation costs $0.20 at 480p resolution and $0.40 at 720p resolution, so $1 covers approximately 5 runs at 480p. Requests with more than the default number of frames cost 1.25x more.
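
The pricing rule above can be turned into a small estimator. This is only a sketch of the arithmetic: the function name is hypothetical, the page does not state the default frame count (81, the low end of the documented range, is assumed here), and no price is quoted for 580p, so that resolution is rejected rather than guessed.

```javascript
// Rates in cents, taken from the pricing text on this page. 580p is left out
// on purpose because no price is quoted for it.
const BASE_PRICE_CENTS = new Map([
  ["480p", 20],
  ["720p", 40],
]);

// ASSUMPTION: the default frame count is not stated on this page;
// 81 (the lowest value in the documented 81-100 range) is used here.
const ASSUMED_DEFAULT_FRAMES = 81;

// Hypothetical helper, not part of any fal SDK.
function estimateCostUsd(resolution, numFrames = ASSUMED_DEFAULT_FRAMES) {
  if (!BASE_PRICE_CENTS.has(resolution)) {
    throw new Error(`No published price for resolution "${resolution}"`);
  }
  // More than the default number of frames costs 1.25x.
  const multiplier = numFrames > ASSUMED_DEFAULT_FRAMES ? 1.25 : 1;
  // Work in cents and convert at the end to keep the arithmetic exact.
  return (BASE_PRICE_CENTS.get(resolution) * multiplier) / 100;
}

console.log(estimateCostUsd("480p"));      // 0.2
console.log(estimateCostUsd("720p", 100)); // 0.5
```

If the real default frame count differs from 81, only ASSUMED_DEFAULT_FRAMES needs to change.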


WAN-2.1 Text-to-Video Model

Transform text descriptions into dynamic video content with advanced AI

Overview

WAN-2.1 Text-to-Video is a powerful model that converts natural language descriptions into high-quality video sequences. Whether you're creating content for social media, marketing campaigns, or creative projects, this model helps you bring your ideas to life through video.

Key Capabilities

Transform text prompts into fluid video content with precise control over:

Video Generation

  • Create 5-6 second video clips from detailed text descriptions
  • Support for diverse styles from photorealistic to artistic
  • Consistent quality and coherent motion throughout sequences

Advanced Controls

  • Fine-tune video length (81-100 frames)
  • Adjust frame rate (5-24 FPS)
  • Control resolution (480p, 580p, or 720p)
  • Select aspect ratio (16:9 or 9:16)
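
Frame count and frame rate together determine clip length, which is worth checking before submitting a request. The sketch below is plain arithmetic with a hypothetical function name. The 16 FPS used in the example is an assumption, chosen because it reproduces the 5-6 second clip length quoted above; this page does not state the default frame rate.

```javascript
// Hypothetical helper: duration in seconds = frames / frames per second.
function clipDurationSeconds(numFrames, framesPerSecond) {
  if (!(numFrames > 0) || !(framesPerSecond > 0)) {
    throw new RangeError("numFrames and framesPerSecond must be positive");
  }
  return numFrames / framesPerSecond;
}

// ASSUMPTION: 16 FPS is an illustrative value, not a documented default.
console.log(clipDurationSeconds(81, 16));  // 5.0625
console.log(clipDurationSeconds(100, 16)); // 6.25
```

At the low end of the frame rate range the same frames stretch much further: 81 frames at 5 FPS is 16.2 seconds of playback.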

Getting Started

Setting up WAN-2.1 Text-to-Video takes just a few minutes. Here's how to begin:

  1. Install the SDK for your preferred language:
# JavaScript/TypeScript
npm install --save @fal-ai/client

# Python
pip install fal-client
  2. Configure your credentials:
import { fal } from "@fal-ai/client";

fal.config({
  credentials: "YOUR_FAL_KEY"
});
  3. Generate your first video:
const result = await fal.subscribe("fal-ai/wan-t2v", {
  input: {
    prompt: "A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage"
  }
});
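
Once the call resolves, the video URL and seed need to be pulled out of the response. The sketch below assumes the response has the shape { data: { video: { url }, seed }, requestId }. That shape is an assumption and is not confirmed on this page, so check it against the API documentation. The helper name is hypothetical, and the example runs against a hand-written object rather than a real response.

```javascript
// Hypothetical helper. ASSUMPTION: the response looks like
// { data: { video: { url: string }, seed: number }, requestId: string }.
function summarizeResult(result) {
  const data = result && result.data;
  if (!data || !data.video || typeof data.video.url !== "string") {
    throw new Error("Result does not contain a video URL in the assumed shape");
  }
  return { videoUrl: data.video.url, seed: data.seed };
}

// Hand-written stand-in for a real response, for illustration only.
const example = summarizeResult({
  data: { video: { url: "https://example.com/clip.mp4" }, seed: 42 },
  requestId: "req-123",
});
console.log(example.videoUrl); // https://example.com/clip.mp4
console.log(example.seed);     // 42
```

Failing loudly on an unexpected shape makes a schema mismatch visible immediately instead of passing undefined downstream.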

Implementation Best Practices

Maximize the quality of your generated videos by following these guidelines:

Write Effective Prompts

  • Be specific about motion and action in your descriptions
  • Include details about lighting, atmosphere, and style
  • Specify camera movements if desired
  • Avoid mentioning static elements or still images
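
One way to apply these guidelines consistently is to assemble prompts from named parts. The helper below is a hypothetical sketch: its field names (subject, motion, lighting, style, camera) are illustrative and are not API parameters. The model only receives the final string.

```javascript
// Hypothetical helper: joins the parts of a prompt in a fixed order.
// Subject and motion are required because the guidelines stress action.
function buildPrompt({ subject, motion, lighting, style, camera }) {
  if (!subject || !motion) {
    throw new Error("subject and motion are required");
  }
  const parts = [`${subject} ${motion}`, lighting, style, camera];
  // Drop any optional parts that were not supplied.
  return parts.filter(Boolean).join(", ");
}

const examplePrompt = buildPrompt({
  subject: "A stylish woman",
  motion: "walks down a Tokyo street",
  lighting: "warm glowing neon",
  camera: "slow tracking shot from behind",
});
console.log(examplePrompt);
// A stylish woman walks down a Tokyo street, warm glowing neon, slow tracking shot from behind
```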

Optimize Performance

  • Use turbo mode (turbo_mode) for faster generation when quality trade-offs are acceptable
  • Enable prompt expansion (enable_prompt_expansion) when your prompt is short or sparse
  • Implement proper error handling
  • Monitor API usage through your dashboard
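
As one possible reading of "proper error handling", a request can be wrapped in a retry loop with exponential backoff. The helper below is a generic, hypothetical sketch and is not part of @fal-ai/client. It retries on any thrown error, which may be too broad for real use, since some failures (an invalid prompt, for example) will not succeed on a second attempt.

```javascript
// Hypothetical helper: runs `task`, retrying with exponential backoff.
// `task` is any function that returns a promise.
async function withRetry(task, { attempts = 3, baseDelayMs = 1000 } = {}) {
  let lastError;
  for (let attempt = 1; attempt <= attempts; attempt++) {
    try {
      return await task();
    } catch (error) {
      lastError = error;
      if (attempt < attempts) {
        // Wait baseDelayMs, then 2x, then 4x, and so on.
        const delayMs = baseDelayMs * 2 ** (attempt - 1);
        await new Promise((resolve) => setTimeout(resolve, delayMs));
      }
    }
  }
  // Every attempt failed: surface the most recent error.
  throw lastError;
}
```

In use, the task would wrap the generation call, for example withRetry(() => fal.subscribe("fal-ai/wan-t2v", { input })).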

Technical Integration

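The API supports both synchronous and asynchronous requests. For production use, submit requests through the queue API, which lets you poll for status or receive a webhook callback instead of holding a connection open: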

// Submit request
const { request_id } = await fal.queue.submit("fal-ai/wan-t2v", {
  input: {
    prompt: "Your detailed prompt",
    resolution: "720p",
    aspect_ratio: "16:9"
  },
  webhookUrl: "https://your.webhook.url"
});

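// Check status (the request may still be queued or in progress)
const status = await fal.queue.status("fal-ai/wan-t2v", {
  requestId: request_id
});

// Get result (call this only after the status reports completion)
const result = await fal.queue.result("fal-ai/wan-t2v", {
  requestId: request_id
});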
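
When no webhook is configured, the status check has to be repeated until the request finishes. The sketch below shows one way to structure that loop. The helper is hypothetical, and the status function is injected so the loop can run without network access. It assumes the status object exposes a status field that becomes "COMPLETED" when the result is ready; the interval and attempt limit are arbitrary illustrative values.

```javascript
// Hypothetical helper: calls `getStatus` until the request completes.
// ASSUMPTION: the status object has a `status` field equal to "COMPLETED"
// once the result can be fetched.
async function waitForCompletion(getStatus, { intervalMs = 2000, maxAttempts = 60 } = {}) {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const status = await getStatus();
    if (status.status === "COMPLETED") {
      return status;
    }
    // Not finished yet: pause before the next check.
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
  throw new Error(`Request did not complete after ${maxAttempts} status checks`);
}
```

In real use, getStatus could be () => fal.queue.status("fal-ai/wan-t2v", { requestId: request_id }), followed by a single fal.queue.result call once waitForCompletion resolves.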

Use Cases

WAN-2.1 Text-to-Video excels in various applications:

Content Creation

Create engaging social media content, product demonstrations, and creative visuals without traditional video production overhead.

Educational Content

Generate explanatory videos for complex concepts, making learning more engaging and accessible.

Marketing and Advertising

Quickly produce video assets for campaigns, product launches, and promotional materials.

Technical Specifications

Input Parameters:

  • prompt: Text description (required)
  • negative_prompt: What to avoid in generation
  • num_frames: 81-100 frames
  • frames_per_second: 5-24 FPS
  • resolution: 480p, 580p, or 720p
  • aspect_ratio: 16:9 or 9:16
  • enable_safety_checker: boolean
  • enable_prompt_expansion: boolean
  • turbo_mode: boolean for faster generation
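
The ranges listed above can be checked on the client before a request is sent. The validator below is a hypothetical sketch that encodes only what this list states. It assumes the numeric ranges are inclusive and that the values must be integers, neither of which is confirmed here.

```javascript
const RESOLUTIONS = ["480p", "580p", "720p"];
const ASPECT_RATIOS = ["16:9", "9:16"];
const BOOLEAN_FLAGS = ["enable_safety_checker", "enable_prompt_expansion", "turbo_mode"];

// ASSUMPTION: bounds are inclusive and values must be integers.
const inRange = (value, min, max) => Number.isInteger(value) && value >= min && value <= max;

// Hypothetical helper: returns a list of problems, empty when the input looks valid.
function validateInput(input) {
  const errors = [];
  if (typeof input.prompt !== "string" || input.prompt.trim() === "") {
    errors.push("prompt is required");
  }
  if (input.num_frames !== undefined && !inRange(input.num_frames, 81, 100)) {
    errors.push("num_frames must be an integer from 81 to 100");
  }
  if (input.frames_per_second !== undefined && !inRange(input.frames_per_second, 5, 24)) {
    errors.push("frames_per_second must be an integer from 5 to 24");
  }
  if (input.resolution !== undefined && !RESOLUTIONS.includes(input.resolution)) {
    errors.push("resolution must be one of 480p, 580p, 720p");
  }
  if (input.aspect_ratio !== undefined && !ASPECT_RATIOS.includes(input.aspect_ratio)) {
    errors.push("aspect_ratio must be 16:9 or 9:16");
  }
  for (const flag of BOOLEAN_FLAGS) {
    if (input[flag] !== undefined && typeof input[flag] !== "boolean") {
      errors.push(`${flag} must be a boolean`);
    }
  }
  return errors;
}

console.log(validateInput({ prompt: "A red kite rising over a beach", num_frames: 90 })); // []
console.log(validateInput({ prompt: "", resolution: "1080p" }).length); // 2
```

Returning a list rather than throwing on the first problem lets a form or log report every issue at once.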

Output:

  • Video file in MP4 format
  • Generation seed for reproducibility

Pricing and Usage

Simple, transparent pricing:

  • $0.20 per video generation at 480p and $0.40 at 720p (1.25x for more than the default number of frames)
  • Processing time: Approximately 1 minute
  • Pay-per-use model with no subscription required
  • No hidden fees or minimum commitments

View detailed pricing or contact sales for enterprise solutions.

Support and Resources

We're here to help you succeed with WAN-2.1 Text-to-Video:

  • Comprehensive API documentation at fal.ai/models/fal-ai/wan-t2v/api
  • Active Discord community for peer support
  • Direct technical support through our help desk
  • Regular model updates and improvements

Ready to transform your text into compelling video content? Get started with WAN-2.1 Text-to-Video today by creating an account at fal.ai/login.