fal-ai/vidu/q2/image-to-video/pro

Use the latest Vidu Q2 models which much more better quality and control on your videos.
Inference
Commercial use

Input

Type # to reference inputs.

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

For 720 p your video request would cost 0.1 $ along with a 0.05 $ for every video second. For 1080 p each request will cost 0.3 $ along with 0.1 $ for every video second.

Logs

Q2 Pro | [image-to-video]

Vidu's Q2 Pro image-to-video model delivers 2-8 second video generations at $0.10-$0.80 per video depending on resolution and duration. Trading flexible duration control for higher per-second costs, Q2 Pro targets production workflows requiring precise timing control. Ideal for advertising, social content, and creative prototyping where exact video length matters.

Use Cases: Social Media Content | Product Demonstrations | Creative Prototyping


Performance

Vidu Q2 Pro's variable pricing structure ($0.10 base + $0.05/second for 720p, $0.30 base + $0.10/second for 1080p) trades cost predictability for granular duration control across 2-8 second outputs.

MetricResultContext
Duration Range2-8 secondsAdjustable in 1-second increments via API parameter
Resolution Options720p, 1080pQuality-cost tradeoff: 3x price difference between tiers
Cost per Video (720p)$0.10 + $0.05/sec4-second video = $0.30 total
Cost per Video (1080p)$0.30 + $0.10/sec4-second video = $0.70 total
Movement ControlAuto/Small/Medium/LargeExplicit amplitude parameter for object motion intensity
Related EndpointsVidu Q1 Reference-to-VideoMulti-image reference variant for style consistency workflows

Precision Control Over Generic Automation

Vidu Q2 Pro's architecture separates base generation cost from duration pricing, contrasting with fixed-length models that bundle timing into single-tier pricing. The movement amplitude parameter provides explicit control over motion intensity, small for subtle product reveals, large for dynamic action sequences.

What this means for you:

  • Exact Duration Matching: Generate 5-second Instagram Reels or 3-second product loops without paying for unused frames, critical when platform requirements vary by seconds

  • Resolution-Cost Optimization: Test concepts at 720p ($0.20 for 2-second tests) before committing to 1080p finals ($0.50 for same duration), 60% cost savings during iteration

  • Movement Amplitude Control: Set "small" for product photography animations where camera shake ruins shots, or "large" for action sequences requiring dramatic motion

  • Optional Audio Enhancement: Add background music to 4-second videos via boolean flag, streamlines social content workflows requiring synchronized audio


Technical Specifications

SpecDetails
ArchitectureVidu Q2 Pro
Input FormatsImage URL (JPG, JPEG, PNG, WebP, GIF, AVIF) + text prompt (max 3000 characters)
Output FormatsMP4 video with optional background music
Duration Control2-8 seconds (1-second increments)
LicenseCommercial use permitted

API Documentation | Quickstart Guide | Enterprise Pricing


How It Stacks Up

Vidu Q1 Reference-to-Video ($0.10) – Vidu Q2 Pro ($0.10-$0.80) adds variable duration control and movement amplitude parameters at higher per-second costs. Q1 Reference-to-Video remains ideal for fixed-length workflows requiring multi-image style consistency where duration flexibility isn't critical.

Kling Video v2.6 Image-to-Video ($0.15 base) – Vidu Q2 Pro prioritizes granular duration control through per-second pricing for workflows requiring exact length matching. Kling v2.6 offers fixed-tier pricing for teams preferring predictable costs across standardized durations.

Pixverse Image-to-Video ($0.12) – Vidu Q2 Pro trades cost efficiency for explicit movement amplitude control and 1080p output options. Pixverse emphasizes faster iteration speeds at lower base costs for high-volume concept testing workflows.

LongCat Video Image-to-Video ($0.08) – Vidu Q2 Pro provides dual-resolution options and movement control at 1.25-10x the cost depending on settings. LongCat prioritizes maximum cost efficiency for 720p-only workflows where motion control granularity isn't required.