Try New Grok Imagine here!

Vidu Image to Video

fal-ai/vidu/q2/image-to-video/pro
Use the latest Vidu Q2 models which much more better quality and control on your videos.
Inference
Commercial use

Input

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

For 720 p your video request would cost 0.1 $ along with a 0.05 $ for every video second. For 1080 p each request will cost 0.3 $ along with 0.1 $ for every video second.

Logs

Q2 Pro | [image-to-video]

Vidu's Q2 Pro image-to-video model delivers 2-8 second video generations at $0.10-$0.80 per video depending on resolution and duration. Trading flexible duration control for higher per-second costs, Q2 Pro targets production workflows requiring precise timing control. Ideal for advertising, social content, and creative prototyping where exact video length matters.

Use Cases: Social Media Content | Product Demonstrations | Creative Prototyping


Performance

Vidu Q2 Pro's variable pricing structure ($0.10 base + $0.05/second for 720p, $0.30 base + $0.10/second for 1080p) trades cost predictability for granular duration control across 2-8 second outputs.

MetricResultContext
Duration Range2-8 secondsAdjustable in 1-second increments via API parameter
Resolution Options720p, 1080pQuality-cost tradeoff: 3x price difference between tiers
Cost per Video (720p)$0.10 + $0.05/sec4-second video = $0.30 total
Cost per Video (1080p)$0.30 + $0.10/sec4-second video = $0.70 total
Movement ControlAuto/Small/Medium/LargeExplicit amplitude parameter for object motion intensity
Related EndpointsVidu Q1 Reference-to-VideoMulti-image reference variant for style consistency workflows

Precision Control Over Generic Automation

Vidu Q2 Pro's architecture separates base generation cost from duration pricing, contrasting with fixed-length models that bundle timing into single-tier pricing. The movement amplitude parameter provides explicit control over motion intensity, small for subtle product reveals, large for dynamic action sequences.

What this means for you:

  • Exact Duration Matching: Generate 5-second Instagram Reels or 3-second product loops without paying for unused frames, critical when platform requirements vary by seconds

  • Resolution-Cost Optimization: Test concepts at 720p ($0.20 for 2-second tests) before committing to 1080p finals ($0.50 for same duration), 60% cost savings during iteration

  • Movement Amplitude Control: Set "small" for product photography animations where camera shake ruins shots, or "large" for action sequences requiring dramatic motion

  • Optional Audio Enhancement: Add background music to 4-second videos via boolean flag, streamlines social content workflows requiring synchronized audio


Technical Specifications

SpecDetails
ArchitectureVidu Q2 Pro
Input FormatsImage URL (JPG, JPEG, PNG, WebP, GIF, AVIF) + text prompt (max 3000 characters)
Output FormatsMP4 video with optional background music
Duration Control2-8 seconds (1-second increments)
LicenseCommercial use permitted

API Documentation | Quickstart Guide | Enterprise Pricing


How It Stacks Up

Vidu Q1 Reference-to-Video ($0.10) – Vidu Q2 Pro ($0.10-$0.80) adds variable duration control and movement amplitude parameters at higher per-second costs. Q1 Reference-to-Video remains ideal for fixed-length workflows requiring multi-image style consistency where duration flexibility isn't critical.

Kling Video v2.6 Image-to-Video ($0.15 base) – Vidu Q2 Pro prioritizes granular duration control through per-second pricing for workflows requiring exact length matching. Kling v2.6 offers fixed-tier pricing for teams preferring predictable costs across standardized durations.

Pixverse Image-to-Video ($0.12) – Vidu Q2 Pro trades cost efficiency for explicit movement amplitude control and 1080p output options. Pixverse emphasizes faster iteration speeds at lower base costs for high-volume concept testing workflows.

LongCat Video Image-to-Video ($0.08) – Vidu Q2 Pro provides dual-resolution options and movement control at 1.25-10x the cost depending on settings. LongCat prioritizes maximum cost efficiency for 720p-only workflows where motion control granularity isn't required.