Step-Video Text to Video
fal-ai/stepfun-video
Step-Video is a state-of-the-art (SoTA) pre-trained text-to-video model with 30 billion parameters that can generate videos of up to 204 frames.
Inference
Partner
Commercial use
A generation takes approximately 20 minutes.
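Because a generation runs for a long time, it can help to call the model from a script rather than wait on the page. The following is a minimal sketch, not the official usage: it assumes the `fal-client` Python package is installed, that credentials are supplied through the `FAL_KEY` environment variable, and that the endpoint accepts a `prompt` argument (the input schema is not shown on this page, so that key is an assumption). The helper names `build_arguments` and `generate` are hypothetical.

```python
MODEL_ID = "fal-ai/stepfun-video"  # endpoint ID shown on this page


def build_arguments(prompt: str) -> dict:
    """Build the request payload.

    The "prompt" key is an assumption; check the model's input schema.
    """
    text = prompt.strip()
    if not text:
        raise ValueError("prompt must not be empty")
    return {"prompt": text}


def generate(prompt: str):
    """Submit the prompt and block until the result is ready.

    This call may block for roughly 20 minutes, per the estimate above.
    """
    # Imported lazily so build_arguments works without the client installed.
    import fal_client  # third-party: pip install fal-client

    return fal_client.subscribe(MODEL_ID, arguments=build_arguments(prompt))


if __name__ == "__main__":
    # Only builds and prints the payload; call generate(...) to run the model.
    print(build_arguments("  A red fox running through fresh snow at sunrise  "))
```

Calling `generate("...")` sends the request and returns whatever the endpoint responds with; the shape of that response is not documented here, so inspect it before relying on any particular field.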
Related Models
fal-ai/minimax/video-01-director
text-to-video
Generate video clips that follow natural-language descriptions more accurately, with camera movement instructions for shot control.
motion
transformation
camera-controls
fal-ai/veo2
text-to-video
Veo 2 creates videos with realistic motion and high quality output. Explore different styles and find your own with extensive camera controls.
motion
transformation
fal-ai/minimax/video-01-live
text-to-video
Generate video clips from your prompts using the MiniMax model.
motion
transformation