Sa2VA 8B Video Vision
fal-ai/sa2va/8b/video
Sa2VA is an MLLM capable of question answering, visual prompt understanding, and dense object segmentation at both image and video levels
Inference
Commercial use
Input
Hint: Drag and drop video files from your computer, video from web pages, paste from clipboard (Ctrl/Cmd+V), or provide a URL. Accepted file types: mp4, mov, webm, m4v, gif
Additional Settings
Customize your input with more control.
Result
Idle
Your request will cost $0.08 per second.