MoonDreamNext Vision

fal-ai/moondream-next
MoonDreamNext is a multimodal vision-language model for image captioning, gaze detection, bounding-box detection, point detection, and more.
Inference
Commercial use


Your request will cost $0 per compute second.
