Run the latest models all in one Sandbox 🏖️

MoonDreamNext Vision

fal-ai/moondream-next
MoonDreamNext is a multimodal vision-language model for captioning, gaze detection, bbox detection, point detection, and more.
Inference
Commercial use

Input

Additional Settings

Customize your input with more control.

Result

Idle

What would you like to do next?

Your request will cost $0.0011 per second.

Logs