Moondream3 Preview [Segment] Image to Image

fal-ai/moondream3-preview/segment
Moondream 3 is a vision language model that brings frontier-level visual reasoning with native object detection, pointing, and OCR capabilities to real-world applications requiring fast, inexpensive inference at scale.
Inference
Commercial use

Your request will cost $0.40 per million input tokens and $3.50 per million output tokens.
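As a rough illustration of how the per-token pricing adds up, the sketch below estimates the cost of a single request from the two rates quoted above. The function name `estimate_cost_usd` and the token counts in the usage example are hypothetical. This page does not say how image inputs are converted to tokens, so treat the counts as placeholders to be replaced with the figures reported for your own requests.

```python
# Rates taken from the pricing line on this page (USD per one million tokens).
INPUT_USD_PER_MILLION = 0.40
OUTPUT_USD_PER_MILLION = 3.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD.

    Hypothetical helper: assumes billing is strictly linear in token counts,
    with no minimum charge or rounding, which this page does not confirm.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Placeholder token counts, chosen only to show the arithmetic.
    cost = estimate_cost_usd(input_tokens=2_000, output_tokens=500)
    print(f"Estimated cost: ${cost:.6f}")
```

Because output tokens are priced at almost nine times the input rate, the output side tends to dominate the estimate whenever the response is more than a small fraction of the input size.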
