Moondream3 Preview [Point] Large Language Models
fal-ai/moondream3-preview/point
Moondream 3 is a vision language model that brings frontier-level visual reasoning with native object detection, pointing, and OCR capabilities to real-world applications requiring fast, inexpensive inference at scale.
Inference
Commercial use
Input
Provide an image by uploading a file or supplying a URL. Accepted file types: jpg, jpeg, png, webp, gif, avif

Requests cost $0.30 per million input tokens and $2.50 per million output tokens.
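To make the per-token pricing concrete, here is a minimal sketch of the arithmetic. The two rates are taken from the pricing line above; the function name `estimate_cost_usd` and the token counts in the usage note are hypothetical, chosen only for illustration. How many tokens a given image or response actually consumes is not stated on this page.

```python
from decimal import Decimal

# Rates quoted on this page, in USD per one million tokens.
INPUT_USD_PER_MILLION = Decimal("0.3")
OUTPUT_USD_PER_MILLION = Decimal("2.5")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper: this is plain arithmetic on the listed rates,
    not a call into any billing API.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        INPUT_USD_PER_MILLION * input_tokens
        + OUTPUT_USD_PER_MILLION * output_tokens
    )
    return total / TOKENS_PER_MILLION
```

For example, a request assumed to use 1,200 input tokens and 50 output tokens would come to `estimate_cost_usd(1200, 50)`, that is (0.3 × 1200 + 2.5 × 50) / 1,000,000 = 0.000485 USD. `Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift.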