Moondream Vision
fal-ai/moondream/batched
Answer questions from the images.
Inference
Partner
Commercial use
Input
Hint: you can drag and drop file(s) here, or provide a base64 encoded data URL Accepted file types: jpg, jpeg, png, webp


Additional Settings
Customize your input with more control.
Result
Idle
Loading pricing info...
Logs
Related Models
fal-ai/florence-2-large/region-to-description
vision
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
multimodal
vision
fal-ai/got-ocr/v2
vision
GOT-OCR2 works on a wide range of tasks, including plain document OCR, scene text OCR, formatted document OCR, and even OCR for tables, charts, mathematical formulas, geometric shapes, molecular formulas and sheet music.
new
optical character recognition
high-res
utility
fal-ai/any-llm/vision
vision
Use any vision language model from our selected catalogue (powered by OpenRouter)
multimodal
vision
streaming