Florence-2 Large Vision
fal-ai/florence-2-large/more-detailed-caption
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
Inference
Commercial use
Input
Hint: you can drag and drop file(s) here, or provide a base64 encoded data URL Accepted file types: jpg, jpeg, png, webp
![](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/tasks/car.jpg)
![](http://ecx.images-amazon.com/images/I/51UUzBDAMsL.jpg)
Result
Idle
Loading pricing info...
Logs
Related Models
fal-ai/florence-2-large/detailed-caption
vision
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
captioning
multimodal
vision
fal-ai/florence-2-large/caption
vision
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks
captioning
multimodal
vision
fal-ai/imageutils/nsfw
vision
Predict the probability of an image being NSFW.
filter
safety
utility