Florence-2 Large Image to Image
fal-ai/florence-2-large/caption-to-phrase-grounding
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks.
Inference
Commercial use
Input
Hint: drag and drop a file here, or provide a base64-encoded data URL. Accepted file types: jpg, jpeg, png, webp.
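As a minimal sketch of the base64 data URL option mentioned in the hint, the snippet below wraps raw image bytes in a data URL and rejects file types outside the accepted list. The helper name `to_data_url` and the `ACCEPTED_TYPES` mapping are illustrative assumptions, not part of any fal API; only the list of accepted extensions comes from this page.

```python
import base64
from pathlib import PurePath

# File types the page lists as accepted, mapped to their standard MIME types.
ACCEPTED_TYPES = {
    "jpg": "image/jpeg",
    "jpeg": "image/jpeg",
    "png": "image/png",
    "webp": "image/webp",
}


def to_data_url(filename: str, data: bytes) -> str:
    """Hypothetical helper: wrap raw image bytes in a base64 data URL."""
    # Use the file extension (case-insensitive) to pick the MIME type.
    ext = PurePath(filename).suffix.lower().lstrip(".")
    if ext not in ACCEPTED_TYPES:
        raise ValueError(f"unsupported file type: {ext!r}")
    encoded = base64.b64encode(data).decode("ascii")
    return f"data:{ACCEPTED_TYPES[ext]};base64,{encoded}"
```

In practice the bytes would come from something like `pathlib.Path("photo.png").read_bytes()`. The resulting string can then be supplied as the image input; the exact request field name is not given on this page, so it is left out here.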
Related Models
fal-ai/ideogram/v2/remix
image-to-image
Reimagine existing images with Ideogram V2's remix feature. Create variations and adaptations while preserving core elements and adding new creative directions through prompt guidance.
realism
typography
fal-ai/aura-sr
image-to-image
Upscale your images with AuraSR.
upscaling
high-res
fal-ai/florence-2-large/dense-region-caption
image-to-image
Florence-2 is an advanced vision foundation model that uses a prompt-based approach to handle a wide range of vision and vision-language tasks.
multimodal
vision