InternLM XComposer 2 7B Vision
fal-ai/internlm-xcomposer-2-7b
A general vision-language large model (VLLM) based on InternLM2, with the capability of 4K resolution image understanding.
Inference
Commercial use
Input
Hint: Provide an image by dragging and dropping a file from your computer or an image from a web page, by pasting from the clipboard (Ctrl/Cmd+V), or by entering a URL. Accepted file types: jpg, jpeg, png, webp, gif, avif
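
The accepted file types can be checked on the client side before an image is submitted. The sketch below is a minimal illustration and not part of the service: the helper name `has_accepted_extension` is hypothetical, and it looks only at the file extension, not the file contents, so the service's own validation may behave differently.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions taken from the "Accepted file types" list on this page.
ACCEPTED_EXTENSIONS = {"jpg", "jpeg", "png", "webp", "gif", "avif"}


def has_accepted_extension(path_or_url: str) -> bool:
    """Return True if the file name ends in one of the accepted extensions.

    Hypothetical helper: it checks the name only and never opens the file.
    """
    # For URLs, urlparse separates the path from any query string or fragment.
    # A plain relative file name is returned unchanged in .path.
    path = urlparse(path_or_url).path
    # Compare case-insensitively and without the leading dot.
    suffix = PurePosixPath(path).suffix.lower().lstrip(".")
    return suffix in ACCEPTED_EXTENSIONS


if __name__ == "__main__":
    for candidate in ("photo.jpeg", "https://example.com/cat.PNG?size=large", "notes.pdf"):
        print(candidate, has_accepted_extension(candidate))
```

A check like this only catches obvious mismatches early; a file with an accepted extension can still be rejected if its contents are not a valid image.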

Your request will cost $0 per compute second.