InternLM XComposer 2 7B Vision

fal-ai/internlm-xcomposer-2-7b
A general vision-language large model (VLLM) based on InternLM2, capable of understanding 4K-resolution images.
Inference
Commercial use


Requests cost $0.00111 per compute second.
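
As a rough illustration of per-second billing, the sketch below multiplies the listed rate by a run time. The function name `estimate_cost` and the example duration are hypothetical; the actual compute time of a request depends on the input and is not stated on this page.

```python
from decimal import Decimal

# USD per compute second, taken from the listed price.
PRICE_PER_COMPUTE_SECOND = Decimal("0.00111")


def estimate_cost(compute_seconds):
    """Estimate the USD cost of a request that runs for `compute_seconds`.

    Hypothetical helper for illustration; it only multiplies the rate by the
    duration and ignores any rounding or minimum charge the provider may apply.
    """
    if compute_seconds < 0:
        raise ValueError("compute_seconds must be non-negative")
    # Convert via str so float inputs such as 2.5 do not carry binary noise.
    return PRICE_PER_COMPUTE_SECOND * Decimal(str(compute_seconds))


if __name__ == "__main__":
    # Example: a request assumed to take 10 seconds of compute.
    print(f"Estimated cost: ${estimate_cost(10)}")
```

Using `Decimal` rather than `float` keeps small currency amounts exact when many requests are summed.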
