openrouter/router/vision

Run any Vision Language Model with fal. Analyze and understand images using Claude (Anthropic), GPT-5 / GPT-4o (OpenAI), Gemini (Google), Grok (xAI), Llama (Meta), Qwen, Pixtral (Mistral), and more. Send one or multiple images for captioning, analysis, OCR, or visual Q&A. Powered by OpenRouter.
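As a rough illustration of sending one or more images with a prompt, the sketch below builds a request payload and hands it to the fal Python client. The argument names `prompt`, `image_urls`, and `model`, and the helper names `build_vision_arguments` and `caption_images`, are assumptions made for this example and are not confirmed by this page; check the endpoint's schema before relying on them.

```python
from typing import Any

# Endpoint name taken from the page title.
ENDPOINT_ID = "openrouter/router/vision"


def build_vision_arguments(prompt: str, image_urls: list[str], model: str) -> dict[str, Any]:
    """Assemble a request payload for one or more images.

    The field names used here are assumptions, not the documented schema.
    """
    if not image_urls:
        raise ValueError("at least one image URL is required")
    return {
        "prompt": prompt,
        "image_urls": list(image_urls),  # copy so the caller's list is not shared
        "model": model,
    }


def caption_images(prompt: str, image_urls: list[str], model: str) -> Any:
    """Submit the request and wait for the result (hypothetical helper).

    Requires the `fal-client` package and a FAL_KEY environment variable.
    """
    # Imported lazily so the payload helper works without the package installed.
    import fal_client

    return fal_client.subscribe(
        ENDPOINT_ID,
        arguments=build_vision_arguments(prompt, image_urls, model),
    )
```

A call might look like `caption_images("Caption this image for a text-to-image model with as much detail as possible.", ["https://example.com/photo.jpg"], "<model id>")`, where the URL and the model identifier are placeholders. Keeping payload construction separate from the network call makes the assumed field names easy to adjust once the real schema is known.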

Tags: Inference, Commercial use, Streaming, Partner

Prompt examples

The examples below were generated with OpenRouter [Vision]. You can customize them in the Playground.

Prompt: Caption this image for a text-to-image model with as much detail as possible.

Output (text): Close-up, detailed shot of an Inuit man, 30s-40s, with warm brown eyes, a trimmed black beard and mustache, frosted with snow and ice, looking thoughtfully upwards, against a blurred background of a snowstorm and cold, pale blue sky, wearing a thick, faux fur-lined parka with light brown and white fur, covered in snow, with flakes of snow falling on his hair, beard, and parka, his skin has slight blemishes and small scars.