
AI Vision MCP Server
Analyzes images and videos using Google's AI models to answer questions, detect objects, and understand visual content through natural language prompts.
181 viewsLocal (stdio)
What it does
- Analyze single or multiple images with AI
- Detect objects with precise bounding boxes
- Process video content for analysis
- Answer natural language questions about visual media
- Extract text and details from images
Best for
Content creators analyzing media assetsDevelopers building vision-powered applicationsResearchers processing visual data at scale
Powered by Google Gemini and Vertex AISupports both images and videos