AI Vision MCP Server

AI Vision MCP Server

honeyvig

Analyzes images and videos using Google's AI models to answer questions, detect objects, and understand visual content through natural language prompts.

181 viewsLocal (stdio)

What it does

  • Analyze single or multiple images with AI
  • Detect objects with precise bounding boxes
  • Process video content for analysis
  • Answer natural language questions about visual media
  • Extract text and details from images

Best for

Content creators analyzing media assetsDevelopers building vision-powered applicationsResearchers processing visual data at scale
Powered by Google Gemini and Vertex AISupports both images and videos

Alternatives