Analyzes images and videos using Google's Gemini or Vertex AI models, with intelligent file handling for different content types and sizes.

42504 views11Local (stdio)

What it does

  • Analyze images with AI-powered vision models
  • Process video content for insights and analysis
  • Compare multiple images side-by-side
  • Upload files via URLs, local paths, or base64
  • Store and manage media files in Google Cloud Storage
  • Switch between Gemini API and Vertex AI providers

Best for

Content creators analyzing visual mediaDevelopers building vision-enabled applicationsResearchers processing image/video datasetsTeams needing automated visual content analysis
Dual provider support (Gemini + Vertex AI)Handles both images and videosIntelligent file upload optimization

Alternatives