Puppeteer Vision Web Scraper

djannot

Automatically scrapes web content by using AI to handle cookie banners, CAPTCHAs, paywalls, and other blocking elements, then converts the extracted content to clean markdown.

47450 views | 9 | Local (stdio)

What it does

  • Scrape webpages with Puppeteer in stealth mode
  • Handle cookie banners and consent prompts automatically
  • Bypass CAPTCHAs and paywalls with AI interaction
  • Extract main content using Mozilla Readability
  • Convert HTML to well-formatted markdown
  • Process code blocks and tables with special formatting
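The last bullet, special formatting for code blocks and tables, can be illustrated with a minimal sketch. This is not the server's actual implementation, which the listing does not describe; the helper names `tableToMarkdown` and `fenceCode` are hypothetical, and the input is assumed to be text already extracted from the page's table cells and code elements.

```typescript
// Hypothetical helper: renders rows of cell text as a Markdown table.
// The first row is treated as the header row.
function tableToMarkdown(rows: string[][]): string {
  if (rows.length === 0) return "";
  const width = Math.max(...rows.map((row) => row.length));

  // Collapse whitespace and escape pipes so each cell stays on one table line.
  const clean = (cell: string): string =>
    cell.replace(/\s+/g, " ").trim().replace(/\|/g, "\\|");

  // Pad short rows with empty cells so every row has the same column count.
  const renderRow = (row: string[]): string => {
    const cells = Array.from({ length: width }, (_, i) => clean(row[i] ?? ""));
    return `| ${cells.join(" | ")} |`;
  };

  const lines = rows.map(renderRow);
  const separator = `| ${Array.from({ length: width }, () => "---").join(" | ")} |`;
  lines.splice(1, 0, separator); // separator goes right after the header row
  return lines.join("\n");
}

// Hypothetical helper: wraps code in a fenced block, choosing a fence
// longer than any backtick run that appears inside the code itself.
function fenceCode(code: string, language = ""): string {
  const runs: string[] = code.match(/`+/g) ?? [];
  const longest = runs.reduce((max, run) => Math.max(max, run.length), 0);
  const fence = "`".repeat(Math.max(3, longest + 1));
  return `${fence}${language}\n${code.replace(/\n+$/, "")}\n${fence}`;
}

console.log(tableToMarkdown([["Name", "Role"], ["Ada", "Engineer"]]));
console.log(fenceCode("const x = 1;\n", "ts"));
```

Escaping pipes and sizing the fence to the content are the two details that most often break converted tables and code samples, which is why the sketch handles them explicitly.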

Best for

  • Content researchers scraping protected websites
  • Data analysts extracting articles from news sites
  • Developers building content aggregation systems
  • Anyone needing clean text from complex modern websites

  • AI-powered interaction with blocking elements
  • Run instantly via npx
  • Real-time browser viewing option

Alternatives