BrowserCat

Name: BrowserCat
Rating: 4.7 (25 reviews)
Author: browsercat

Official

Provides cloud-based browser automation that lets LLMs navigate websites, interact with page elements, capture screenshots, and execute JavaScript without installing a browser locally.


Tags: browser automation, developer tools


What it does

Navigate to any web page
Take full page or element screenshots
Click and hover on page elements
Fill forms and select dropdown options
Execute JavaScript in browser console
Access browser console logs

Best for

Web scraping and data extraction
Automated testing of web applications
AI agents that need to interact with websites
Content monitoring and verification

Cloud-based: no local browser needed
Real browser environment
Screenshot capture with element targeting

About BrowserCat

BrowserCat is an official MCP server published by browsercat that provides AI assistants with tools and capabilities via the Model Context Protocol. BrowserCat offers cloud-based Selenium test automation for software testing, enabling LLM-driven web navigation and interaction. It is categorized under browser automation and developer tools.

How to install

You can install BrowserCat in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

BrowserCat is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

BrowserCat MCP Server

A Model Context Protocol server that provides browser automation capabilities using BrowserCat's cloud browser service. This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment without needing to install browsers locally.

Components

Tools

browsercat_navigate
- Navigate to any URL in the browser
- Input: url (string)
browsercat_screenshot
- Capture screenshots of the entire page or specific elements
- Inputs:
  - name (string, required): Name for the screenshot
  - selector (string, optional): CSS selector for element to screenshot
  - width (number, optional, default: 800): Screenshot width
  - height (number, optional, default: 600): Screenshot height
browsercat_click
- Click elements on the page
- Input: selector (string): CSS selector for element to click
browsercat_hover
- Hover elements on the page
- Input: selector (string): CSS selector for element to hover
browsercat_fill
- Fill out input fields
- Inputs:
  - selector (string): CSS selector for input field
  - value (string): Value to fill
browsercat_select
- Select an option from a dropdown menu
- Inputs:
  - selector (string): CSS selector for select element
  - value (string): Value to select
browsercat_evaluate
- Execute JavaScript in the browser console
- Input: script (string): JavaScript code to execute
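
To make the tool list concrete, here is a minimal sketch of how a client might build the requests for these tools. It assumes the standard MCP JSON-RPC 2.0 `tools/call` method. The `REQUIRED_ARGS` table and the `build_tool_call` helper are hypothetical names introduced for illustration. Only `name` is explicitly marked as required above, so treating the other listed inputs as required is an assumption.

```python
import json
from itertools import count

# Arguments assumed to be required per tool, transcribed from the tool list.
# Only `name` for browsercat_screenshot is explicitly marked "required" there;
# the rest is an assumption made for this sketch.
REQUIRED_ARGS = {
    "browsercat_navigate": {"url"},
    "browsercat_screenshot": {"name"},
    "browsercat_click": {"selector"},
    "browsercat_hover": {"selector"},
    "browsercat_fill": {"selector", "value"},
    "browsercat_select": {"selector", "value"},
    "browsercat_evaluate": {"script"},
}

_request_ids = count(1)  # JSON-RPC request ids: 1, 2, 3, ...


def build_tool_call(tool: str, arguments: dict) -> dict:
    """Return a JSON-RPC 2.0 `tools/call` request for one BrowserCat tool."""
    if tool not in REQUIRED_ARGS:
        raise ValueError(f"unknown tool: {tool}")
    missing = REQUIRED_ARGS[tool] - set(arguments)
    if missing:
        raise ValueError(f"{tool} is missing arguments: {sorted(missing)}")
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }


# Example: open a page, then capture one element at the default 800x600 size.
steps = [
    build_tool_call("browsercat_navigate", {"url": "https://example.com"}),
    build_tool_call("browsercat_screenshot", {"name": "home", "selector": "h1"}),
]
for step in steps:
    print(json.dumps(step))
```

In practice an MCP client library usually builds these messages for you. The sketch only shows the shape of the payload each tool expects.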

Resources

The server provides access to two types of resources:

Console Logs (console://logs)
- Browser console output in text format
- Includes all console messages from the browser
Screenshots (screenshot://<name>)
- PNG images of captured screenshots
- Accessible via the screenshot name specified during capture
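
The two resource types can be read with the standard MCP `resources/read` method. The sketch below builds such a request and decodes a result. It assumes the server follows the usual MCP convention of returning text resources in a `text` field and binary resources as base64 in a `blob` field. The helper names and the sample result are made up for illustration and are not real server output.

```python
import base64

CONSOLE_LOGS_URI = "console://logs"


def screenshot_uri(name: str) -> str:
    """Build the resource URI for a screenshot captured under `name`."""
    return f"screenshot://{name}"


def build_resource_read(uri: str, request_id: int) -> dict:
    """Return a JSON-RPC 2.0 `resources/read` request for one resource URI."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "resources/read",
        "params": {"uri": uri},
    }


def decode_contents(result: dict) -> dict:
    """Map each returned URI to str (text resource) or bytes (binary resource)."""
    decoded = {}
    for item in result.get("contents", []):
        if "text" in item:
            decoded[item["uri"]] = item["text"]
        elif "blob" in item:
            decoded[item["uri"]] = base64.b64decode(item["blob"])
    return decoded


# Hypothetical result, constructed here only to exercise the decoder.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
sample_result = {
    "contents": [
        {"uri": CONSOLE_LOGS_URI, "mimeType": "text/plain", "text": "[log] page loaded"},
        {
            "uri": screenshot_uri("home"),
            "mimeType": "image/png",
            "blob": base64.b64encode(PNG_SIGNATURE).decode("ascii"),
        },
    ]
}

request = build_resource_read(screenshot_uri("home"), request_id=1)
decoded = decode_contents(sample_result)
print(request["params"]["uri"], type(decoded[screenshot_uri("home")]).__name__)
```

The screenshot name in the URI is the same `name` value passed to browsercat_screenshot, so a client can derive the URI without listing resources first.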

Key Features

Cloud-based browser automation
No local browser installation required
Console log monitoring
Screenshot capabilities
JavaScript execution
Basic web interaction (navigation, clicking, form filling)

Configuring the BrowserCat MCP Server

Environment Variables

The BrowserCat MCP server requires the following environment variable:

BROWSERCAT_API_KEY: Your BrowserCat API key (required). You can get one for free at https://browsercat.xyz/mcp.

NPX Configuration

{
  "mcpServers": {
    "browsercat": {
      "command": "npx",
      "args": ["-y", "@browsercatco/mcp-server"],
      "env": {
        "BROWSERCAT_API_KEY": "your-api-key-here"
      }
    }
  }
}
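
As a rough illustration of what an MCP client does with this entry, the sketch below parses the configuration and derives the command line and environment it would use to start the server. Nothing is launched. `launch_spec` is a hypothetical helper, and rejecting the placeholder key is a design choice of this sketch, not documented server behavior.

```python
import json
import os

CONFIG_TEXT = """
{
  "mcpServers": {
    "browsercat": {
      "command": "npx",
      "args": ["-y", "@browsercatco/mcp-server"],
      "env": {
        "BROWSERCAT_API_KEY": "your-api-key-here"
      }
    }
  }
}
"""

PLACEHOLDER_KEY = "your-api-key-here"


def launch_spec(config: dict, server: str, api_key: str):
    """Return (argv, env) for starting one configured server over stdio."""
    entry = config["mcpServers"][server]
    if not api_key or api_key == PLACEHOLDER_KEY:
        raise ValueError("BROWSERCAT_API_KEY must be set to a real API key")
    argv = [entry["command"], *entry["args"]]
    # Start from the current environment, apply the config's env block,
    # then override the placeholder with the real key.
    env = {**os.environ, **entry.get("env", {}), "BROWSERCAT_API_KEY": api_key}
    return argv, env


config = json.loads(CONFIG_TEXT)
argv, env = launch_spec(config, "browsercat", api_key="example-key")
print(argv)
```

A client could then pass `argv` and `env` to `subprocess.Popen` with `stdin` and `stdout` set to `subprocess.PIPE` to talk to the server over stdio. Reading the real key from the environment or a secrets store keeps it out of the checked-in configuration file.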

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

