BrowserCat

BrowserCat

Official
browsercat

Provides cloud-based browser automation for LLMs to navigate websites, interact with elements, and take screenshots without installing browsers locally.

Enables LLMs to interact with web pages through cloud-based browser automation for navigation, screenshot capture, element interaction, and JavaScript execution without local browser installation.

5427 views10Local (stdio)

What it does

  • Navigate to any web page
  • Take full page or element screenshots
  • Click and hover on page elements
  • Fill forms and select dropdown options
  • Execute JavaScript in browser console
  • Access browser console logs

Best for

Web scraping and data extractionAutomated testing of web applicationsAI agents that need to interact with websitesContent monitoring and verification
Cloud-based — no local browser neededReal browser environmentScreenshot capture with element targeting

About BrowserCat

BrowserCat is an official MCP server published by browsercat that provides AI assistants with tools and capabilities via the Model Context Protocol. BrowserCat offers cloud-based Selenium test automation for software testing, enabling LLM-driven web navigation and inte It is categorized under browser automation, developer tools.

How to install

You can install BrowserCat in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

BrowserCat is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

BrowserCat MCP Server

A Model Context Protocol server that provides browser automation capabilities using BrowserCat's cloud browser service. This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment without needing to install browsers locally.

Components

Tools

  • browsercat_navigate
    • Navigate to any URL in the browser
    • Input: url (string)
  • browsercat_screenshot
    • Capture screenshots of the entire page or specific elements
    • Inputs:
      • name (string, required): Name for the screenshot
      • selector (string, optional): CSS selector for element to screenshot
      • width (number, optional, default: 800): Screenshot width
      • height (number, optional, default: 600): Screenshot height
  • browsercat_click
    • Click elements on the page
    • Input: selector (string): CSS selector for element to click
  • browsercat_hover
    • Hover elements on the page
    • Input: selector (string): CSS selector for element to hover
  • browsercat_fill
    • Fill out input fields
    • Inputs:
      • selector (string): CSS selector for input field
      • value (string): Value to fill
  • browsercat_select
    • Select an option from a dropdown menu
    • Inputs:
      • selector (string): CSS selector for select element
      • value (string): Value to select
  • browsercat_evaluate
    • Execute JavaScript in the browser console
    • Input: script (string): JavaScript code to execute

Resources

The server provides access to two types of resources:

  1. Console Logs (console://logs)
    • Browser console output in text format
    • Includes all console messages from the browser
  2. Screenshots (screenshot://<name>)
    • PNG images of captured screenshots
    • Accessible via the screenshot name specified during capture

Key Features

  • Cloud-based browser automation
  • No local browser installation required
  • Console log monitoring
  • Screenshot capabilities
  • JavaScript execution
  • Basic web interaction (navigation, clicking, form filling)

Configuration to use BrowserCat MCP Server

Environment Variables

The BrowserCat MCP server requires the following environment variable:

NPX Configuration

{
  "mcpServers": {
    "browsercat": {
      "command": "npx",
      "args": ["-y", "@browsercatco/mcp-server"],
      "env": {
        "BROWSERCAT_API_KEY": "your-api-key-here"
      }
    }
  }
}

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

Alternatives

Related Skills

Browse all skills
chrome-devtools

Browser automation, debugging, and performance analysis using Puppeteer CLI scripts. Use for automating browsers, taking screenshots, analyzing performance, monitoring network traffic, web scraping, form automation, and JavaScript debugging.

30
browser-tools

Lightweight Chrome automation toolkit with shared configuration, JSON-first output, and six focused scripts for starting, navigating, inspecting, capturing, evaluating, and cleaning up browser sessions.

5
browser-setup-devtools

Guide users through browser automation setup using Chrome DevTools MCP as the primary path and the OpenCode browser extension as a fallback. Use when the user asks to set up browser automation, Chrome DevTools MCP, browser MCP, browser extension, or runs the browser-setup command.

3
crewai-developer

Comprehensive CrewAI framework guide for building collaborative AI agent teams and structured workflows. Use when developing multi-agent systems with CrewAI, creating autonomous AI crews, orchestrating flows, implementing agents with roles and tools, or building production-ready AI automation. Essential for developers building intelligent agent systems, task automation, and complex AI workflows.

2
browser

This skill should be used for browser automation tasks using Chrome DevTools Protocol (CDP). Triggers when users need to launch Chrome with remote debugging, navigate pages, execute JavaScript in browser context, capture screenshots, or interactively select DOM elements. No MCP server required.

2
ccxt-typescript

CCXT cryptocurrency exchange library for TypeScript and JavaScript developers (Node.js and browser). Covers both REST API (standard) and WebSocket API (real-time). Helps install CCXT, connect to exchanges, fetch market data, place orders, stream live tickers/orderbooks, handle authentication, and manage errors. Use when working with crypto exchanges in TypeScript/JavaScript projects, trading bots, arbitrage systems, or portfolio management tools. Includes both REST and WebSocket examples.

1