
BrowserCat
OfficialProvides cloud-based browser automation for LLMs to navigate websites, interact with elements, and take screenshots without installing browsers locally.
Enables LLMs to interact with web pages through cloud-based browser automation for navigation, screenshot capture, element interaction, and JavaScript execution without local browser installation.
What it does
- Navigate to any web page
- Take full page or element screenshots
- Click and hover on page elements
- Fill forms and select dropdown options
- Execute JavaScript in browser console
- Access browser console logs
Best for
About BrowserCat
BrowserCat is an official MCP server published by browsercat that provides AI assistants with tools and capabilities via the Model Context Protocol. BrowserCat offers cloud-based Selenium test automation for software testing, enabling LLM-driven web navigation and inte It is categorized under browser automation, developer tools.
How to install
You can install BrowserCat in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.
License
BrowserCat is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.
BrowserCat MCP Server
A Model Context Protocol server that provides browser automation capabilities using BrowserCat's cloud browser service. This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment without needing to install browsers locally.
Components
Tools
- browsercat_navigate
- Navigate to any URL in the browser
- Input:
url(string)
- browsercat_screenshot
- Capture screenshots of the entire page or specific elements
- Inputs:
name(string, required): Name for the screenshotselector(string, optional): CSS selector for element to screenshotwidth(number, optional, default: 800): Screenshot widthheight(number, optional, default: 600): Screenshot height
- browsercat_click
- Click elements on the page
- Input:
selector(string): CSS selector for element to click
- browsercat_hover
- Hover elements on the page
- Input:
selector(string): CSS selector for element to hover
- browsercat_fill
- Fill out input fields
- Inputs:
selector(string): CSS selector for input fieldvalue(string): Value to fill
- browsercat_select
- Select an option from a dropdown menu
- Inputs:
selector(string): CSS selector for select elementvalue(string): Value to select
- browsercat_evaluate
- Execute JavaScript in the browser console
- Input:
script(string): JavaScript code to execute
Resources
The server provides access to two types of resources:
- Console Logs (
console://logs)- Browser console output in text format
- Includes all console messages from the browser
- Screenshots (
screenshot://<name>)- PNG images of captured screenshots
- Accessible via the screenshot name specified during capture
Key Features
- Cloud-based browser automation
- No local browser installation required
- Console log monitoring
- Screenshot capabilities
- JavaScript execution
- Basic web interaction (navigation, clicking, form filling)
Configuration to use BrowserCat MCP Server
Environment Variables
The BrowserCat MCP server requires the following environment variable:
BROWSERCAT_API_KEY: Your BrowserCat API key (required). You can get one for free at https://browsercat.xyz/mcp.
NPX Configuration
{
"mcpServers": {
"browsercat": {
"command": "npx",
"args": ["-y", "@browsercatco/mcp-server"],
"env": {
"BROWSERCAT_API_KEY": "your-api-key-here"
}
}
}
}
License
This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
Alternatives
Related Skills
Browse all skillsBrowser automation, debugging, and performance analysis using Puppeteer CLI scripts. Use for automating browsers, taking screenshots, analyzing performance, monitoring network traffic, web scraping, form automation, and JavaScript debugging.
Lightweight Chrome automation toolkit with shared configuration, JSON-first output, and six focused scripts for starting, navigating, inspecting, capturing, evaluating, and cleaning up browser sessions.
Guide users through browser automation setup using Chrome DevTools MCP as the primary path and the OpenCode browser extension as a fallback. Use when the user asks to set up browser automation, Chrome DevTools MCP, browser MCP, browser extension, or runs the browser-setup command.
Comprehensive CrewAI framework guide for building collaborative AI agent teams and structured workflows. Use when developing multi-agent systems with CrewAI, creating autonomous AI crews, orchestrating flows, implementing agents with roles and tools, or building production-ready AI automation. Essential for developers building intelligent agent systems, task automation, and complex AI workflows.
This skill should be used for browser automation tasks using Chrome DevTools Protocol (CDP). Triggers when users need to launch Chrome with remote debugging, navigate pages, execute JavaScript in browser context, capture screenshots, or interactively select DOM elements. No MCP server required.
CCXT cryptocurrency exchange library for TypeScript and JavaScript developers (Node.js and browser). Covers both REST API (standard) and WebSocket API (real-time). Helps install CCXT, connect to exchanges, fetch market data, place orders, stream live tickers/orderbooks, handle authentication, and manage errors. Use when working with crypto exchanges in TypeScript/JavaScript projects, trading bots, arbitrage systems, or portfolio management tools. Includes both REST and WebSocket examples.