
Selenium
Automates web browsers through Selenium WebDriver, allowing AI agents to click buttons, fill forms, navigate pages, and interact with websites programmatically.
3731,147 views115Local (stdio)
What it does
- Launch Chrome, Firefox, Edge, or Safari browsers
- Navigate to URLs and click elements on web pages
- Fill forms and type text into input fields
- Extract text content from web page elements
- Perform drag-and-drop and hover interactions
- Execute right-clicks and double-clicks on elements
Best for
AI agents performing web-based tasks and workflowsAutomated testing of web applicationsWeb scraping and data extraction from interactive sitesBrowser-based automation without manual scripting
Works with major browsers including SafariNo manual scripting required - just tell the AI what to do10+ browser interaction tools
Tools (14)
start_browser
launches browser
navigate
navigates to a URL
find_element
finds an element
click_element
clicks an element
send_keys
sends keys to an element, aka typing