I craft unique cereal names, stories, and ridiculously cute Cereal Baby images.

browser-automation-server
A Model Context Protocol (MCP) server that provides browser automation capabilities for Claude and other MCP-compatible AI assistants
1
Github Watches
1
Github Forks
0
Github Stars
Browser Automation MCP Server
A Model Context Protocol (MCP) server that provides browser automation capabilities for Claude and other MCP-compatible AI assistants.
Features
- Web Automation: Control web browsers programmatically
- Screenshot Capture: Take screenshots of web pages
- Element Interaction: Click, type, and interact with web elements
- Navigation: Navigate between pages and manage browser state
- Form Filling: Automate form filling and submission
- Data Extraction: Extract data from web pages
Installation
# Clone the repository
git clone https://github.com/samihalawa/browser-automation-server.git
cd browser-automation-server
# Install dependencies
npm install
# Build the server
npm run build
Usage
Starting the Server
npm start
Configuration
Add the server to your MCP configuration:
{
"servers": {
"browser-automation": {
"command": "/path/to/node",
"args": ["/path/to/browser-automation-server/build/index.js"],
"enabled": true,
"port": 3008,
"environment": {
"NODE_PATH": "/path/to/node_modules",
"PATH": "/usr/local/bin:/usr/bin:/bin"
}
}
}
}
Available Tools
navigate
Navigate to a URL.
Parameters:
-
url
(string, required): URL to navigate to -
waitUntil
(string, optional): Navigation wait condition. Options: 'load', 'domcontentloaded', 'networkidle'. Default: 'load'
screenshot
Take a screenshot of the current page.
Parameters:
-
fullPage
(boolean, optional): Whether to capture full page or just viewport. Default: false -
path
(string, optional): Path to save the screenshot to. If not provided, returns base64 encoded image
click
Click on an element.
Parameters:
-
selector
(string, required): CSS selector of the element to click -
waitForSelector
(boolean, optional): Whether to wait for the selector to appear. Default: true
type
Type text into an input field.
Parameters:
-
selector
(string, required): CSS selector of the input element -
text
(string, required): Text to type -
delay
(number, optional): Delay between keystrokes in milliseconds. Default: 0
extract
Extract data from the page.
Parameters:
-
selector
(string, required): CSS selector of the elements to extract -
attribute
(string, optional): Attribute to extract. If not provided, extracts text content
evaluate
Evaluate JavaScript in the browser context.
Parameters:
-
script
(string, required): JavaScript code to evaluate -
args
(array, optional): Arguments to pass to the script
Example Usage
-
Navigate to a website:
navigate(url: "https://example.com")
-
Take a screenshot:
screenshot(fullPage: true)
-
Click a button:
click(selector: "#submit-button")
-
Fill a form:
type(selector: "#username", text: "user123") type(selector: "#password", text: "password123") click(selector: "#login-button")
-
Extract data:
extract(selector: ".product-title", attribute: "innerText")
Requirements
- Node.js 14+
- Playwright for browser automation
License
MIT
相关推荐
I find academic articles and books for research and literature reviews.
Evaluator for marketplace product descriptions, checks for relevancy and keyword stuffing.
Confidential guide on numerology and astrology, based of GG33 Public information
Emulating Dr. Jordan B. Peterson's style in providing life advice and insights.
Your go-to expert in the Rust ecosystem, specializing in precise code interpretation, up-to-date crate version checking, and in-depth source code analysis. I offer accurate, context-aware insights for all your Rust programming questions.
Advanced software engineer GPT that excels through nailing the basics.
Converts Figma frames into front-end code for various mobile frameworks.
Discover the most comprehensive and up-to-date collection of MCP servers in the market. This repository serves as a centralized hub, offering an extensive catalog of open-source and proprietary MCP servers, complete with features, documentation links, and contributors.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Micropython I2C-based manipulation of the MCP series GPIO expander, derived from Adafruit_MCP230xx
Mirror ofhttps://github.com/agentience/practices_mcp_server
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
A unified API gateway for integrating multiple etherscan-like blockchain explorer APIs with Model Context Protocol (MCP) support for AI assistants.
Reviews

user_RmlJHcoD
Browser-automation-server by samihalawa is an exceptional tool for anyone needing efficient browser automation. Its clear documentation and robust functionality make it incredibly user-friendly. The flexibility and ease of setting up tasks with this server have significantly improved my workflow. Highly recommend for both beginners and advanced users!