MCP cover image
See in Github
2025-03-30

Claude和Gemini 2.5等AI模型的浏览器自动化MCP服务器,通过自然语言启用Web浏览功能

1

Github Watches

0

Github Forks

0

Github Stars

Browser-use-claude-mcp

A browser automation MCP server for AI models like Claude and Gemini 2.5, enabling web browsing capabilities through natural language.

Overview

This project implements a Model Context Protocol (MCP) server that provides browser automation capabilities to AI models. It allows AI assistants to browse the web, interact with websites, and extract information using natural language commands.

Key Features

🌐 Browser Automation Features

  • Full browser automation (navigation, form filling, clicking, etc.)
  • Web search capabilities
  • Screenshot capture for visual understanding
  • Content extraction and analysis

🤖 AI Features

  • Support for multiple AI providers:
    • Google Gemini 2.5 (primary focus)
    • Anthropic Claude
    • OpenAI
  • Image analysis (vision) capabilities
  • AI-powered content analysis

🔧 Technical Features

  • Written in TypeScript for maximum reliability
  • Modular architecture with clean separation of concerns
  • Comprehensive logging and error handling
  • Easy configuration through environment variables

Available Tools

Tool Name Description
browse_webpage Navigate to a URL and extract its content
search_web Perform a web search and return results
take_screenshot Capture a screenshot of the current page
click_element Click on an element by text or selector
fill_form Fill out form fields with provided values
extract_content Extract specific content from a webpage
analyze_content AI-powered analysis of webpage content

Getting Started

See INSTALL.md for detailed installation and setup instructions.

Quick Start

  1. Clone the repository

    git clone https://github.com/jasondsmith72/Browser-use-claude-mcp.git
    cd Browser-use-claude-mcp
    
  2. Install dependencies

    npm install
    
  3. Create a .env file (use .env.example as a template)

    cp .env.example .env
    
  4. Build the project

    npm run build
    
  5. Start the server

    npm start
    

Configuration

The server can be configured through environment variables in your .env file:

# Browser configuration
CHROME_PATH=
CHROME_USER_DATA=
CHROME_DEBUGGING_PORT=9222

# AI provider (GEMINI, ANTHROPIC, OPENAI)
MCP_MODEL_PROVIDER=GEMINI

# API keys (use the one for your chosen provider)
GOOGLE_API_KEY=your_google_api_key_here
ANTHROPIC_API_KEY=your_anthropic_api_key_here
OPENAI_API_KEY=your_openai_api_key_here

Using with Claude Desktop

  1. Locate the Claude Desktop configuration file:

    • Windows: %APPDATA%/Claude/claude_desktop_config.json
    • MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  2. Add this MCP server to your configuration:

    {
      "mcpServers": {
        "browser-use-claude-mcp": {
          "command": "node",
          "args": [
            "/path/to/Browser-use-claude-mcp/dist/index.js"
          ],
          "env": {
            "CHROME_PATH": "",
            "CHROME_USER_DATA": "",
            "MCP_MODEL_PROVIDER": "GEMINI",
            "GOOGLE_API_KEY": "your_google_api_key_here"
          }
        }
      }
    }
    
  3. Restart Claude Desktop for the changes to take effect.

Examples

Basic Web Browsing

browse_webpage(url="https://example.com")

Web Search

search_web(query="best programming languages 2025")

Filling a Form

fill_form(fields={
  "name": "John Doe",
  "email": "john@example.com",
  "message": "Hello world!"
}, submit=true)

AI Content Analysis

analyze_content(
  url="https://en.wikipedia.org/wiki/Artificial_intelligence",
  instructions="Summarize the key developments in AI in the last decade"
)

Development

# Run in development mode
npm run dev

# Run tests
npm test

# Lint code
npm run lint

License

MIT

Credits

This project builds upon the work of browser-use and other MCP server implementations.

相关推荐

  • https://suefel.com
  • Latest advice and best practices for custom GPT development.

  • Yusuf Emre Yeşilyurt
  • I find academic articles and books for research and literature reviews.

  • https://maiplestudio.com
  • Find Exhibitors, Speakers and more

  • Carlos Ferrin
  • Encuentra películas y series en plataformas de streaming.

  • Joshua Armstrong
  • Confidential guide on numerology and astrology, based of GG33 Public information

  • Alexandru Strujac
  • Efficient thumbnail creator for YouTube videos

  • Emmet Halm
  • Converts Figma frames into front-end code for various mobile frameworks.

  • Elijah Ng Shi Yi
  • Advanced software engineer GPT that excels through nailing the basics.

  • lumpenspace
  • Take an adjectivised noun, and create images making it progressively more adjective!

  • Lists Tailwind CSS classes in monospaced font

  • https://zenepic.net
  • Embark on a thrilling diplomatic quest across a galaxy on the brink of war. Navigate complex politics and alien cultures to forge peace and avert catastrophe in this immersive interstellar adventure.

  • apappascs
  • 发现市场上最全面,最新的MCP服务器集合。该存储库充当集中式枢纽,提供了广泛的开源和专有MCP服务器目录,并提供功能,文档链接和贡献者。

  • ShrimpingIt
  • MCP系列GPIO Expander的基于Micropython I2C的操作,源自ADAFRUIT_MCP230XX

  • modelcontextprotocol
  • 模型上下文协议服务器

  • Mintplex-Labs
  • 带有内置抹布,AI代理,无代理构建器,MCP兼容性等的多合一桌面和Docker AI应用程序。

  • ravitemer
  • 一个功能强大的Neovim插件,用于管理MCP(模型上下文协议)服务器

    Reviews

    3 (1)
    Avatar
    user_dvVq4d9t
    2025-04-16

    Browser-use-claude-mcp is a fantastic tool for integrating AI into your browsing experience. Created by jasondsmith72, this application simplifies the usage of Claude for browser activities in a seamless way. Its user-friendly approach and easy installation make it a must-have for anyone looking to enhance their productivity and AI capabilities directly from their browser. Highly recommended!