2025-04-06

MCP Smart Crawler

MCP Smart Crawler is a Model Context Protocol (MCP) server that uses Playwright to crawl web content, extract metadata, and download resources such as videos and images.

Key Features

  • Extract metadata (title, description, images) from Xiaohongshu (小红书) posts.
  • Download videos and images from Xiaohongshu share links.
  • Uses Playwright for browser automation.
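
As a rough illustration of the metadata-extraction step, the sketch below pulls a title, description, and image list out of an HTML string using only the Python standard library. It is not the server's actual implementation: it assumes the rendered page exposes Open Graph meta tags (og:title, og:description, og:image), which the source does not confirm, and the names OpenGraphParser and extract_metadata are hypothetical. In the real server, Playwright would supply the rendered HTML.

```python
from html.parser import HTMLParser


class OpenGraphParser(HTMLParser):
    """Collect <meta property="og:..."> values from an HTML document.

    Assumption: the page carries Open Graph tags. This is illustrative only.
    """

    def __init__(self) -> None:
        super().__init__()
        self.title = None
        self.description = None
        self.images = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        prop, content = attr.get("property"), attr.get("content")
        if not content:
            return
        if prop == "og:title":
            self.title = content
        elif prop == "og:description":
            self.description = content
        elif prop == "og:image":
            # A post can carry several images, so keep all of them in order.
            self.images.append(content)


def extract_metadata(html: str) -> dict:
    """Return title, description, and image URLs found in the HTML string."""
    parser = OpenGraphParser()
    parser.feed(html)
    return {
        "title": parser.title,
        "description": parser.description,
        "images": parser.images,
    }
```

Fields that are missing from the page come back as None (or an empty list for images), so a caller can tell "not present" apart from an empty string.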

How to Use (MCP Client Configuration)

To use this server with your MCP client, add the following configuration to your client's MCP server settings. You will need to adjust the command and args based on how you run the server script.

Example Configuration (the last two arguments, --download-folder and its path, are optional):

{
  "mcpServers": {
    "mcp-smart-crawler": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-smart-crawler",
        "--download-folder",
        "c:\\downloads"
      ]
    }
  }
}
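
Strict JSON allows neither comments nor trailing commas, and Windows paths need their backslashes escaped, so it can be safer to generate the entry than to type it by hand. The following is a minimal sketch of that idea. The helper names build_server_entry and build_config are hypothetical, and the only flag used is the --download-folder option shown in the example configuration.

```python
import json
from typing import Optional


def build_server_entry(download_folder: Optional[str] = None) -> dict:
    """Return the "mcp-smart-crawler" entry for an MCP client config."""
    args = ["-y", "mcp-smart-crawler"]
    if download_folder is not None:
        # The flag and its value are optional and always travel together.
        args += ["--download-folder", download_folder]
    return {"command": "npx", "args": args}


def build_config(download_folder: Optional[str] = None) -> str:
    """Serialize a full client config document as strict JSON."""
    config = {
        "mcpServers": {"mcp-smart-crawler": build_server_entry(download_folder)}
    }
    return json.dumps(config, indent=2)


if __name__ == "__main__":
    # json.dumps escapes the backslash, so the file text reads "c:\\downloads".
    print(build_config("c:\\downloads"))
```

Calling build_config() with no argument leaves out both the flag and the path, which matches the "optional" note in the example configuration.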

Example chat:

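Please look at the content and images of this Xiaohongshu post, and tell me what is in the images.
59 XXX posted a Xiaohongshu note, come take a look! 😆 OfTOBst2PsxctaX 😆 http://xhslink.com/a/xxaabbcczz, copy this message and open the [Xiaohongshu] app to see the full content!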
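
The share text pasted from the app wraps the short link in promotional wording, so a client or tool has to isolate the URL before it can be crawled. The sketch below shows one way to do that with a regular expression. It is an assumption about how such text could be handled, not the server's documented behavior, and extract_share_link is a hypothetical helper name.

```python
import re
from typing import Optional

# Explicit ASCII character class: in Python 3, \w also matches CJK characters
# and could swallow the Chinese text that directly follows the link.
SHARE_LINK = re.compile(r"https?://xhslink\.com/[A-Za-z0-9/_-]+")


def extract_share_link(share_text: str) -> Optional[str]:
    """Return the first xhslink.com short URL in the share text, or None."""
    match = SHARE_LINK.search(share_text)
    return match.group(0) if match else None


if __name__ == "__main__":
    sample = (
        "59 XXX发布了一篇小红书笔记,快来看吧! 😆 OfTOBst2PsxctaX 😆 "
        "http://xhslink.com/a/xxaabbcczz,复制本条信息,打开【小红书】App查看精彩内容!"
    )
    print(extract_share_link(sample))
```

Because the comma is outside the allowed character class, the match stops at the end of the link whether the share text uses an ASCII or a full-width comma.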



    Reviews

    3 (1)
    user_RuF1Cmh8
    2025-04-18

    I recently started using mcp-smart-crawler by loo-y, and I am thoroughly impressed with its performance. The ability to effectively crawl and extract data from multiple sources with ease is a game-changer for my projects. The user-friendly interface and detailed documentation available on their GitHub page make it accessible even for beginners. Highly recommend to anyone looking for a robust web crawling solution! Check it out at https://github.com/loo-y/mcp-smart-crawler.