I craft unique cereal names, stories, and ridiculously cute Cereal Baby images.

Crawl4ai-mcp
Crawl4AI-MCP: Ein leistungsstarker Server für Web-Crawling und Content Analysis, der gezielte Web-Scraping mit Claude AI-Verarbeitung kombiniert. Extrahieren Sie Erkenntnisse aus bestimmten Websites mit anpassbaren Tiefen-, Inhalts- und KI -Analysefunktionen.
3 years
Works with Finder
1
Github Watches
6
Github Forks
26
Github Stars
*** THIS IS NOT A FUNCTIONAL MCP ***
- IT WAS A LEARNING TOOL I WOULD COME BACK TO WHEN I HAD SPARE TIME
Crawl4AI MCP Server
An MCP (Model-Controller-Processor) Server for intelligent web crawling and AI-powered content analysis. This server provides a simple API for crawling websites and processing the content using Claude AI models.
Who Benefits from Crawl4AI?
Crawl4AI is designed for individuals and organizations who need targeted, in-depth analysis of specific web content. Unlike general search engines or AI assistants that provide broad coverage, Crawl4AI offers deeper insights into content you specifically want to analyze.
Ideal for:
- Researchers who need to extract structured information from specific websites or academic resources
- Content creators looking to analyze competitor content or industry trends within specific domains
- Data analysts who need to process web data for business intelligence purposes
- Developers building applications that require web content analysis capabilities
- Digital marketers analyzing industry websites, blogs, or competitor content
- Business analysts gathering industry-specific information from multiple sources
- Knowledge workers who need to synthesize information from specific web domains
How Users Benefit from Crawl4AI
The Crawl4AI MCP server provides significant advantages over general-purpose search and AI tools:
- Targeted depth over breadth: Instead of broad surface-level results across the entire web, get comprehensive analysis of specific websites that matter to you
- Customizable crawling parameters: Control exactly how deep to crawl, what content to extract, and how to process it
- Programmatic integration: Easily incorporate web content analysis into your own applications, workflows, and data pipelines
- Flexible AI processing: Apply different analytical approaches to the same content - summarize, extract facts, deep analysis, or generate questions
- Privacy and control: Keep sensitive searches and analyses private by running the server locally
- Cost efficiency: Use your own Claude API key with precise control over token usage and processing costs
- Automation potential: Schedule regular crawls and analyses of important websites to track changes over time
- Customized AI prompting: Tailor the AI analysis specifically to your needs with customized prompting
- Content transformation: Turn unstructured web content into structured, actionable information
Crawl4AI bridges the gap between simple web scraping and sophisticated AI analysis, enabling more targeted and meaningful extraction of insights from the web.
Features
- Web crawling with customizable depth and content selectors
- Respects robots.txt directives
- Content extraction and processing
- AI-powered analysis of crawled content using Claude models
- Simple REST API
- Configurable via command line or environment variables
- Detailed logging
Installation
-
Clone this repository:
git clone https://github.com/yourusername/crawl4ai-mcp.git cd crawl4ai-mcp
-
Install dependencies:
npm install
-
Create a
.env
file with your Anthropic API key:ANTHROPIC_API_KEY=your_api_key_here
Usage
Starting the Server
Start the server with default settings:
npm start
Or use command-line options:
npm start -- --port 4000 --debug
Available options:
-
--port <number>
: Port to run the server on (default: 3000) -
--debug
: Enable debug logging
API Endpoints
Crawl a Website
POST /api/crawl
Request body:
{
"url": "https://example.com",
"depth": 2,
"selector": "main",
"aiProcessing": {
"task": "summarize",
"model": "claude-3-sonnet-20240229"
}
}
Parameters:
-
url
(required): The URL to start crawling from -
depth
(optional): How many levels deep to crawl (default: 1) -
selector
(optional): CSS selector for content extraction (default: "body") -
aiProcessing
(optional): Configuration for AI processing-
task
: Type of processing (summarize, extract, analyze, questions) -
model
: Claude model to use (default: "claude-3-sonnet-20240229")
-
Health Check
GET /api/healthcheck
Returns server status and version information.
AI Processing Tasks
The server supports several AI processing tasks:
-
summarize
: Create a comprehensive summary of the crawled content -
extract
: Extract factual information from the content -
analyze
: Perform deep analysis of the content, arguments, and quality -
questions
: Generate important questions and answers based on the content
Configuration
You can configure the server using environment variables:
-
PORT
: Server port (default: 3000) -
ANTHROPIC_API_KEY
: Your Anthropic API key for Claude -
DEBUG
: Set to "true" to enable debug logging
Example
Crawl a website and summarize its content:
curl -X POST http://localhost:3000/api/crawl \
-H "Content-Type: application/json" \
-d '{
"url": "https://example.com",
"depth": 1,
"aiProcessing": {
"task": "summarize"
}
}'
License
MIT License
Acknowledgements
This project uses the following libraries:
相关推荐
Confidential guide on numerology and astrology, based of GG33 Public information
Converts Figma frames into front-end code for various mobile frameworks.
Advanced software engineer GPT that excels through nailing the basics.
A world class elite tech co-founder entrepreneur, expert in software development, entrepreneurship, marketing, coaching style leadership and aligned with ambition for excellence, global market penetration and worldy perspectives.
A medical specialist offering assistance grounded in clinical guidelines. Disclaimer: This is intended for research and is NOT safe for clinical use!
A geek-themed horoscope generator blending Bitcoin prices, tech jargon, and astrological whimsy.
Entdecken Sie die umfassendste und aktuellste Sammlung von MCP-Servern auf dem Markt. Dieses Repository dient als zentraler Hub und bietet einen umfangreichen Katalog von Open-Source- und Proprietary MCP-Servern mit Funktionen, Dokumentationslinks und Mitwirkenden.
Ein einheitliches API-Gateway zur Integration mehrerer Ethercan-ähnlicher Blockchain-Explorer-APIs mit Modellkontextprotokoll (MCP) für AI-Assistenten.
Mirror ofhttps: //github.com/suhail-ak-s/mcp-typense-server
本项目是一个钉钉 MCP (Message Connector Protocol )服务 , 提供了与钉钉企业应用交互的 api 接口。项目基于 Go 语言开发 , 支持员工信息查询和消息发送等功能。
Reviews

user_l9mOiMPh
Crawl4AI-MCP by Vistiqx is an exceptional tool for AI-driven web crawling. Its powerful capabilities and user-friendly interface make it a game-changer for developers. Easy to integrate and efficient in fetching data, it transforms web crawling tasks into a seamless experience. Highly recommend! Check it out at https://github.com/Vistiqx/Crawl4AI-MCP.