
arxiv-mcp
MCP Server pour récupérer les articles d'ARXIV en fonction des mots clés
3 years
Works with Finder
0
Github Watches
0
Github Forks
0
Github Stars
ArXiv-MCP: Academic Paper Search for AI Agents
A Model Context Protocol (MCP) server implementation that enables AI agents to search, retrieve, and analyze academic papers from arXiv, the popular open-access repository of electronic preprints.
Overview
This project provides an MCP server that allows AI agents to interact with the arXiv repository, search for papers based on keywords, retrieve detailed information about specific papers, and even extract and analyze the content of papers. It serves as both a practical tool for research assistance and a reference implementation for building MCP servers.
The implementation follows the best practices laid out for building MCP servers, allowing seamless integration with any MCP-compatible client.
Features
The server provides several powerful tools for academic research:
-
search_papers
: Search for papers on arXiv using keywords and get comprehensive summaries -
get_paper_details
: Retrieve detailed information about a specific paper by its arXiv ID -
extract_paper_content
: Download and extract the full text content from a paper's PDF -
analyze_paper
: Analyze a paper's content and generate a comprehensive summary
Prerequisites
- Python 3.11+
- Docker if running the MCP server as a container (recommended)
Installation
Using uv
-
Install uv if you don't have it:
pip install uv
-
Clone this repository:
git clone https://github.com/kelvingao/arxiv-mcp.git cd arxiv-mcp
-
Install dependencies:
uv pip install -e .
-
Create a
.env
file based on.env.example
:cp .env.example .env
-
Configure your environment variables in the
.env
file (see Configuration section)
Using Docker (Recommended)
-
Build the Docker image:
docker build -t mcp/arxiv --build-arg PORT=8050 .
-
Create a
.env
file based on.env.example
and configure your environment variables
Configuration
The following environment variables can be configured in your .env
file:
Variable | Description | Example |
---|---|---|
TRANSPORT |
Transport protocol (sse or stdio) | sse |
HOST |
Host to bind to when using SSE transport | 0.0.0.0 |
PORT |
Port to listen on when using SSE transport | 8050 |
Running the Server
Using uv
SSE Transport
# Set TRANSPORT=sse in .env then:
python src/server.py
The MCP server will run as an API endpoint that you can connect to with the configuration shown below.
Stdio Transport
With stdio, the MCP client itself can spin up the MCP server, so nothing to run at this point.
Using Docker
SSE Transport
docker run --env-file .env -p 8050:8050 mcp/arxiv
The MCP server will run as an API endpoint within the container that you can connect to with the configuration shown below.
Stdio Transport
With stdio, the MCP client itself can spin up the MCP server container, so nothing to run at this point.
Integration with MCP Clients
SSE Configuration
Once you have the server running with SSE transport, you can connect to it using this configuration:
{
"mcpServers": {
"arxiv": {
"transport": "sse",
"url": "http://localhost:8050/sse"
}
}
}
Note for Windsurf users: Use
serverUrl
instead ofurl
in your configuration:{ "mcpServers": { "arxiv": { "transport": "sse", "serverUrl": "http://localhost:8050/sse" } } }
Note for n8n users: Use host.docker.internal instead of localhost since n8n has to reach outside of its own container to the host machine:
So the full URL in the MCP node would be: http://host.docker.internal:8050/sse
Make sure to update the port if you are using a value other than the default 8050.
Python with Stdio Configuration
Add this server to your MCP configuration for Claude Desktop, Windsurf, or any other MCP client:
{
"mcpServers": {
"arxiv": {
"command": "your/path/to/arxiv-mcp/.venv/bin/python",
"args": ["your/path/to/arxiv-mcp/src/main.py"],
"env": {
"TRANSPORT": "stdio"
}
}
}
}
Docker with Stdio Configuration
{
"mcpServers": {
"arxiv": {
"command": "docker",
"args": ["run", "--rm", "-i",
"-e", "TRANSPORT",
"mcp/arxiv"],
"env": {
"TRANSPORT": "stdio"
}
}
}
}
Usage Examples
Here are some examples of how to use the arXiv MCP server with an AI agent:
Searching for Papers
Find recent papers about quantum computing published in the last year.
Getting Paper Details
Get details for the paper with arXiv ID 2303.08774
Extracting Paper Content
Extract the full text from the paper with arXiv ID 2303.08774
Analyzing a Paper
Analyze the methodology section of the paper with arXiv ID 2303.08774
Building Your Own MCP Server
This implementation provides a foundation for building more complex MCP servers. To build your own:
- Add your own tools by creating methods with the
@mcp.tool()
decorator - Create your own lifespan function to add your own dependencies (clients, database connections, etc.)
- Modify the existing tools or add new ones to enhance functionality
- Add prompts and resources with
@mcp.resource()
and@mcp.prompt()
License
Acknowledgements
- arXiv for providing open access to research papers
- The Model Context Protocol team for creating the MCP standard
相关推荐
🔥 1Panel fournit une interface Web intuitive et un serveur MCP pour gérer des sites Web, des fichiers, des conteneurs, des bases de données et des LLM sur un serveur Linux.
⛓️RULEGO est un cadre de moteur de règle d'orchestration des composants de nouvelle génération légère, intégrée, intégrée et de nouvelle génération pour GO.
🧑🚀 全世界最好的 LLM 资料总结 (数据处理、模型训练、模型部署、 O1 模型、 MCP 、小语言模型、视觉语言模型) | Résumé des meilleures ressources LLM du monde.
Flock est une plate-forme à faible code de workflow pour construire rapidement des chatbots, un chiffon et coordonner des équipes multi-agents, alimentée par Langgraph, Langchain, Fastapi et NextJs. (Flock 是一个基于 Workflow 工作流的低代码平台 , 用于快速构建聊天机器人、 Rag 、 Agent 和 和 , 采用 采用 采用 采用 Langgraph 、 Langchain 、 Fastapi 和 ,))
PDF Traduction de papier scientifique avec formats conservés - 基于 AI 完整保留排版的 PDF 文档全文双语翻译 , 支持 Google / Deepl / Olllama / Openai 等服务 , 提供 CLI / GUI / MCP / DOCKER / ZOTERO
Plateforme d'automatisation de workflow à code équitable avec des capacités d'IA natives. Combinez le bâtiment visuel avec du code personnalisé, de l'auto-hôte ou du cloud, 400+ intégrations.
Créez facilement des outils et des agents LLM à l'aide de fonctions Plain Bash / JavaScript / Python.
😎简单易用、🧩丰富生态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信 (企业微信、个人微信) / 飞书 / 钉钉 / Discord / Telegram / Slack 等平台 | 支持 Chatgpt 、 Deepseek 、 Dify 、 Claude 、 GEMINI 、 XAI 、 PPIO 、 OLLAMA 、 LM Studio 、阿里云百炼、火山方舟、 Siliconflow 、 Qwen 、 Moonshot 、 ChatGlm 、 Sillytraven 、 MCP 等 LLM 的机器人 / Agent | Plateforme de bots de messagerie instantanée basés sur LLM, prend en charge Discord, Telegram, WeChat, Lark, Dingtalk, QQ, Slack
Une passerelle basée sur un plugin qui orchestre d'autres MCP et permet aux développeurs de s'appuyer sur des agents de qualité d'entreprise informatiques.
Reviews

user_guuyMRnx
As a dedicated user of arxiv-mcp, I can confidently say this tool is a game-changer for managing arXiv papers. Created by the talented kelvingao, it streamlines the process of fetching and organizing research, making my academic work much more efficient. Highly recommended for researchers looking for a robust and user-friendly solution!

user_bvenMy3y
I recently started using arxiv-mcp by kelvingao, and I must say it's fantastic! The functionality and seamless integration with research papers on arXiv greatly enhance my workflow. It's evident that the author put a lot of thought into making this tool both user-friendly and efficient. Highly recommend it to any researcher or academician looking to streamline their study process!