2025-03-24

MCP Windows Desktop Automation

A Model Context Protocol (MCP) server for Windows desktop automation using AutoIt.

Overview

This project provides a TypeScript MCP server that wraps the node-autoit-koffi package, so LLM applications can automate Windows desktop tasks over MCP.

The server exposes:

  • Tools: All AutoIt functions as MCP tools
  • Resources: File access and screenshot capabilities
  • Prompts: Templates for common automation tasks

Features

  • Full wrapping of all AutoIt functions as MCP tools
  • Support for both stdio and WebSocket transports
  • File access resources for reading files and directories
  • Screenshot resources for capturing the screen or specific windows
  • Prompt templates for common automation tasks
  • Strict TypeScript typing throughout

Installation

# Clone the repository
git clone https://github.com/yourusername/mcp-windows-desktop-automation.git
cd mcp-windows-desktop-automation

# Install dependencies
npm install

# Build the project
npm run build

Usage

Starting the Server

# Start with stdio transport (default)
npm start

# Start with WebSocket transport
npm start -- --transport=websocket --port=3000

# Enable verbose logging
npm start -- --verbose

Command Line Options

  • --transport=stdio|websocket: Specify the transport protocol (default: stdio)
  • --port=<number>: Specify the port for WebSocket transport (default: 3000)
  • --verbose: Enable verbose logging
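
The options above could be parsed along the following lines. This is only a sketch: `parseArgs` and `ServerOptions` are hypothetical names, and the project's actual argument handling may differ. Only the flags and defaults are taken from the list above.

```typescript
// Hypothetical parser for the documented flags; not the project's actual code.
type Transport = "stdio" | "websocket";

interface ServerOptions {
  transport: Transport;
  port: number;
  verbose: boolean;
}

function parseArgs(argv: string[]): ServerOptions {
  // Defaults as documented: stdio transport, port 3000, quiet logging.
  const options: ServerOptions = { transport: "stdio", port: 3000, verbose: false };

  for (const arg of argv) {
    if (arg === "--verbose") {
      options.verbose = true;
    } else if (arg.startsWith("--transport=")) {
      const value = arg.slice("--transport=".length);
      if (value === "stdio" || value === "websocket") {
        options.transport = value;
      } else {
        throw new Error(`Unsupported transport: ${value}`);
      }
    } else if (arg.startsWith("--port=")) {
      const port = Number(arg.slice("--port=".length));
      if (!Number.isInteger(port) || port < 1 || port > 65535) {
        throw new Error(`Invalid port: ${arg}`);
      }
      options.port = port;
    }
  }
  return options;
}
```

In this sketch, unknown flags are ignored and invalid values throw, so a mistyped transport fails at startup instead of silently falling back to stdio.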

Tools

The server provides tools for:

  • Mouse operations: Move, click, drag, etc.
  • Keyboard operations: Send keystrokes, clipboard operations, etc.
  • Window management: Find, activate, close, resize windows, etc.
  • Control manipulation: Interact with UI controls, buttons, text fields, etc.
  • Process management: Start, stop, and monitor processes
  • System operations: Shutdown, sleep, etc.
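
MCP clients invoke tools by sending a JSON-RPC 2.0 `tools/call` request. The sketch below builds such a request by hand to show its shape. The tool name `mouseMove` and its arguments are assumptions made for illustration, so check the server's `tools/list` response for the real names and schemas.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

let nextId = 1;

// Build a request for the given tool; each call gets a fresh id.
function buildToolCall(name: string, args: Record<string, unknown>): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// "mouseMove" and its parameters are hypothetical, not confirmed by this README.
const request = buildToolCall("mouseMove", { x: 200, y: 150, speed: 10 });

// Over the stdio transport, each message is written as one line of JSON.
const line = JSON.stringify(request) + "\n";
```

In practice an MCP client SDK usually builds and sends this message for you. Writing it out by hand is mainly useful when debugging the traffic.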

Resources

The server provides resources for:

  • File access: Read files and list directories
  • Screenshots: Capture the screen or specific windows
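
Resources are fetched with a `resources/read` request, and the result carries either text or a base64-encoded blob. The sketch below shows one way to tell the two apart. The URIs are hypothetical, since this README does not document the server's URI scheme.

```typescript
// MCP resource contents carry either `text` or a base64 `blob`.
interface TextContents { uri: string; mimeType?: string; text: string }
interface BlobContents { uri: string; mimeType?: string; blob: string }
type ResourceContents = TextContents | BlobContents;

// Build a JSON-RPC 2.0 "resources/read" request for the given URI.
function buildResourceRead(id: number, uri: string) {
  return { jsonrpc: "2.0" as const, id, method: "resources/read" as const, params: { uri } };
}

// Summarise one entry of the `contents` array in a read result.
function describeContents(c: ResourceContents): string {
  if ("text" in c) {
    return `${c.uri}: ${c.text.length} characters of text`;
  }
  // Base64 encodes 3 bytes per 4 characters; trailing "=" marks padding.
  const padding = c.blob.endsWith("==") ? 2 : c.blob.endsWith("=") ? 1 : 0;
  const bytes = (c.blob.length / 4) * 3 - padding;
  return `${c.uri}: ${bytes} bytes of ${c.mimeType ?? "binary"} data`;
}

// Hypothetical URIs; the server's actual resource URIs may look different.
const readFile = buildResourceRead(1, "file:///C:/notes.txt");
const summary = describeContents({ uri: "screenshot://desktop", mimeType: "image/png", blob: "aGVsbG8=" });
```

A screenshot would normally arrive as a blob with an image MIME type, while a file read would typically arrive as text.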

Prompts

The server provides prompt templates for:

  • Window interaction: Find and interact with windows
  • Form filling: Automate form filling tasks
  • Automation tasks: Create scripts for repetitive tasks
  • Monitoring: Wait for specific conditions

Development

# Run in development mode
npm run dev

# Lint the code
npm run lint

# Run tests
npm run test

License

MIT

    Reviews

    user_b2ueskZZ
    2025-04-17

    I've been using mcp-windows-desktop-automation by mario-andreschak and it has significantly improved my productivity. The automation capabilities are robust and user-friendly, making repetitive tasks a breeze. Highly recommend for anyone looking to streamline their Windows desktop operations. Check it out on GitHub!