2025-03-14

Audio MCP Server

An MCP (Model Context Protocol) server that provides audio input/output capabilities for AI assistants like Claude. This server enables Claude to interact with your computer's audio system, including recording from microphones and playing audio through speakers.

Features

  • List Audio Devices: View all available microphones and speakers on your system
  • Record Audio: Capture audio from any microphone with customizable duration and quality
  • Playback Recordings: Play back your most recent recording
  • Audio File Playback: Play audio files through your speakers
  • Text-to-Speech: (Placeholder for future implementation)

Requirements

  • Python 3.8 or higher
  • Audio input/output devices on your system

Installation

  1. Clone this repository or download the files to your computer:
git clone https://github.com/GongRzhe/Audio-MCP-Server.git
cd Audio-MCP-Server
  2. Create a virtual environment and install dependencies:
# Windows
python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt

# macOS/Linux
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
  3. Or use the included setup script to automate installation:
python setup_mcp.py

Configuration

Claude Desktop Configuration

To use this server with Claude Desktop, add the following to your Claude Desktop configuration file:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "audio-interface": {
      "command": "/path/to/your/.venv/bin/python",
      "args": [
        "/path/to/your/audio_server.py"
      ],
      "env": {
        "PYTHONPATH": "/path/to/your/audio-mcp-server"
      }
    }
  }
}

Replace the paths with the actual paths on your system. The setup script will generate this configuration for you.
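
For readers who prefer to fill in the paths programmatically, the following is a minimal sketch of how such an entry could be assembled. It is not the code of setup_mcp.py; the function names build_claude_config and render_config are hypothetical, and the sketch assumes the virtual environment lives in .venv inside the project directory and that the interpreter is bin/python on macOS/Linux and Scripts\python.exe on Windows.

```python
import json
from pathlib import PurePosixPath, PureWindowsPath


def build_claude_config(python_path: str, server_script: str, project_dir: str) -> dict:
    """Return a config dict with the same shape as the JSON example.

    Hypothetical helper: the three arguments are supplied by the caller
    and are not checked against the file system.
    """
    return {
        "mcpServers": {
            "audio-interface": {
                "command": python_path,
                "args": [server_script],
                "env": {"PYTHONPATH": project_dir},
            }
        }
    }


def render_config(project_dir: str, windows: bool = False) -> str:
    """Derive interpreter and script paths from the project directory.

    Assumes a .venv folder inside the project (as created in the
    installation steps) and a server script named audio_server.py.
    """
    if windows:
        root = PureWindowsPath(project_dir)
        python_path = root / ".venv" / "Scripts" / "python.exe"
    else:
        root = PurePosixPath(project_dir)
        python_path = root / ".venv" / "bin" / "python"
    config = build_claude_config(
        str(python_path), str(root / "audio_server.py"), str(root)
    )
    return json.dumps(config, indent=2)


if __name__ == "__main__":
    # Example project location; replace with your own checkout path.
    print(render_config("/home/me/Audio-MCP-Server"))
```

The printed JSON can be merged by hand into claude_desktop_config.json. Pure path classes are used so the same script produces Windows-style or POSIX-style paths regardless of the machine it runs on.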

Usage

After setting up the server, restart Claude Desktop. You should see a hammer icon in the input box, indicating that tools are available.

Try asking Claude:

  • "What microphones and speakers are available on my system?"
  • "Record 5 seconds of audio from my microphone."
  • "Play back the audio recording."
  • "Play an audio file from my computer."

Available Tools

list_audio_devices

Lists all available audio input and output devices on your system.
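
As an illustration of what such a listing involves, here is a minimal sketch that separates a device table into inputs and outputs. This README does not state which audio library the server uses, so the sketch works on plain dictionaries. It assumes each record carries name, max_input_channels and max_output_channels fields, which is the shape returned by query_devices() in the sounddevice library. The function name split_devices and the sample data are hypothetical.

```python
from typing import List, Tuple


def split_devices(devices: List[dict]) -> Tuple[List[dict], List[dict]]:
    """Partition device records into (inputs, outputs).

    A device with both input and output channels appears in both lists.
    The list position is used as the device index.
    """
    inputs, outputs = [], []
    for index, dev in enumerate(devices):
        entry = {"index": index, "name": dev["name"]}
        if dev.get("max_input_channels", 0) > 0:
            inputs.append(entry)
        if dev.get("max_output_channels", 0) > 0:
            outputs.append(entry)
    return inputs, outputs


# Made-up sample data standing in for a real device query.
SAMPLE_DEVICES = [
    {"name": "Built-in Microphone", "max_input_channels": 2, "max_output_channels": 0},
    {"name": "Built-in Speakers", "max_input_channels": 0, "max_output_channels": 2},
    {"name": "USB Headset", "max_input_channels": 1, "max_output_channels": 2},
]

if __name__ == "__main__":
    mics, speakers = split_devices(SAMPLE_DEVICES)
    print("Inputs:", mics)
    print("Outputs:", speakers)
```

The index values in the result correspond to what the device_index parameters of the other tools would expect under this assumption.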

record_audio

Records audio from your microphone.

Parameters:

  • duration: Recording duration in seconds (default: 5)
  • sample_rate: Sample rate in Hz (default: 44100)
  • channels: Number of audio channels (default: 1)
  • device_index: Specific input device index to use (default: system default)
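
To make the effect of these parameters concrete, the sketch below models a recording request with the defaults listed above and computes how many frames and bytes it would produce. The RecordRequest class is hypothetical and not part of the server, and the 2-byte sample width (16-bit PCM) is an assumption, since the README does not state the sample format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RecordRequest:
    """Hypothetical container mirroring the documented record_audio parameters."""

    duration: float = 5           # seconds
    sample_rate: int = 44100      # Hz
    channels: int = 1
    device_index: Optional[int] = None  # None stands for the system default

    def validate(self) -> None:
        """Reject values that could not describe a real recording."""
        if self.duration <= 0:
            raise ValueError("duration must be positive")
        if self.sample_rate <= 0:
            raise ValueError("sample_rate must be positive")
        if self.channels < 1:
            raise ValueError("channels must be at least 1")
        if self.device_index is not None and self.device_index < 0:
            raise ValueError("device_index must be non-negative")

    def frame_count(self) -> int:
        """Frames captured: one frame holds one sample per channel."""
        return int(round(self.duration * self.sample_rate))

    def buffer_bytes(self, sample_width: int = 2) -> int:
        """Raw buffer size, assuming sample_width bytes per sample."""
        return self.frame_count() * self.channels * sample_width


if __name__ == "__main__":
    request = RecordRequest()
    request.validate()
    print(request.frame_count(), "frames,", request.buffer_bytes(), "bytes")
```

With the defaults, 5 seconds at 44100 Hz gives 220500 frames, or 441000 bytes of mono 16-bit audio.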

play_latest_recording

Plays back the most recently recorded audio.

play_audio

Placeholder for text-to-speech functionality.

Parameters:

  • text: The text to convert to speech
  • voice: The voice to use (default: "default")

play_audio_file

Plays an audio file through your speakers.

Parameters:

  • file_path: Path to the audio file
  • device_index: Specific output device index to use (default: system default)
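
Before asking Claude to play a file, it can help to confirm that the file is readable and see its basic properties. The sketch below does this for WAV files using only Python's standard wave module. It is an illustration rather than the server's own code, the README does not say which formats play_audio_file accepts, and the helper names describe_wav and write_silent_wav are hypothetical.

```python
import os
import tempfile
import wave


def describe_wav(file_path: str) -> dict:
    """Read the header of a WAV file and report its basic properties."""
    with wave.open(file_path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate": rate,
            "sample_width_bytes": wav.getsampwidth(),
            "duration_seconds": frames / rate,
        }


def write_silent_wav(file_path: str, seconds: float, sample_rate: int = 44100,
                     channels: int = 1) -> None:
    """Write a 16-bit PCM WAV file containing only silence (for testing)."""
    frames = int(seconds * sample_rate)
    with wave.open(file_path, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * frames * channels)


if __name__ == "__main__":
    # Create a throwaway file so the example runs without any real audio.
    with tempfile.TemporaryDirectory() as tmp_dir:
        demo_path = os.path.join(tmp_dir, "silence.wav")
        write_silent_wav(demo_path, seconds=1)
        print(describe_wav(demo_path))
```

If describe_wav raises wave.Error, the file is not a WAV file that this module can read, which is worth ruling out before troubleshooting the output device.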

Troubleshooting

No devices found

If no audio devices are found, check:

  • Your microphone and speakers are properly connected
  • Your operating system recognizes the devices
  • You have the necessary permissions to access audio devices

Playback issues

If audio playback isn't working:

  • Check your volume settings
  • Ensure the correct output device is selected
  • Try restarting the Claude Desktop application

Server connectivity

If Claude can't connect to the server:

  • Verify your configuration paths are correct
  • Ensure Python and all dependencies are installed
  • Check Claude's logs for error messages
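
The first check can be automated. The following is a minimal sketch that reads a Claude Desktop configuration and reports any configured path that does not exist on disk. It assumes, as in the example configuration above, that command and every entry in args are file paths; a bare command name resolved through PATH or a command-line flag would be reported as missing. The function name find_missing_paths is hypothetical.

```python
import json
import os


def find_missing_paths(config_text: str) -> dict:
    """Map each server name to the configured paths that do not exist.

    Assumes "command" and all "args" entries are file system paths.
    Servers whose paths all exist are left out of the result.
    """
    config = json.loads(config_text)
    problems = {}
    for name, server in config.get("mcpServers", {}).items():
        candidates = [server.get("command", "")] + list(server.get("args", []))
        missing = [path for path in candidates if not os.path.exists(path)]
        if missing:
            problems[name] = missing
    return problems


if __name__ == "__main__":
    # The unedited placeholder paths are reported as missing.
    example = json.dumps({
        "mcpServers": {
            "audio-interface": {
                "command": "/path/to/your/.venv/bin/python",
                "args": ["/path/to/your/audio_server.py"],
            }
        }
    })
    print(find_missing_paths(example))
```

To check a real setup, read the contents of claude_desktop_config.json from the location given in the Configuration section and pass the text to find_missing_paths. An empty result means every configured path exists.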

License

MIT

Acknowledgments


Note: This server provides tools that can access your microphone and speakers. Always review and approve tool actions before they execute.
