Cover image
Try Now
2025-04-02

A multimodal mcp server

3 years

Works with Finder

2

Github Watches

2

Github Forks

1

Github Stars

Multimodal Model Context Protocol Server

Python Version License Status

This repository contains a collection of server implementations for Pixeltable, designed to handle multimodal data indexing and querying (audio, video, images, and documents). These services are orchestrated using Docker for local development.

🚀 Available Servers

Audio Index Server

Located in servers/audio-index/, this server provides:

  • Audio file indexing with transcription capabilities
  • Semantic search over audio content
  • Multi-index support for audio collections
  • Accessible at /audio endpoint

Video Index Server

Located in servers/video-index/, this server provides:

  • Video file indexing with frame extraction
  • Content-based video search
  • Accessible at /video endpoint

Image Index Server

Located in servers/image-index/, this server provides:

  • Image indexing with object detection
  • Similarity search for images
  • Accessible at /image endpoint

Document Index Server

Located in servers/doc-index/, this server provides:

  • Document indexing with text extraction
  • Retrieval-Augmented Generation (RAG) support
  • Accessible at /doc endpoint

Base SDK Server

Located in servers/base-sdk/, this server provides:

  • Core functionality for Pixeltable integration
  • Foundation for building specialized servers

📦 Installation

Local Development

pip install pixeltable
git clone https://github.com/pixeltable/mcp-server-pixeltable.git

cd mcp-server-pixeltable/servers

docker-compose up --build                 # Run locally with docker-compose
docker-compose down                       # Take down resources

🔧 Configuration

  • Each service runs on its designated port (8080 for audio, 8081 for video, 8082 for image, 8083 for doc).
  • Configure service settings in the respective Dockerfile or through environment variables.

🔗 Links

📞 Support

📜 License

This project is licensed under the Apache 2.0 License.

相关推荐

  • NiKole Maxwell
  • I craft unique cereal names, stories, and ridiculously cute Cereal Baby images.

  • Bora Yalcin
  • Evaluator for marketplace product descriptions, checks for relevancy and keyword stuffing.

  • Joshua Armstrong
  • Confidential guide on numerology and astrology, based of GG33 Public information

  • https://suefel.com
  • Latest advice and best practices for custom GPT development.

  • Callycode Limited
  • A geek-themed horoscope generator blending Bitcoin prices, tech jargon, and astrological whimsy.

  • Emmet Halm
  • Converts Figma frames into front-end code for various mobile frameworks.

  • Khalid kalib
  • Write professional emails

  • Elijah Ng Shi Yi
  • Advanced software engineer GPT that excels through nailing the basics.

  • https://maiplestudio.com
  • Find Exhibitors, Speakers and more

  • Lists Tailwind CSS classes in monospaced font

  • apappascs
  • Discover the most comprehensive and up-to-date collection of MCP servers in the market. This repository serves as a centralized hub, offering an extensive catalog of open-source and proprietary MCP servers, complete with features, documentation links, and contributors.

  • ShrimpingIt
  • Micropython I2C-based manipulation of the MCP series GPIO expander, derived from Adafruit_MCP230xx

  • OffchainLabs
  • Go implementation of Ethereum proof of stake

  • huahuayu
  • A unified API gateway for integrating multiple etherscan-like blockchain explorer APIs with Model Context Protocol (MCP) support for AI assistants.

  • deemkeen
  • control your mbot2 with a power combo: mqtt+mcp+llm

    Reviews

    3 (1)
    Avatar
    user_E6XpYwC7
    2025-04-15

    The AI-Powered Server-Client Computer Use SDK by dell-deep is a game-changer! Its seamless integration has streamlined my development process, making it more efficient and effective. The intuitive design and extensive documentation are outstanding. Highly recommend!