2025-04-13

Minimax MCP Tools


A Model Context Protocol (MCP) server implementation with Minimax API integration for AI-powered image generation and text-to-speech functionality.


Features

  • Image Generation: Generate high-quality images based on text prompts using Minimax's image-01 model
  • Text-to-Speech (TTS): Convert text to natural-sounding speech with various voice options, emotions, and audio formats

Setup

Prerequisites

  • Node.js 16 or higher
  • A Minimax API key (obtain from Minimax Platform)
  • A Minimax Group ID, used for TTS functionality

Configuration

Create or update your MCP configuration file:

{
  "mcpServers": {
    "minimax-mcp-tools": {
      "command": "npx",
      "args": [
        "minimax-mcp-tools"
      ],
      "env": {
        "MINIMAX_API_KEY": "your-minimax-api-key",
        "MINIMAX_GROUP_ID": "your-minimax-group-id"
      }
    }
  }
}
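
If the configuration file already lists other servers, the entry above can be merged in rather than overwriting the file. The following is a minimal sketch of that merge; `withMinimaxServer` and the two interfaces are hypothetical names introduced here for illustration and are not part of minimax-mcp-tools.

```typescript
// Illustrative types mirroring the JSON structure shown above.
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

interface McpConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// Hypothetical helper: returns a copy of the config with the
// "minimax-mcp-tools" entry added, leaving other servers untouched.
function withMinimaxServer(existing: McpConfig, apiKey: string, groupId: string): McpConfig {
  if (apiKey.trim() === "") {
    throw new Error("MINIMAX_API_KEY must not be empty");
  }
  return {
    ...existing,
    mcpServers: {
      ...existing.mcpServers,
      "minimax-mcp-tools": {
        command: "npx",
        args: ["minimax-mcp-tools"],
        env: { MINIMAX_API_KEY: apiKey, MINIMAX_GROUP_ID: groupId },
      },
    },
  };
}

const merged = withMinimaxServer(
  { mcpServers: {} },
  "your-minimax-api-key",
  "your-minimax-group-id",
);
console.log(JSON.stringify(merged, null, 2));
```

The printed JSON can then be saved to wherever your MCP client expects its configuration.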

MCP Interface

Image Generation

Generate images based on text prompts. Example parameters:

{
  "prompt": "A mountain landscape at sunset",
  "aspectRatio": "16:9",
  "n": 1,
  "outputFile": "/absolute/path/to/image.jpg",
  "subjectReference": "/path/to/reference.jpg"
}

Parameters:

  • prompt (required): Description of the image to generate
  • outputFile (required): Absolute path to save the generated image file. The directory must already exist. When generating multiple images (n>1), files will be named with sequential numbers (e.g., 'image-1.jpg', 'image-2.jpg').
  • aspectRatio (optional): Aspect ratio of the image (default: "1:1", options: "1:1", "16:9", "4:3", "3:2", "2:3", "3:4", "9:16", "21:9")
  • n (optional): Number of images to generate (default: 1, range: 1-9). When n>1, the output filenames will be automatically numbered.
  • subjectReference (optional): Path to a local image file or a public URL for character reference. When provided, the generated image will use this as a reference for character appearance. Supported formats: JPG, JPEG, PNG
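
The constraints in the list above can be checked on the client side before calling the tool. The sketch below is illustrative only: `validateImageParams` and `numberedOutputPaths` are hypothetical helpers, and the numbering scheme is inferred from the 'image-1.jpg' example, so the server's actual file naming may differ.

```typescript
const ASPECT_RATIOS = ["1:1", "16:9", "4:3", "3:2", "2:3", "3:4", "9:16", "21:9"];

interface ImageParams {
  prompt: string;
  outputFile: string;
  aspectRatio?: string;
  n?: number;
  subjectReference?: string;
}

// Hypothetical validator: returns a list of problems, empty when the input looks valid.
function validateImageParams(p: ImageParams): string[] {
  const problems: string[] = [];
  const n = p.n ?? 1;
  if (p.prompt.trim() === "") problems.push("prompt is required");
  // Accept POSIX ("/...") and Windows drive ("C:\...") absolute paths.
  if (!/^(\/|[A-Za-z]:[\\/])/.test(p.outputFile)) {
    problems.push("outputFile must be an absolute path");
  }
  if (p.aspectRatio !== undefined && !ASPECT_RATIOS.includes(p.aspectRatio)) {
    problems.push("unsupported aspectRatio");
  }
  if (!Number.isInteger(n) || n < 1 || n > 9) {
    problems.push("n must be an integer from 1 to 9");
  }
  return problems;
}

// Assumed naming scheme for n > 1: insert "-<index>" before the file extension.
function numberedOutputPaths(outputFile: string, n: number): string[] {
  if (n <= 1) return [outputFile];
  const slash = Math.max(outputFile.lastIndexOf("/"), outputFile.lastIndexOf("\\"));
  const dot = outputFile.lastIndexOf(".");
  const hasExtension = dot > slash + 1;
  const stem = hasExtension ? outputFile.slice(0, dot) : outputFile;
  const extension = hasExtension ? outputFile.slice(dot) : "";
  const paths: string[] = [];
  for (let i = 1; i <= n; i++) {
    paths.push(`${stem}-${i}${extension}`);
  }
  return paths;
}

console.log(validateImageParams({ prompt: "A mountain landscape at sunset", outputFile: "image.jpg", n: 10 }));
console.log(numberedOutputPaths("/absolute/path/to/image.jpg", 2));
```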

Text-to-Speech

Convert text to speech with various customization options. Example parameters:

{
  "text": "Hello, this is a test of the text-to-speech functionality.",
  "model": "speech-02-hd",
  "voiceId": "female-shaonv",
  "speed": 1.0,
  "volume": 1.0,
  "pitch": 0,
  "emotion": "happy",
  "format": "mp3",
  "outputFile": "/absolute/path/to/audio.mp3",
  "subtitleEnable": true
}

Basic Parameters:

  • text (required): Text to convert to speech (max 10,000 characters)
  • outputFile (required): Absolute path to save the generated audio file
  • model (optional): Model version to use (default: "speech-02-hd", options: "speech-02-hd", "speech-02-turbo")
    • speech-02-hd: High-definition model with excellent timbre similarity, rhythm stability, and studio-grade audio quality
    • speech-02-turbo: Fast, low-latency model with enhanced multilingual capabilities
  • voiceId (optional): Voice ID to use (default: "male-qn-qingse")
  • speed (optional): Speech speed (default: 1.0, range: 0.5-2.0)
  • volume (optional): Speech volume (default: 1.0, range: 0.1-10.0)
  • pitch (optional): Speech pitch (default: 0, range: -12 to 12)
  • emotion (optional): Emotion of the speech (default: "neutral", options: "happy", "sad", "angry", "fearful", "disgusted", "surprised", "neutral")
  • timberWeights (optional): Voice mixing settings, allows mixing up to 4 different voices with weights
    "timberWeights": [
      { "voice_id": "male-qn-qingse", "weight": 70 },
      { "voice_id": "female-shaonv", "weight": 30 }
    ]
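
The numeric ranges and the four-voice limit listed above lend themselves to a quick pre-flight check. This is a sketch under the assumption that out-of-range values should be rejected client-side; `validateTtsParams` is a hypothetical helper, and the server may apply its own rules (for example, on how weights are normalized).

```typescript
// Illustrative types; field names follow the parameter list above.
interface VoiceWeight {
  voice_id: string;
  weight: number;
}

interface TtsBasicParams {
  text: string;
  outputFile: string;
  speed?: number;
  volume?: number;
  pitch?: number;
  timberWeights?: VoiceWeight[];
}

// True when the value is absent (the default applies) or lies within [min, max].
function withinRange(value: number | undefined, min: number, max: number): boolean {
  return value === undefined || (value >= min && value <= max);
}

// Hypothetical validator: returns a list of problems, empty when the input looks valid.
function validateTtsParams(p: TtsBasicParams): string[] {
  const problems: string[] = [];
  // String.length counts UTF-16 code units, which only approximates "characters".
  if (p.text.length === 0 || p.text.length > 10000) {
    problems.push("text must contain 1 to 10,000 characters");
  }
  if (!withinRange(p.speed, 0.5, 2.0)) problems.push("speed must be between 0.5 and 2.0");
  if (!withinRange(p.volume, 0.1, 10.0)) problems.push("volume must be between 0.1 and 10.0");
  if (!withinRange(p.pitch, -12, 12)) problems.push("pitch must be between -12 and 12");
  if (p.timberWeights !== undefined && p.timberWeights.length > 4) {
    problems.push("timberWeights can mix at most 4 voices");
  }
  return problems;
}

console.log(
  validateTtsParams({ text: "Hello", outputFile: "/absolute/path/to/audio.mp3", speed: 3 }),
);
```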
    

Audio Settings:

  • format (optional): Audio format (default: "mp3", options: "mp3", "pcm", "flac", "wav")
  • sampleRate (optional): Sample rate in Hz (default: 32000, options: 8000, 16000, 22050, 24000, 32000, 44100)
  • bitrate (optional): Bitrate for MP3 format (default: 128000, options: 32000, 64000, 128000, 256000)
  • channel (optional): Number of audio channels (default: 1, options: 1=mono, 2=stereo)
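
To make the defaults above concrete, here is a minimal sketch that fills in unspecified audio settings and rejects unsupported values. `resolveAudioSettings` is a hypothetical helper, and omitting `bitrate` for non-MP3 formats is an assumption based on the parameter being described as MP3-specific.

```typescript
const FORMATS = ["mp3", "pcm", "flac", "wav"];
const SAMPLE_RATES = [8000, 16000, 22050, 24000, 32000, 44100];
const BITRATES = [32000, 64000, 128000, 256000];

interface AudioSettings {
  format: string;
  sampleRate: number;
  bitrate?: number;
  channel: number;
}

// Hypothetical helper: applies the documented defaults and validates the result.
function resolveAudioSettings(input: Partial<AudioSettings> = {}): AudioSettings {
  const format = input.format ?? "mp3";
  const sampleRate = input.sampleRate ?? 32000;
  const channel = input.channel ?? 1;

  if (!FORMATS.includes(format)) throw new Error(`Unsupported format: ${format}`);
  if (!SAMPLE_RATES.includes(sampleRate)) throw new Error(`Unsupported sampleRate: ${sampleRate}`);
  if (channel !== 1 && channel !== 2) throw new Error("channel must be 1 (mono) or 2 (stereo)");

  const settings: AudioSettings = { format, sampleRate, channel };
  // Assumption: bitrate only matters for MP3 output, so it is left out otherwise.
  if (format === "mp3") {
    const bitrate = input.bitrate ?? 128000;
    if (!BITRATES.includes(bitrate)) throw new Error(`Unsupported bitrate: ${bitrate}`);
    settings.bitrate = bitrate;
  }
  return settings;
}

console.log(resolveAudioSettings());
console.log(resolveAudioSettings({ format: "wav", sampleRate: 44100, channel: 2 }));
```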

Advanced Features:

  • latexRead (optional): Whether to read LaTeX formulas (default: false)
  • pronunciationDict (optional): List of pronunciation replacements
    "pronunciationDict": ["处理/(chu3)(li3)", "危险/dangerous"]
    
  • stream (optional): Whether to use streaming mode (default: false)
  • languageBoost (optional): Enhance recognition of specific languages
    • Options: "Chinese", "Chinese,Yue", "English", "Arabic", "Russian", "Spanish", "French", "Portuguese", "German", "Turkish", "Dutch", "Ukrainian", "Vietnamese", "Indonesian", "Japanese", "Italian", "Korean", "Thai", "Polish", "Romanian", "Greek", "Czech", "Finnish", "Hindi", "auto"
  • subtitleEnable (optional): Whether to enable subtitle generation (default: false)
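
Judging from the example above, each pronunciationDict entry is a "word/replacement" string. A small sketch for building such entries from a map follows; `buildPronunciationDict` is a hypothetical helper, and rejecting words that contain "/" is an assumption made here to keep entries unambiguous.

```typescript
// Hypothetical helper: turns a word -> replacement map into "word/replacement" entries.
function buildPronunciationDict(replacements: Record<string, string>): string[] {
  const entries: string[] = [];
  for (const [word, replacement] of Object.entries(replacements)) {
    // Assumption: "/" is the separator, so it must not appear in the word itself.
    if (word === "" || replacement === "" || word.includes("/")) {
      throw new Error(`Invalid pronunciation entry for "${word}"`);
    }
    entries.push(`${word}/${replacement}`);
  }
  return entries;
}

const pronunciationDict = buildPronunciationDict({
  "处理": "(chu3)(li3)",
  "危险": "dangerous",
});
console.log(pronunciationDict);
```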

License

MIT

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Acknowledgements

