Cover image
Try Now
2025-03-19

使用DALL-E生成/编辑图像,MCP(模型上下文协议)服务器

3 years

Works with Finder

1

Github Watches

4

Github Forks

2

Github Stars

DALL-E MCP Server

DALL-E MCP Logo

An MCP (Model Context Protocol) server for generating images using OpenAI's DALL-E API.

Features

  • Generate images using DALL-E 2 or DALL-E 3
  • Edit existing images (DALL-E 2 only)
  • Create variations of existing images (DALL-E 2 only)
  • Validate OpenAI API key

Installation

# Clone the repository
git clone https://github.com/Garoth/dalle-mcp.git
cd dalle-mcp

# Install dependencies
npm install

# Build the project
npm run build

Important Note for Cline Users

When using this DALL-E MCP server with Cline, it's recommended to save generated images in your current workspace directory by setting the saveDir parameter to match your current working directory. This ensures Cline can properly locate and display the generated images in your conversation.

Example usage with Cline:

{
  "prompt": "A tropical beach at sunset",
  "saveDir": "/path/to/current/workspace"
}

Usage

Running the Server

# Run the server
node build/index.js

Configuration for Cline

Add the dall-e server to your Cline MCP settings file inside VSCode's settings (ex. ~/.config/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json):

{
  "mcpServers": {
    "dalle-mcp": {
      "command": "node",
      "args": ["/path/to/dalle-mcp-server/build/index.js"],
      "env": {
        "OPENAI_API_KEY": "your-api-key-here",
        "SAVE_DIR": "/path/to/save/directory"
      },
      "disabled": false,
      "autoApprove": []
    }
  }
}

Make sure to:

  1. Replace /path/to/dalle-mcp-server/build/index.js with the actual path to the built index.js file
  2. Replace your-api-key-here with your OpenAI API key

Available Tools

generate_image

Generate an image using DALL-E based on a text prompt.

{
  "prompt": "A futuristic city with flying cars and neon lights",
  "model": "dall-e-3",
  "size": "1024x1024",
  "quality": "standard",
  "style": "vivid",
  "n": 1,
  "saveDir": "/path/to/save/directory",
  "fileName": "futuristic-city"
}

Parameters:

  • prompt (required): Text description of the desired image
  • model (optional): DALL-E model to use ("dall-e-2" or "dall-e-3", default: "dall-e-3")
  • size (optional): Size of the generated image (default: "1024x1024")
    • DALL-E 3: "1024x1024", "1792x1024", or "1024x1792"
    • DALL-E 2: "256x256", "512x512", or "1024x1024"
  • quality (optional): Quality of the generated image, DALL-E 3 only ("standard" or "hd", default: "standard")
  • style (optional): Style of the generated image, DALL-E 3 only ("vivid" or "natural", default: "vivid")
  • n (optional): Number of images to generate (1-10, default: 1)
  • saveDir (optional): Directory to save the generated images (default: current directory or SAVE_DIR from .env). For Cline users: Setting this to your current workspace directory is recommended for proper image display.
  • fileName (optional): Base filename for the generated images without extension (default: "dalle-{timestamp}")

edit_image

Edit an existing image using DALL-E based on a text prompt.

⚠️ Known Issue (March 18, 2025): The DALL-E 2 image edit API currently has a bug where it sometimes ignores the prompt and returns the original image without any edits, even when using proper RGBA format images and masks. This issue has been reported in the OpenAI community forum. If you experience this issue, try using the create_variation tool instead, which seems to work more reliably.

{
  "prompt": "Add a red hat",
  "imagePath": "/path/to/image.png",
  "mask": "/path/to/mask.png",
  "model": "dall-e-2",
  "size": "1024x1024",
  "n": 1,
  "saveDir": "/path/to/save/directory",
  "fileName": "edited-image"
}

Parameters:

  • prompt (required): Text description of the desired edits
  • imagePath (required): Path to the image to edit
  • mask (optional): Path to the mask image (white areas will be edited, black areas preserved)
  • model (optional): DALL-E model to use (currently only "dall-e-2" supports editing, default: "dall-e-2")
  • size (optional): Size of the generated image (default: "1024x1024")
  • n (optional): Number of images to generate (1-10, default: 1)
  • saveDir (optional): Directory to save the edited images (default: current directory or SAVE_DIR from .env). For Cline users: Setting this to your current workspace directory is recommended for proper image display.
  • fileName (optional): Base filename for the edited images without extension (default: "dalle-edit-{timestamp}")

create_variation

Create variations of an existing image using DALL-E.

{
  "imagePath": "/path/to/image.png",
  "model": "dall-e-2",
  "size": "1024x1024",
  "n": 4,
  "saveDir": "/path/to/save/directory",
  "fileName": "image-variation"
}

Parameters:

  • imagePath (required): Path to the image to create variations of
  • model (optional): DALL-E model to use (currently only "dall-e-2" supports variations, default: "dall-e-2")
  • size (optional): Size of the generated image (default: "1024x1024")
  • n (optional): Number of variations to generate (1-10, default: 1)
  • saveDir (optional): Directory to save the variation images (default: current directory or SAVE_DIR from .env). For Cline users: Setting this to your current workspace directory is recommended for proper image display.
  • fileName (optional): Base filename for the variation images without extension (default: "dalle-variation-{timestamp}")

validate_key

Validate the OpenAI API key.

{}

No parameters required.

Development

Testing Configuration

Note: The following .env configuration is ONLY needed for running tests, not for normal operation.

If you're developing or running tests for this project, create a .env file in the root directory with your OpenAI API key:

# Required for TESTS ONLY: OpenAI API Key
OPENAI_API_KEY=your-api-key-here

# Optional: Default save directory for test images
# If not specified, images will be saved to the current directory
# SAVE_DIR=/path/to/save/directory

For normal operation with Cline, configure your API key in the MCP settings JSON as described in the "Adding to MCP Settings" section above.

You can get your API key from OpenAI's API Keys page.

Running Tests

# Run basic tests
npm test

# Run all tests including edit and variation tests
npm run test:all

# Run tests in watch mode
npm run test:watch

# Run specific test by name
npm run test:name "should validate API key"

Note: Tests use real API calls and may incur charges on your OpenAI account.

Generating Test Images

The project includes a script to generate test images for development and testing:

# Generate a test image in the assets directory
npm run generate-test-image

This will create a simple test image in the assets directory that can be used for testing the edit and variation features.

License

MIT

相关推荐

  • NiKole Maxwell
  • I craft unique cereal names, stories, and ridiculously cute Cereal Baby images.

  • Joshua Armstrong
  • Confidential guide on numerology and astrology, based of GG33 Public information

  • https://suefel.com
  • Latest advice and best practices for custom GPT development.

  • Emmet Halm
  • Converts Figma frames into front-end code for various mobile frameworks.

  • Khalid kalib
  • Write professional emails

  • Callycode Limited
  • A geek-themed horoscope generator blending Bitcoin prices, tech jargon, and astrological whimsy.

  • Elijah Ng Shi Yi
  • Advanced software engineer GPT that excels through nailing the basics.

  • INFOLAB OPERATIONS 2
  • A medical specialist offering assistance grounded in clinical guidelines. Disclaimer: This is intended for research and is NOT safe for clinical use!

  • Yasir Eryilmaz
  • AI scriptwriting assistant for short, engaging video content.

  • Alexandru Strujac
  • Efficient thumbnail creator for YouTube videos

  • apappascs
  • 发现市场上最全面,最新的MCP服务器集合。该存储库充当集中式枢纽,提供了广泛的开源和专有MCP服务器目录,并提供功能,文档链接和贡献者。

  • ShrimpingIt
  • MCP系列GPIO Expander的基于Micropython I2C的操作,源自ADAFRUIT_MCP230XX

  • OffchainLabs
  • 进行以太坊的实施

  • huahuayu
  • 统一的API网关,用于将多个Etherscan样区块链Explorer API与对AI助手的模型上下文协议(MCP)支持。

  • deemkeen
  • 用电源组合控制您的MBOT2:MQTT+MCP+LLM

  • zhaoyunxing92
  • MCP(消息连接器协议)服务

  • pontusab
  • 光标与风浪冲浪社区,查找规则和MCP

    Reviews

    4 (1)
    Avatar
    user_hQFlyoqi
    2025-04-15

    The OpenSpartan Forerunner is an exceptional tool for Halo Infinite enthusiasts. With its seamless integration and user-friendly interface, it allows for an enhanced gaming experience. Kudos to the author, dend, for crafting such a robust and efficient application. Definitely worth checking out!