2025-04-14

Create short videos for TikTok, Instagram Reels, and YouTube Shorts using the Model Context Protocol (MCP) and a REST API.

GitHub: 34 watches, 14 forks, 34 stars

Short Video Maker

An open source automated video creation tool for generating short-form video content. Short Video Maker combines text-to-speech, automatic captions, background videos, and music to create engaging short videos from simple text inputs.

This repository was open-sourced by the AI Agents A-Z YouTube channel. Check out the channel for more AI-related content and tutorials.

Hardware requirements

  • CPU: at least 2 cores recommended
  • GPU: optional; speeds up caption generation (whisper.cpp) considerably and video rendering somewhat

Watch the official video on how to generate videos with n8n: "Automated faceless video generation (n8n + MCP) with captions, background music, local and 100% free"

Running the Project

Using NPX (recommended)

The easiest way to run the project with GPU support out of the box:

LOG_LEVEL=debug PEXELS_API_KEY= npx short-video-maker

Using Docker

CPU image

docker run -it --rm --name short-video-maker -p 3123:3123 \
  -e PEXELS_API_KEY= \
  gyoridavid/short-video-maker:latest

NVIDIA GPUs

docker run -it --rm --name short-video-maker -p 3123:3123 \
  -e PEXELS_API_KEY= --gpus=all \
  gyoridavid/short-video-maker:latest-cuda

Find help

Join our Discord community for support and discussions.

Environment Variables

| Variable | Description |
| --- | --- |
| PEXELS_API_KEY | Your Pexels API key for background video sourcing |
| PORT | Port for the API/MCP server (default: 3123) |
| LOG_LEVEL | Log level for the server (default: info, options: trace, debug, info, warn, error) |
| WHISPER_VERBOSE | Verbose mode for Whisper (default: false) |
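
To make the defaults above concrete, here is a minimal Python sketch that reads the same variables and falls back to the documented defaults. It is not the project's own configuration loader. The function name `load_settings` and the rule that only the string "true" enables `WHISPER_VERBOSE` are assumptions made for illustration.

```python
import os
from typing import Mapping

# Log levels listed in the table above.
VALID_LOG_LEVELS = ("trace", "debug", "info", "warn", "error")


def load_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Read the documented variables, applying the documented defaults."""
    log_level = env.get("LOG_LEVEL", "info")
    if log_level not in VALID_LOG_LEVELS:
        raise ValueError(
            f"LOG_LEVEL must be one of {VALID_LOG_LEVELS}, got {log_level!r}"
        )
    return {
        "pexels_api_key": env.get("PEXELS_API_KEY", ""),
        "port": int(env.get("PORT", "3123")),
        "log_level": log_level,
        # Assumption: only the literal string "true" switches verbose mode on.
        "whisper_verbose": env.get("WHISPER_VERBOSE", "false").lower() == "true",
    }


# Example with an explicit mapping instead of the real environment.
print(load_settings({"PORT": "8080", "LOG_LEVEL": "debug"}))
```

Passing a plain dictionary keeps the example independent of whatever happens to be set in the current shell.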

Example

{
  "scenes": [
    {
      "text": "Hello world! Enjoy using this tool to create awesome AI workflows",
      "searchTerms": ["rainbow"]
    }
  ],
  "config": {
    "paddingBack": 1500,
    "music": "happy"
  }
}
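
A request body of this shape can also be assembled programmatically. The sketch below is a hedged illustration in Python: the helper name `build_video_request` is hypothetical, and the non-empty checks are client-side sanity checks chosen for this example, not rules confirmed by the server.

```python
import json


def build_video_request(scenes, padding_back=1500, music="happy"):
    """Assemble a body with the same keys as the example above.

    `scenes` is a list of (text, search_terms) pairs.
    """
    if not scenes:
        raise ValueError("at least one scene is required")
    body_scenes = []
    for text, search_terms in scenes:
        if not text.strip():
            raise ValueError("scene text must not be empty")
        body_scenes.append({"text": text, "searchTerms": list(search_terms)})
    return {
        "scenes": body_scenes,
        "config": {"paddingBack": padding_back, "music": music},
    }


body = build_video_request(
    [("Hello world! Enjoy using this tool to create awesome AI workflows", ["rainbow"])]
)
print(json.dumps(body, indent=2))
```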

Features

  • Generate complete short videos from text prompts
  • Text-to-speech conversion
  • Automatic caption generation and styling
  • Background video search and selection via Pexels
  • Background music with genre/mood selection
  • Serve as both REST API and Model Context Protocol (MCP) server

How It Works

Short Video Maker takes simple text inputs and search terms, then:

  1. Converts text to speech using Kokoro TTS
  2. Generates accurate captions via Whisper
  3. Finds relevant background videos from Pexels
  4. Composes all elements with Remotion
  5. Renders a professional-looking short video with perfectly timed captions
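
The order of these steps can be pictured as a small data flow. The Python sketch below is only an illustration of that order: the stage functions are stand-ins passed in by the caller, and none of them is a real Kokoro, Whisper, Pexels, or Remotion call. The names `Scene`, `PreparedScene`, and `run_pipeline` are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Scene:
    text: str
    search_terms: List[str]


@dataclass
class PreparedScene:
    audio: str            # stand-in for the synthesized speech
    captions: List[str]   # stand-in for captions derived from that speech
    background: str       # stand-in for the selected background video


def run_pipeline(
    scenes: List[Scene],
    tts: Callable[[str], str],
    transcribe: Callable[[str], List[str]],
    find_video: Callable[[List[str]], str],
    render: Callable[[List[PreparedScene]], object],
):
    """Run the stages in the order listed above, one scene at a time."""
    prepared = []
    for scene in scenes:
        audio = tts(scene.text)                      # step 1: text to speech
        captions = transcribe(audio)                 # step 2: captions from the audio
        background = find_video(scene.search_terms)  # step 3: background video
        prepared.append(PreparedScene(audio, captions, background))
    return render(prepared)                          # steps 4 and 5: compose and render


# Toy stand-ins so the flow can be executed without any external service.
result = run_pipeline(
    [Scene("Hello world", ["rainbow"])],
    tts=lambda text: f"audio<{text}>",
    transcribe=lambda audio: audio.split(),
    find_video=lambda terms: f"video<{terms[0]}>",
    render=lambda prepared: f"rendered {len(prepared)} scene(s)",
)
print(result)
```

Passing the stages in as callables keeps the sketch honest about what it does not know: how each real component is invoked inside the project.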

Dependencies for the video generation

| Dependency | Version | License | Purpose |
| --- | --- | --- | --- |
| Remotion | ^4.0.286 | Remotion License | Video composition and rendering |
| Whisper CPP | v1.5.5 | MIT | Speech-to-text for captions |
| FFmpeg | ^2.1.3 | LGPL/GPL | Audio/video manipulation |
| Kokoro.js | ^1.2.0 | MIT | Text-to-speech generation |
| Pexels API | N/A | Pexels Terms | Background videos |

How to contribute?

PRs are welcome. See the CONTRIBUTING.md file for instructions on setting up a local development environment.

API Usage

REST API

The following REST endpoints are available:

  1. GET /api/short-video/:id - Get a video by ID. The response can be saved directly to a file:

curl -o output.mp4 http://localhost:3123/api/short-video/<videoId>

  2. POST /api/short-video - Create a new video
    {
      "scenes": [
        {
          "text": "This is the text to be spoken in the video",
          "searchTerms": ["nature sunset"]
        }
      ],
      "config": {
        "paddingBack": 3000,
        "music": "chill"
      }
    }

  3. DELETE /api/short-video/:id - Delete a video by ID
  4. GET /api/music-tags - Get available music tags
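
The endpoints above can be called from any HTTP client. The following is a minimal Python sketch using only the standard library. The base URL assumes the default port 3123 on localhost, the function names are hypothetical, and the assumption that the POST and music-tags endpoints answer with JSON is not confirmed by the text above, so responses are returned without interpreting their fields.

```python
import json
import shutil
import urllib.request

BASE_URL = "http://localhost:3123"  # assumes the default PORT


def build_request(method, path, body=None, base_url=BASE_URL):
    """Build (but do not send) a request for one of the endpoints."""
    data = None
    headers = {}
    if body is not None:
        data = json.dumps(body).encode("utf-8")
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(
        base_url + path, data=data, headers=headers, method=method
    )


def create_video(body, base_url=BASE_URL):
    """POST /api/short-video; returns the parsed response as-is."""
    req = build_request("POST", "/api/short-video", body, base_url)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)  # assumption: the server replies with JSON


def download_video(video_id, dest, base_url=BASE_URL):
    """GET /api/short-video/:id and stream the bytes to a local file."""
    req = build_request("GET", f"/api/short-video/{video_id}", base_url=base_url)
    with urllib.request.urlopen(req) as resp, open(dest, "wb") as out:
        shutil.copyfileobj(resp, out)


def delete_video(video_id, base_url=BASE_URL):
    """DELETE /api/short-video/:id; returns the HTTP status code."""
    req = build_request("DELETE", f"/api/short-video/{video_id}", base_url=base_url)
    with urllib.request.urlopen(req) as resp:
        return resp.status


def list_music_tags(base_url=BASE_URL):
    """GET /api/music-tags; returns the parsed response as-is."""
    req = build_request("GET", "/api/music-tags", base_url=base_url)
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)  # assumption: the server replies with JSON
```

With a server running, `download_video(video_id, "output.mp4")` mirrors the curl command above. How the video ID is read from the `create_video` response is left open here because the response shape is not documented in this section.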

Model Context Protocol (MCP)

The service also implements the Model Context Protocol:

  1. GET /mcp/sse - Server-sent events for MCP
  2. POST /mcp/messages - Send messages to MCP server

Available MCP tools:

  • create-short-video - Create a video from a list of scenes
  • get-video-status - Check video creation status
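
MCP messages are JSON-RPC 2.0 objects, and a tool is invoked with the `tools/call` method. The Python sketch below only builds such messages for the two tools listed above. The argument shapes are assumptions: `create-short-video` is assumed to accept the same `scenes` and `config` keys as the REST endpoint, and `get-video-status` is assumed to take a `videoId` field. Check the tool schemas reported by the server before relying on either. With an SSE-based transport the client typically learns the exact URL for posting messages from the event stream, so sending is left out.

```python
import json


def tool_call(name, arguments, request_id):
    """Build a JSON-RPC 2.0 `tools/call` message for an MCP server."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


create_msg = tool_call(
    "create-short-video",
    {
        # Assumed to mirror the REST request body.
        "scenes": [{"text": "Hello world", "searchTerms": ["rainbow"]}],
        "config": {"paddingBack": 1500, "music": "happy"},
    },
    request_id=1,
)

# "videoId" is a hypothetical argument name used for illustration.
status_msg = tool_call("get-video-status", {"videoId": "<videoId>"}, request_id=2)

print(json.dumps(create_msg, indent=2))
print(json.dumps(status_msg, indent=2))
```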

License

This project is licensed under the MIT License.

Acknowledgments

  • ❤️ Remotion for programmatic video generation
  • ❤️ Whisper for speech-to-text
  • ❤️ Pexels for video content
  • ❤️ FFmpeg for audio/video processing
  • ❤️ Kokoro for TTS
