MiniMax Multimodal

Official
minimax-ai

Official JavaScript implementation that connects to MiniMax's multimodal AI services to generate images, create videos, convert text to speech, and clone voices, across multiple transport modes.

What it does

  • Generate images from text prompts
  • Create videos from descriptions
  • Convert text to speech
  • Clone voices for audio synthesis
  • Access multimodal AI models
  • Process multimedia content

Best for

  • Content creators needing AI-generated media
  • Developers building multimedia applications
  • Teams automating video and audio production
  • Applications requiring voice synthesis

  • Official MiniMax implementation
  • Multiple AI modalities in one server
  • JavaScript/TypeScript support

About MiniMax Multimodal

MiniMax Multimodal is an official MCP server published by minimax-ai that provides AI assistants with tools and capabilities via the Model Context Protocol. It integrates image generation, video generation, text-to-speech, and voice cloning. It is categorized under Other and AI/ML.

How to install

You can install MiniMax Multimodal in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

MiniMax Multimodal is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

MiniMax MCP JS

JavaScript/TypeScript implementation of MiniMax MCP, providing image generation, video generation, text-to-speech, and more.

Documentation

Release Notes

July 22, 2025

🔧 Fixes & Improvements

  • TTS Tool Fixes: Fixed parameter handling for languageBoost and subtitleEnable in the text_to_audio tool
  • API Response Enhancement: The TTS API can now return both an audio file and a subtitle file, providing a more complete text-to-speech experience
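
To illustrate the two parameters named above: an MCP client invokes a tool by sending a JSON-RPC 2.0 tools/call request. The sketch below builds such a request for text_to_audio. Only the tool name and the languageBoost and subtitleEnable parameter names come from the release note; the text argument, the value formats, and the buildToolCall helper are assumptions for illustration, not the tool's confirmed schema.

```javascript
// Hypothetical helper: builds an MCP `tools/call` request (JSON-RPC 2.0).
function buildToolCall(id, name, args) {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

const request = buildToolCall(1, "text_to_audio", {
  text: "Hello from MiniMax", // assumed argument name
  languageBoost: "English",   // name from the release note; value format assumed
  subtitleEnable: true,       // name from the release note; boolean assumed
});

// Over the stdio transport, each message is written as one JSON object per line.
console.log(JSON.stringify(request));
```

In practice the MCP client (Claude Desktop, Cursor, and so on) builds this request for you; the sketch only shows where the two fixed parameters would travel.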

July 7, 2025

🆕 What's New

  • Voice Design: New voice_design tool - create custom voices from descriptive prompts with preview audio
  • Video Enhancement: Added MiniMax-Hailuo-02 model with ultra-clear quality and duration/resolution controls
  • Music Generation: Enhanced music_generation tool powered by music-1.5 model

📈 Enhanced Tools

  • voice_design - Generate personalized voices from text descriptions
  • generate_video - Now supports MiniMax-Hailuo-02 with 6s/10s duration and 768P/1080P resolution options
  • music_generation - High-quality music creation with music-1.5 model
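
The duration and resolution options listed for generate_video can be checked before a request is sent. The following is a minimal sketch, assuming the options are passed as model, duration, and resolution; those option names and the checkVideoOptions helper are hypothetical. The release note does not say whether every duration and resolution combination is accepted, so the sketch validates each value on its own.

```javascript
// Allowed values taken from the release note for MiniMax-Hailuo-02.
const HAILUO_02 = {
  model: "MiniMax-Hailuo-02",
  durations: [6, 10], // seconds
  resolutions: ["768P", "1080P"],
};

// Hypothetical pre-flight check; returns a list of problems (empty if none).
function checkVideoOptions({ model, duration, resolution }) {
  const problems = [];
  if (model !== HAILUO_02.model) return problems; // other models are not covered here
  if (!HAILUO_02.durations.includes(duration)) {
    problems.push(`duration must be one of ${HAILUO_02.durations.join(", ")} seconds`);
  }
  if (!HAILUO_02.resolutions.includes(resolution)) {
    problems.push(`resolution must be one of ${HAILUO_02.resolutions.join(", ")}`);
  }
  return problems;
}

console.log(checkVideoOptions({ model: "MiniMax-Hailuo-02", duration: 6, resolution: "1080P" }));
console.log(checkVideoOptions({ model: "MiniMax-Hailuo-02", duration: 8, resolution: "720P" }));
```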

Features

  • Text-to-Speech (TTS)
  • Image Generation
  • Video Generation
  • Voice Cloning
  • Music Generation
  • Voice Design
  • Dynamic configuration (supports both environment variables and request parameters)
  • Compatible with MCP platform hosting (ModelScope and other MCP platforms)
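
Dynamic configuration can be pictured as merging two sources. The sketch below is one way to do that, not the server's actual logic: it assumes request parameters take precedence over environment variables, which this section does not state. The environment variable names are the ones used in the client configuration in this README; the request-parameter field names and the resolveConfig helper are hypothetical.

```javascript
// Environment variable names used in this README, keyed by a hypothetical field name.
const ENV_KEYS = {
  apiKey: "MINIMAX_API_KEY",
  apiHost: "MINIMAX_API_HOST",
  basePath: "MINIMAX_MCP_BASE_PATH",
  resourceMode: "MINIMAX_RESOURCE_MODE",
};

// Hypothetical merge: a request parameter wins over the environment variable
// (assumed precedence); fields set in neither source are left out.
function resolveConfig(env, requestParams = {}) {
  const config = {};
  for (const [field, envName] of Object.entries(ENV_KEYS)) {
    const value = requestParams[field] ?? env[envName];
    if (value !== undefined) config[field] = value;
  }
  return config;
}

const config = resolveConfig(
  { MINIMAX_API_KEY: "env-key", MINIMAX_RESOURCE_MODE: "url" },
  { resourceMode: "local" },
);
console.log(config);
```

In a real process the first argument would be process.env; a plain object is used here so the merge is easy to follow.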

Installation

Installing via Smithery

To install MiniMax MCP JS for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @MiniMax-AI/MiniMax-MCP-JS --client claude

Installing manually

# Install with pnpm (recommended)
pnpm add minimax-mcp-js

Quick Start

MiniMax MCP JS implements the Model Context Protocol (MCP) specification and can be used as a server to interact with MCP-compatible clients (such as Claude AI).

Quickstart with MCP Client

  1. Get your API key from the MiniMax International Platform.
  2. Make sure Node.js and npm are installed.
  3. Important: The API host and key differ by region and must match; otherwise you will receive an Invalid API key error.

     Region            Global                                           Mainland
     MINIMAX_API_KEY   Get from MiniMax Global                          Get from MiniMax
     MINIMAX_API_HOST  https://api.minimaxi.chat (note the extra "i")   https://api.minimax.chat
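
A mismatched host and key is easy to catch before the server starts. The following is a minimal sketch of such a check, using the two hosts from the table above. The regionForHost and checkRegion helpers are hypothetical, and the sketch assumes you know which platform issued your key, since the README does not describe a way to tell from the key itself.

```javascript
// Hosts copied from the region table above.
const API_HOSTS = {
  global: "https://api.minimaxi.chat", // note the extra "i"
  mainland: "https://api.minimax.chat",
};

// Hypothetical helper: returns the region a host belongs to, or null if unknown.
function regionForHost(host) {
  const normalized = host.trim().replace(/\/+$/, ""); // ignore trailing slashes
  const match = Object.entries(API_HOSTS).find(([, url]) => url === normalized);
  return match ? match[0] : null;
}

// Hypothetical check: returns null when the key's region matches the host,
// otherwise a message describing the mismatch.
function checkRegion(keyRegion, host) {
  const hostRegion = regionForHost(host);
  if (hostRegion === null) return `Unknown host: ${host}`;
  if (hostRegion !== keyRegion) {
    return `Key is for "${keyRegion}" but host is "${hostRegion}"; expect an Invalid API key error`;
  }
  return null;
}

console.log(checkRegion("global", "https://api.minimaxi.chat"));
console.log(checkRegion("mainland", "https://api.minimaxi.chat"));
```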

Using with MCP Clients (Recommended)

Configure your MCP client:

Claude Desktop

Go to Claude > Settings > Developer > Edit Config > claude_desktop_config.json to include:

{
  "mcpServers": {
    "minimax-mcp-js": {
      "command": "npx",
      "args": [
        "-y",
        "minimax-mcp-js"
      ],
      "env": {
        "MINIMAX_API_HOST": "<https://api.minimaxi.chat|https://api.minimax.chat>",
        "MINIMAX_API_KEY": "<your-api-key-here>",
        "MINIMAX_MCP_BASE_PATH": "<local-output-dir-path, such as /User/xxx/Desktop>",
        "MINIMAX_RESOURCE_MODE": "<optional, [url|local], url is default, audio/image/video are downloaded locally or provided in URL format>"
      }
    }
  }
}
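
If you prefer to generate this file rather than edit it by hand, the entry can be built programmatically. The sketch below is an illustration under assumptions: the buildServerEntry helper and its option names are hypothetical, and the output path is an example value. The command, arguments, and environment variable names are the ones shown in the config above.

```javascript
// Hypothetical helper: builds the "minimax-mcp-js" server entry shown above.
function buildServerEntry({ apiHost, apiKey, basePath, resourceMode }) {
  const env = {
    MINIMAX_API_HOST: apiHost,
    MINIMAX_API_KEY: apiKey,
    MINIMAX_MCP_BASE_PATH: basePath,
  };
  // MINIMAX_RESOURCE_MODE is optional; when omitted, the default (url) applies.
  if (resourceMode !== undefined) {
    if (!["url", "local"].includes(resourceMode)) {
      throw new Error(`resourceMode must be "url" or "local", got "${resourceMode}"`);
    }
    env.MINIMAX_RESOURCE_MODE = resourceMode;
  }
  return { command: "npx", args: ["-y", "minimax-mcp-js"], env };
}

const clientConfig = {
  mcpServers: {
    "minimax-mcp-js": buildServerEntry({
      apiHost: "https://api.minimaxi.chat",
      apiKey: "<your-api-key-here>",
      basePath: "/tmp/minimax-output", // example path, not from the README
      resourceMode: "local",
    }),
  },
};

// Paste the printed JSON into claude_desktop_config.json.
console.log(JSON.stringify(clientConfig, null, 2));
```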

Cursor

Go to Cursor → Preferences → Cursor Settings → MCP → Add new global MCP Server to add the above config.

⚠️ Note: If you encounter a "No tools found" error when using MiniMax MCP JS with Cursor, please update your Cursor to the latest version. For more information, see this discussion thread.

That's it. Your MCP client can now interact with MiniMax through these tools.

For local development: When developing locally, you can use npm link to test your changes:

# In your project directory
npm link

Then configure Claude Desktop or Cursor to use npx as shown above. This will automatically use your linked version.

⚠️ Note: The API key needs to match the host address. Different hosts are used for global and mainland China versions:

  • Global Host: https://api.minimaxi.chat (note the extra "i")
  • Mainland China Host: https://api.minimax.chat

Transport Modes

MiniMax MCP JS supports three transport modes:

Feature        stdio (default)    REST                        SSE
Environment    Local only         Local or cloud deployment   Local or cloud deployment
Communication  Via standard I/O   Via HTTP requests           Via server-sent events
Use Cases      L

README truncated. View full README on GitHub.
