
MiniMax Multimodal
OfficialConnects to MiniMax's multimodal AI services to generate images, create videos, convert text to speech, and clone voices. Official JavaScript implementation for accessing MiniMax's AI capabilities.
Official JavaScript implementation that integrates with MiniMax's multimodal capabilities for image generation, video creation, text-to-speech, and voice cloning across multiple transport modes.
What it does
- Generate images from text prompts
- Create videos from descriptions
- Convert text to speech
- Clone voices for audio synthesis
- Access multimodal AI models
- Process multimedia content
Best for
About MiniMax Multimodal
MiniMax Multimodal is an official MCP server published by minimax-ai that provides AI assistants with tools and capabilities via the Model Context Protocol. MiniMax Multimodal JavaScript integrates image, video, text-to-speech, and voice cloning for advanced multimodal experie It is categorized under other, ai ml.
How to install
You can install MiniMax Multimodal in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.
License
MiniMax Multimodal is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

MiniMax MCP JS
JavaScript/TypeScript implementation of MiniMax MCP, providing image generation, video generation, text-to-speech, and more.
Documentation
- δΈζζζ‘£
- Python Version - Official Python implementation of MiniMax MCP
Release Notes
July 22, 2025
π§ Fixes & Improvements
- TTS Tool Fixes: Fixed parameter handling for
languageBoostandsubtitleEnablein thetext_to_audiotool - API Response Enhancement: TTS API can return both audio file and subtitle file, providing a more complete speech-to-text experience
July 7, 2025
π What's New
- Voice Design: New
voice_designtool - create custom voices from descriptive prompts with preview audio - Video Enhancement: Added
MiniMax-Hailuo-02model with ultra-clear quality and duration/resolution controls - Music Generation: Enhanced
music_generationtool powered bymusic-1.5model
π Enhanced Tools
voice_design- Generate personalized voices from text descriptionsgenerate_video- Now supports MiniMax-Hailuo-02 with 6s/10s duration and 768P/1080P resolution optionsmusic_generation- High-quality music creation with music-1.5 model
Features
- Text-to-Speech (TTS)
- Image Generation
- Video Generation
- Voice Cloning
- Music Generation
- Voice Design
- Dynamic configuration (supports both environment variables and request parameters)
- Compatible with MCP platform hosting (ModelScope and other MCP platforms)
Installation
Installing via Smithery
To install MiniMax MCP JS for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @MiniMax-AI/MiniMax-MCP-JS --client claude
Installing manually
# Install with pnpm (recommended)
pnpm add minimax-mcp-js
Quick Start
MiniMax MCP JS implements the Model Context Protocol (MCP) specification and can be used as a server to interact with MCP-compatible clients (such as Claude AI).
Quickstart with MCP Client
- Get your API key from MiniMax International Platform.
- Make sure that you already installed Node.js and npm
- Important: API HOST&KEY are different in different region, they must match, otherwise you will receive an
Invalid API keyerror.
| Region | Global | Mainland |
|---|---|---|
| MINIMAX_API_KEY | go get from MiniMax Global | go get from MiniMax |
| MINIMAX_API_HOST | βhttps://api.minimaxi.chat (note the extra "i") | βhttps://api.minimax.chat |
Using with MCP Clients (Recommended)
Configure your MCP client:
Claude Desktop
Go to Claude > Settings > Developer > Edit Config > claude_desktop_config.json to include:
{
"mcpServers": {
"minimax-mcp-js": {
"command": "npx",
"args": [
"-y",
"minimax-mcp-js"
],
"env": {
"MINIMAX_API_HOST": "<https://api.minimaxi.chat|https://api.minimax.chat>",
"MINIMAX_API_KEY": "<your-api-key-here>",
"MINIMAX_MCP_BASE_PATH": "<local-output-dir-path, such as /User/xxx/Desktop>",
"MINIMAX_RESOURCE_MODE": "<optional, [url|local], url is default, audio/image/video are downloaded locally or provided in URL format>"
}
}
}
}
Cursor
Go to Cursor β Preferences β Cursor Settings β MCP β Add new global MCP Server to add the above config.
β οΈ Note: If you encounter a "No tools found" error when using MiniMax MCP JS with Cursor, please update your Cursor to the latest version. For more information, see this discussion thread.
That's it. Your MCP client can now interact with MiniMax through these tools.
For local development:
When developing locally, you can use npm link to test your changes:
# In your project directory
npm link
Then configure Claude Desktop or Cursor to use npx as shown above. This will automatically use your linked version.
β οΈ Note: The API key needs to match the host address. Different hosts are used for global and mainland China versions:
- Global Host:
https://api.minimaxi.chat(note the extra "i") - Mainland China Host:
https://api.minimaxi.chat
Transport Modes
MiniMax MCP JS supports three transport modes:
| Feature | stdio (default) | REST | SSE |
|---|---|---|---|
| Environment | Local only | Local or cloud deployment | Local or cloud deployment |
| Communication | Via standard I/O | Via HTTP requests | Via server-sent events |
| Use Cases | L |
README truncated. View full README on GitHub.
Alternatives
Related Skills
Browse all skillsCreate professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.
Presentation creation, editing, and analysis. When Claude needs to work with presentations (.pptx files) for: (1) Creating new presentations, (2) Modifying or editing content, (3) Working with layouts, (4) Adding comments or speaker notes, or any other presentation tasks
Comprehensive document creation, editing, and analysis with support for tracked changes, comments, formatting preservation, and text extraction. When Claude needs to work with professional documents (.docx files) for: (1) Creating new documents, (2) Modifying or editing content, (3) Working with tracked changes, (4) Adding comments, or any other document tasks
Downloads videos from YouTube and other platforms for offline viewing, editing, or archival. Handles various formats and quality options.
Provides comprehensive technical analysis for stocks and ETFs using RSI, MACD, Bollinger Bands, and other indicators. Activates when user requests stock analysis, technical indicators, trading signals, or market data for specific ticker symbols.
Install Codex skills into $CODEX_HOME/skills from a curated list or a GitHub repo path. Use when a user asks to list installable skills, install a curated skill, or install a skill from another repo (including private repos).