Markdownify MCP

Name: Markdownify MCP
Rating: 4.5 (1363 reviews)
Author: zcaceres

Converts various file formats (PDFs, images, DOCX, XLSX, web pages, YouTube videos) into clean Markdown that AI assistants can read and analyze.

Convert almost anything to Markdown. Transforms PDFs, images, web pages, DOCX, XLSX, and other formats into clean Markdown that AI assistants can read and analyze. 2,400+ GitHub stars.

2,43892 views200Local (stdio)

productivity developer tools

GitHub

What it does

Convert PDFs to Markdown
Transform images to readable text
Extract YouTube video transcripts
Convert Office documents (DOCX, XLSX, PPTX)
Turn web pages into Markdown
Transcribe audio files to text

Best for

Content creators processing diverse file formatsResearchers analyzing documents with AI assistantsDevelopers building document processing workflows

2,400+ GitHub starsSupports 6+ file formatsYouTube transcript extraction

About Markdownify MCP

Markdownify MCP is a community-built MCP server published by zcaceres that provides AI assistants with tools and capabilities via the Model Context Protocol. Convert almost anything to Markdown. Transforms PDFs, images, web pages, DOCX, XLSX, and other formats into clean Markdo It is categorized under productivity, developer tools.

How to install

You can install Markdownify MCP in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

Markdownify MCP is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

Markdownify MCP Server

Help! I need someone with a Windows computer to help me add support for Markdownify-MCP on Windows. PRs exist but I cannot test them. Post here if interested.

Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format. It provides a set of tools to transform PDFs, images, audio files, web pages, and more into easily readable and shareable Markdown text.

Features

Convert multiple file types to Markdown:
- PDF
- Images
- Audio (with transcription)
- DOCX
- XLSX
- PPTX
Convert web content to Markdown:
- YouTube video transcripts
- Bing search results
- General web pages
Retrieve existing Markdown files

Getting Started

Clone this repository
Install dependencies:
```
pnpm install
```

Note: this will also install uv and related Python depdencies.

Build the project:
```
pnpm run build
```
Start the server:
```
pnpm start
```

Development

Use pnpm run dev to start the TypeScript compiler in watch mode
Modify src/server.ts to customize server behavior
Add or modify tools in src/tools.ts

Usage with Desktop App

To integrate this server with a desktop app, add the following to your app's server configuration:

{
  "mcpServers": {
    "markdownify": {
      "command": "node",
      "args": [
        "{ABSOLUTE PATH TO FILE HERE}/dist/index.js"
      ],
      "env": {
        // By default, the server will use the default install location of `uv`
        "UV_PATH": "/path/to/uv"
      }
    }
  }
}

Available Tools

youtube-to-markdown: Convert YouTube videos to Markdown
pdf-to-markdown: Convert PDF files to Markdown
bing-search-to-markdown: Convert Bing search results to Markdown
webpage-to-markdown: Convert web pages to Markdown
image-to-markdown: Convert images to Markdown with metadata
audio-to-markdown: Convert audio files to Markdown with transcription
docx-to-markdown: Convert DOCX files to Markdown
xlsx-to-markdown: Convert XLSX files to Markdown
pptx-to-markdown: Convert PPTX files to Markdown
get-markdown-file: Retrieve an existing Markdown file. File extension must end with: *.md, *.markdown.

OPTIONAL: set MD_SHARE_DIR env var to restrict the directory from which files can be retrieved, e.g. MD_SHARE_DIR=[SOME_PATH] pnpm run start

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Alternatives

Chrome DevTools MCP

chromedevtools

28.1k

AI-driven control of live Chrome via Chrome DevTools: browser automation, debugging, performance analysis and network mo

OfficialPopular

50711

Chrome DevTools

chromedevtools

28.1k

Use Chrome DevTools for web site test speed, debugging, and performance analysis. The essential chrome developer tools f

OfficialPopular

3.9k172

GitHub

github

27.6k

Extend your developer tools with GitHub MCP Server for advanced automation, supporting GitHub Student and student packag

OfficialRemotePopular

4.5k232

Task Master

eyaltoledano

25.8k

Boost productivity with Task Master: an AI-powered tool for project management and agile development workflows, integrat

CommunityPopular

4.9k114

Related Skills

Browse all skills

ui-design-system

UI design system toolkit for Senior UI Designer including design token generation, component documentation, responsive design calculations, and developer handoff tools. Use for creating design systems, maintaining visual consistency, and facilitating design-dev collaboration.

ai-sdk

Answer questions about the AI SDK and help build AI-powered features. Use when developers: (1) Ask about AI SDK functions like generateText, streamText, ToolLoopAgent, embed, or tools, (2) Want to build AI agents, chatbots, RAG systems, or text generation features, (3) Have questions about AI providers (OpenAI, Anthropic, Google, etc.), streaming, tool calling, structured output, or embeddings, (4) Use React hooks like useChat or useCompletion. Triggers on: "AI SDK", "Vercel AI SDK", "generateText", "streamText", "add AI to my app", "build an agent", "tool calling", "structured output", "useChat".

ai-assisted-development

Leveraging AI coding assistants and tools to boost development productivity, while maintaining oversight to ensure quality results.

api-documenter

Master API documentation with OpenAPI 3.1, AI-powered tools, and modern developer experience practices. Create interactive docs, generate SDKs, and build comprehensive developer portals. Use PROACTIVELY for API documentation or developer portal creation.

teams-channel-post-writer

Creates educational Teams channel posts for internal knowledge sharing about Claude Code features, tools, and best practices. Applies when writing posts, announcements, or documentation to teach colleagues effective Claude Code usage, announce new features, share productivity tips, or document lessons learned. Provides templates, writing guidelines, and structured approaches emphasizing concrete examples, underlying principles, and connections to best practices like context engineering. Activates for content involving Teams posts, channel announcements, feature documentation, or tip sharing.

openai-knowledge

Use when working with the OpenAI API (Responses API) or OpenAI platform features (tools, streaming, Realtime API, auth, models, rate limits, MCP) and you need authoritative, up-to-date documentation (schemas, examples, limits, edge cases). Prefer the OpenAI Developer Documentation MCP server tools when available; otherwise guide the user to enable `openaiDeveloperDocs`.