
Markdownify MCP
Converts various file formats (PDFs, images, DOCX, XLSX, web pages, YouTube videos) into clean Markdown that AI assistants can read and analyze.
Convert almost anything to Markdown. Transforms PDFs, images, web pages, DOCX, XLSX, and other formats into clean Markdown that AI assistants can read and analyze. 2,400+ GitHub stars.
What it does
- Convert PDFs to Markdown
- Transform images to readable text
- Extract YouTube video transcripts
- Convert Office documents (DOCX, XLSX, PPTX)
- Turn web pages into Markdown
- Transcribe audio files to text
Best for
About Markdownify MCP
Markdownify MCP is a community-built MCP server published by zcaceres that provides AI assistants with tools and capabilities via the Model Context Protocol. Convert almost anything to Markdown. Transforms PDFs, images, web pages, DOCX, XLSX, and other formats into clean Markdo It is categorized under productivity, developer tools.
How to install
You can install Markdownify MCP in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.
License
Markdownify MCP is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.
Markdownify MCP Server
Help! I need someone with a Windows computer to help me add support for Markdownify-MCP on Windows. PRs exist but I cannot test them. Post here if interested.

Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format. It provides a set of tools to transform PDFs, images, audio files, web pages, and more into easily readable and shareable Markdown text.
Features
- Convert multiple file types to Markdown:
- Images
- Audio (with transcription)
- DOCX
- XLSX
- PPTX
- Convert web content to Markdown:
- YouTube video transcripts
- Bing search results
- General web pages
- Retrieve existing Markdown files
Getting Started
- Clone this repository
- Install dependencies:
pnpm install
Note: this will also install uv and related Python depdencies.
- Build the project:
pnpm run build - Start the server:
pnpm start
Development
- Use
pnpm run devto start the TypeScript compiler in watch mode - Modify
src/server.tsto customize server behavior - Add or modify tools in
src/tools.ts
Usage with Desktop App
To integrate this server with a desktop app, add the following to your app's server configuration:
{
"mcpServers": {
"markdownify": {
"command": "node",
"args": [
"{ABSOLUTE PATH TO FILE HERE}/dist/index.js"
],
"env": {
// By default, the server will use the default install location of `uv`
"UV_PATH": "/path/to/uv"
}
}
}
}
Available Tools
-
youtube-to-markdown: Convert YouTube videos to Markdown -
pdf-to-markdown: Convert PDF files to Markdown -
bing-search-to-markdown: Convert Bing search results to Markdown -
webpage-to-markdown: Convert web pages to Markdown -
image-to-markdown: Convert images to Markdown with metadata -
audio-to-markdown: Convert audio files to Markdown with transcription -
docx-to-markdown: Convert DOCX files to Markdown -
xlsx-to-markdown: Convert XLSX files to Markdown -
pptx-to-markdown: Convert PPTX files to Markdown -
get-markdown-file: Retrieve an existing Markdown file. File extension must end with: *.md, *.markdown.OPTIONAL: set
MD_SHARE_DIRenv var to restrict the directory from which files can be retrieved, e.g.MD_SHARE_DIR=[SOME_PATH] pnpm run start
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Alternatives
Related Skills
Browse all skillsUI design system toolkit for Senior UI Designer including design token generation, component documentation, responsive design calculations, and developer handoff tools. Use for creating design systems, maintaining visual consistency, and facilitating design-dev collaboration.
Answer questions about the AI SDK and help build AI-powered features. Use when developers: (1) Ask about AI SDK functions like generateText, streamText, ToolLoopAgent, embed, or tools, (2) Want to build AI agents, chatbots, RAG systems, or text generation features, (3) Have questions about AI providers (OpenAI, Anthropic, Google, etc.), streaming, tool calling, structured output, or embeddings, (4) Use React hooks like useChat or useCompletion. Triggers on: "AI SDK", "Vercel AI SDK", "generateText", "streamText", "add AI to my app", "build an agent", "tool calling", "structured output", "useChat".
Leveraging AI coding assistants and tools to boost development productivity, while maintaining oversight to ensure quality results.
Master API documentation with OpenAPI 3.1, AI-powered tools, and modern developer experience practices. Create interactive docs, generate SDKs, and build comprehensive developer portals. Use PROACTIVELY for API documentation or developer portal creation.
Creates educational Teams channel posts for internal knowledge sharing about Claude Code features, tools, and best practices. Applies when writing posts, announcements, or documentation to teach colleagues effective Claude Code usage, announce new features, share productivity tips, or document lessons learned. Provides templates, writing guidelines, and structured approaches emphasizing concrete examples, underlying principles, and connections to best practices like context engineering. Activates for content involving Teams posts, channel announcements, feature documentation, or tip sharing.
Use when working with the OpenAI API (Responses API) or OpenAI platform features (tools, streaming, Realtime API, auth, models, rate limits, MCP) and you need authoritative, up-to-date documentation (schemas, examples, limits, edge cases). Prefer the OpenAI Developer Documentation MCP server tools when available; otherwise guide the user to enable `openaiDeveloperDocs`.