Textin MCP Server

Textin MCP Server

Official
intsig-textin

Performs OCR on images, PDFs, and Word documents to extract text, convert documents to Markdown format, and extract structured information from various document types.

A server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.

28185 views8Local (stdio)

What it does

  • Extract text from images and documents via OCR
  • Convert PDFs and Office documents to Markdown
  • Extract structured information and key-value pairs from documents
  • Process documents from local files or HTTP/HTTPS URLs
  • Handle multiple file formats including PDF, Word, Excel, and common image formats

Best for

Content creators digitizing printed materialsDevelopers building document processing workflowsResearchers extracting data from scanned documentsTeams converting legacy documents to modern formats
Supports both local files and web URLsMultiple output formats (plain text, Markdown, structured JSON)

About Textin MCP Server

Textin MCP Server is an official MCP server published by intsig-textin that provides AI assistants with tools and capabilities via the Model Context Protocol. Textin MCP Server: OCR to Markdown for images, PDFs & Word docs — fast document OCR, PDF OCR converter and OCR data extr It is categorized under ai ml, productivity.

How to install

You can install Textin MCP Server in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

Textin MCP Server is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

TextIn OCR MCP

TextIn

English | 中文

TextIn OCR MCP Server

TextIn MCP Server is a tool for extracting text and performing OCR on documents, including document text recognition, ID recognition, and invoice recognition. It also supports converting documents into Markdown format.

Tools

  • recognition_text

    • Text recognition from images, Word documents, and PDF files.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Text of the document.
    • Supports conversion for:
      • PDF
      • Image (Jpeg, Jpg, Png, Bmp)
  • doc_to_markdown

    • Convert images, PDFs, and Word documents to Markdown.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Markdown of the document.
    • Supports conversion for:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (Jpeg, Jpg, Png, Bmp)
  • general_information_extration

    • Automatically identify and extract information from documents, or identify and extract user-specified information.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
      • key (string[], optional): The non-tabular text information that the user wants to identify, input format is an array of strings.
      • table_header (string[], optional): The table information that the user wants to identify, input format is an array of strings.
    • Return: The key information JSON.
    • Supports conversion for:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (Jpeg, Jpg, Png, Bmp)

When the input is a URL, it does not support handling access to protected resources.

Setup

APP_ID and APP_SECRET

Click here to register for a TextIn account.

Get Textin APP_ID and APP_SECRET by following the instructions here.

NPX

{
  "mcpServers": {
    "textin-ocr": {
      "command": "npx",
      "args": [
        "-y",
        "@intsig/server-textin"
      ],
      "env": {
        "APP_ID": "<YOUR_APP_ID>",
        "APP_SECRET": "<YOUR_APP_SECRET>",
        "MCP_SERVER_REQUEST_TIMEOUT": "600000"
      },
      "timeout": 600
    }
  }
}

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

Alternatives

Related Skills

Browse all skills
ai-assisted-development

Leveraging AI coding assistants and tools to boost development productivity, while maintaining oversight to ensure quality results.

4
teams-channel-post-writer

Creates educational Teams channel posts for internal knowledge sharing about Claude Code features, tools, and best practices. Applies when writing posts, announcements, or documentation to teach colleagues effective Claude Code usage, announce new features, share productivity tips, or document lessons learned. Provides templates, writing guidelines, and structured approaches emphasizing concrete examples, underlying principles, and connections to best practices like context engineering. Activates for content involving Teams posts, channel announcements, feature documentation, or tip sharing.

4
cto-engineering-metrics

Expert methodology for defining, tracking, and interpreting engineering performance metrics including DORA, team health, productivity, and executive reporting.

4
personal-assistant

This skill should be used whenever users request personal assistance tasks such as schedule management, task tracking, reminder setting, habit monitoring, productivity advice, time management, or any query requiring personalized responses based on user preferences and context. On first use, collects comprehensive user information including schedule, working habits, preferences, goals, and routines. Maintains an intelligent database that automatically organizes and prioritizes information, keeping relevant data and discarding outdated context.

3
productivity-helper

Boost your productivity with automated task management

2
cursor-local-dev-loop

Optimize local development workflow with Cursor. Triggers on "cursor workflow", "cursor development loop", "cursor productivity", "cursor daily workflow". Use when working with cursor local dev loop functionality. Trigger with phrases like "cursor local dev loop", "cursor loop", "cursor".

2