Textin MCP Server

Name: Textin MCP Server
Rating: 4.5 (37 reviews)
Author: intsig-textin

Official

Performs OCR on images, PDFs, and Word documents to extract text, convert documents to Markdown format, and extract structured information from various document types.

A server that enables OCR capabilities to recognize text from images, PDFs, and Word documents, convert them to Markdown, and extract key information.

28189 views8Local (stdio)

ai ml productivity

GitHub

What it does

Extract text from images and documents via OCR
Convert PDFs and Office documents to Markdown
Extract structured information and key-value pairs from documents
Process documents from local files or HTTP/HTTPS URLs
Handle multiple file formats including PDF, Word, Excel, and common image formats

Best for

Content creators digitizing printed materialsDevelopers building document processing workflowsResearchers extracting data from scanned documentsTeams converting legacy documents to modern formats

Supports both local files and web URLsMultiple output formats (plain text, Markdown, structured JSON)

About Textin MCP Server

Textin MCP Server is an official MCP server published by intsig-textin that provides AI assistants with tools and capabilities via the Model Context Protocol. Textin MCP Server: OCR to Markdown for images, PDFs & Word docs — fast document OCR, PDF OCR converter and OCR data extraction to extract key info. It is categorized under ai ml, productivity.

How to install

You can install Textin MCP Server in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

Textin MCP Server is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

TextIn OCR MCP

English | 中文

TextIn OCR MCP Server

TextIn MCP Server is a tool for extracting text and performing OCR on documents, including document text recognition, ID recognition, and invoice recognition. It also supports converting documents into Markdown format.

Tools

recognition_text
- Text recognition from images, Word documents, and PDF files.
- Inputs:
  - path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
- Return: Text of the document.
- Supports conversion for:
  - PDF
  - Image (Jpeg, Jpg, Png, Bmp)
doc_to_markdown
- Convert images, PDFs, and Word documents to Markdown.
- Inputs:
  - path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
- Return: Markdown of the document.
- Supports conversion for:
  - PDF
  - Microsoft Office Documents (Word, Excel)
  - Image (Jpeg, Jpg, Png, Bmp)
general_information_extration
- Automatically identify and extract information from documents, or identify and extract user-specified information.
- Inputs:
  - path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
  - key (string[], optional): The non-tabular text information that the user wants to identify, input format is an array of strings.
  - table_header (string[], optional): The table information that the user wants to identify, input format is an array of strings.
- Return: The key information JSON.
- Supports conversion for:
  - PDF
  - Microsoft Office Documents (Word, Excel)
  - Image (Jpeg, Jpg, Png, Bmp)

When the input is a URL, it does not support handling access to protected resources.

Setup

APP_ID and APP_SECRET

Click here to register for a TextIn account.

Get Textin APP_ID and APP_SECRET by following the instructions here.

NPX

{
  "mcpServers": {
    "textin-ocr": {
      "command": "npx",
      "args": [
        "-y",
        "@intsig/server-textin"
      ],
      "env": {
        "APP_ID": "<YOUR_APP_ID>",
        "APP_SECRET": "<YOUR_APP_SECRET>",
        "MCP_SERVER_REQUEST_TIMEOUT": "600000"
      },
      "timeout": 600
    }
  }
}

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.

Alternatives

Knowledge Graph Memory

anthropic

80.5k

Build persistent semantic networks for enterprise & engineering data management. Enable data persistence and memory across chats efficiently.

OfficialPopular

2.7k171

Context7

upstash

48.2k

Boost your AI code assistant with Context7: inject real-time API documentation from OpenAPI specification sources into your coding workflow.

OfficialRemotePopular

17.3k832

GitHub

github

27.6k

Extend your developer tools with GitHub MCP Server for advanced automation, supporting GitHub Student and student packages integration.

OfficialRemotePopular

4.8k268

Task Master

eyaltoledano

25.8k

Boost productivity with Task Master: an AI-powered tool for project management and agile development workflows, integrated with popular editors.

CommunityPopular

5.1k115

Related Skills

Browse all skills

chief-architect

PERSONAL APP ARCHITECT - Strategic development orchestrator for personal productivity applications. Analyzes project context, makes architectural decisions for single-developer projects, delegates to specialized skills, and ensures alignment between user experience goals and technical implementation. Optimized for personal apps targeting 10-100 users.

cto-engineering-metrics

Expert methodology for defining, tracking, and interpreting engineering performance metrics including DORA, team health, productivity, and executive reporting.

teams-channel-post-writer

Creates educational Teams channel posts for internal knowledge sharing about Claude Code features, tools, and best practices. Applies when writing posts, announcements, or documentation to teach colleagues effective Claude Code usage, announce new features, share productivity tips, or document lessons learned. Provides templates, writing guidelines, and structured approaches emphasizing concrete examples, underlying principles, and connections to best practices like context engineering. Activates for content involving Teams posts, channel announcements, feature documentation, or tip sharing.

ai-assisted-development

Leveraging AI coding assistants and tools to boost development productivity, while maintaining oversight to ensure quality results.

productivity-helper

Boost your productivity with automated task management

cursor-local-dev-loop

Optimize local development workflow with Cursor. Triggers on "cursor workflow", "cursor development loop", "cursor productivity", "cursor daily workflow". Use when working with cursor local dev loop functionality. Trigger with phrases like "cursor local dev loop", "cursor loop", "cursor".