RapidOCR

RapidOCR

z4none

Extracts text from images using the RapidOCR library. Accepts image files by path or base64-encoded data and returns recognized text.

Extracts text from images using RapidOCR library through base64-encoded data or file paths for automated document processing workflows.

5382 views3Local (stdio)

What it does

  • Extract text from image files by file path
  • Process base64-encoded image data for OCR
  • Return structured text content from images
  • Handle various image formats for text recognition

Best for

Document digitization workflowsAutomated text extraction from scanned documentsProcessing image-based data in pipelines
Two input methods: file path or base64No external API dependencies

About RapidOCR

RapidOCR is a community-built MCP server published by z4none that provides AI assistants with tools and capabilities via the Model Context Protocol. Extract text from images with RapidOCR. Convert image to text efficiently for automated document processing via base64 o It is categorized under ai ml.

How to install

You can install RapidOCR in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

RapidOCR is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

RapidOCR MCP Server

A MCP server based on RapidOCR, providing an easy-to-use OCR interface.

Usage

uvx run rapidocr-mcp

MCP config

"rapidocr-mcp": {
    "command": "uvx",
    "args": [
    "rapidocr-mcp"
    ],
    "env": {}
}

Screenshots

config usage

Available Methods

  • ocr_by_content Perform OCR on an image content. Args: base64_data (str): The base64 encoded image content. Returns: List[TextContent]: A list of text content.

  • ocr_by_path Perform OCR on an image file. Args: path (str): The path to the image file. Returns: List[TextContent]: A list of text content.

remark

Currently, large language models are becoming increasingly powerful and generally possess multimodal capabilities, enabling them to recognize text in images without the need for external OCR tools.

Alternatives