RapidOCR

Name: RapidOCR
Rating: 4.8 (29 reviews)
Author: z4none

Extracts text from images using the RapidOCR library. Accepts image files by path or base64-encoded data and returns recognized text.

Extracts text from images using RapidOCR library through base64-encoded data or file paths for automated document processing workflows.

5395 views3Local (stdio)

ai ml

GitHub

What it does

Extract text from image files by file path
Process base64-encoded image data for OCR
Return structured text content from images
Handle various image formats for text recognition

Best for

Document digitization workflowsAutomated text extraction from scanned documentsProcessing image-based data in pipelines

Two input methods: file path or base64No external API dependencies

About RapidOCR

RapidOCR is a community-built MCP server published by z4none that provides AI assistants with tools and capabilities via the Model Context Protocol. Extract text from images with RapidOCR. Convert image to text efficiently for automated document processing via base64 or file path input. It is categorized under ai ml.

How to install

You can install RapidOCR in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

RapidOCR is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

RapidOCR MCP Server

A MCP server based on RapidOCR, providing an easy-to-use OCR interface.

Usage

uvx run rapidocr-mcp

MCP config

"rapidocr-mcp": {
    "command": "uvx",
    "args": [
    "rapidocr-mcp"
    ],
    "env": {}
}

Screenshots

config usage

Available Methods

ocr_by_content Perform OCR on an image content. Args: base64_data (str): The base64 encoded image content. Returns: List[TextContent]: A list of text content.
ocr_by_path Perform OCR on an image file. Args: path (str): The path to the image file. Returns: List[TextContent]: A list of text content.

remark

Currently, large language models are becoming increasingly powerful and generally possess multimodal capabilities, enabling them to recognize text in images without the need for external OCR tools.

Alternatives

Knowledge Graph Memory

anthropic

80.5k

Build persistent semantic networks for enterprise & engineering data management. Enable data persistence and memory across chats efficiently.

OfficialPopular

2.7k171

Context7

upstash

48.2k

Boost your AI code assistant with Context7: inject real-time API documentation from OpenAPI specification sources into your coding workflow.

OfficialRemotePopular

17.3k832

Blender

ahujasid

17.6k

Connect Blender to Claude AI for seamless 3D modeling. Use AI 3D model generator tools for faster, intuitive, interactive 3D scene creation.

CommunityPopular

3.1k52

Google GenAI Toolbox

google

13.3k

Google GenAI Toolbox: open-source GenAI database agent and AI database connector for Google Cloud database—query Cloud SQL connector, Spanner & AlloyDB with…

OfficialPopular

330