Orion Vision (Azure Form Recognizer)

Orion Vision (Azure Form Recognizer)

cognitive-stack

Extracts structured data from documents like receipts, invoices, and ID cards using Azure Form Recognizer. Automates document processing by converting images and PDFs into structured data.

Integrates with Azure Form Recognizer to extract structured data from documents including receipts, invoices, ID documents, and business cards for automated document processing workflows.

2308 views2Local (stdio)

What it does

  • Extract data from receipts and invoices
  • Process ID documents and business cards
  • Analyze various document formats
  • Convert unstructured documents to structured data
  • Integrate with Azure Document Intelligence

Best for

Businesses automating invoice processingDevelopers building document workflowsCompanies digitizing paper recordsApplications requiring form data extraction
Works with multiple MCP clientsTypeScript type safetyOne-click Cline marketplace install

About Orion Vision (Azure Form Recognizer)

Orion Vision (Azure Form Recognizer) is a community-built MCP server published by cognitive-stack that provides AI assistants with tools and capabilities via the Model Context Protocol. Automate document workflows with Orion Vision and Azure Form Recognizer for intelligent document processing and assembly It is categorized under ai ml.

How to install

You can install Orion Vision (Azure Form Recognizer) in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

Orion Vision (Azure Form Recognizer) is released under the MIT license. This is a permissive open-source license, meaning you can freely use, modify, and distribute the software.

Orion Vision MCP Server 🚀

🔌 Compatible with Cline, Cursor, Claude Desktop, and any other MCP Clients!

Orion Vision MCP is also compatible with any MCP client

The Model Context Protocol (MCP) is an open standard that enables AI systems to interact seamlessly with various data sources and tools, facilitating secure, two-way connections.

The Orion Vision MCP server provides:

  • Seamless integration with Azure Form Recognizer / Document Intelligence
  • Document analysis and form data extraction capabilities
  • Support for various document types (receipts, invoices, ID documents, etc.)
  • Type-safe operations with TypeScript

Prerequisites 🔧

Before you begin, ensure you have:

  • Azure Form Recognizer / Document Intelligence endpoint and key
  • Claude Desktop or Cursor
  • Node.js (v20 or higher)
  • Git installed (only needed if using Git installation method)

Orion Vision MCP server installation ⚡

Running with NPX

npx -y orion-vision-mcp@latest

Installing via Smithery

To install Orion Vision MCP Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @orion-vision/mcp --client claude

Configuring MCP Clients ⚙️

Configuring Cline 🤖

The easiest way to set up the Orion Vision MCP server in Cline is through the marketplace with a single click:

  1. Open Cline in VS Code
  2. Click on the Cline icon in the sidebar
  3. Navigate to the "MCP Servers" tab (4 squares)
  4. Search "Orion Vision" and click "install"
  5. When prompted, enter your Azure Form Recognizer credentials

Alternatively, you can manually set up the Orion Vision MCP server in Cline:

  1. Open the Cline MCP settings file:
# For macOS:
code ~/Library/Application\ Support/Code/User/globalStorage/saoudrizwan.claude-dev/settings/cline_mcp_settings.json

# For Windows:
code %APPDATA%\Code\User\globalStorage\saoudrizwan.claude-dev\settings\cline_mcp_settings.json
  1. Add the Orion Vision server configuration to the file:
{
  "mcpServers": {
    "orion-vision-mcp": {
      "command": "npx",
      "args": ["-y", "orion-vision-mcp@latest"],
      "env": {
        "AZURE_FORM_RECOGNIZER_ENDPOINT": "your-endpoint-here",
        "AZURE_FORM_RECOGNIZER_KEY": "your-key-here"
      },
      "disabled": false,
      "autoApprove": []
    }
  }
}
  1. Save the file and restart Cline if it's already running.

Configuring Cursor 🖥️

Note: Requires Cursor version 0.45.6 or higher

To set up the Orion Vision MCP server in Cursor:

  1. Open Cursor Settings
  2. Navigate to Features > MCP Servers
  3. Click on the "+ Add New MCP Server" button
  4. Fill out the following information:
    • Name: Enter a nickname for the server (e.g., "orion-vision-mcp")
    • Type: Select "command" as the type
    • Command: Enter the command to run the server:
    env AZURE_FORM_RECOGNIZER_ENDPOINT=your-endpoint AZURE_FORM_RECOGNIZER_KEY=your-key npx -y orion-vision-mcp@latest
    

    Important: Replace your-endpoint and your-key with your Azure Form Recognizer credentials

Configuring the Claude Desktop app 🖥️

For macOS:

# Create the config file if it doesn't exist
touch "$HOME/Library/Application Support/Claude/claude_desktop_config.json"

# Opens the config file in TextEdit
open -e "$HOME/Library/Application Support/Claude/claude_desktop_config.json"

For Windows:

code %APPDATA%\Claude\claude_desktop_config.json

Add the Orion Vision server configuration:

{
  "mcpServers": {
    "orion-vision-mcp": {
      "command": "npx",
      "args": ["-y", "orion-vision-mcp@latest"],
      "env": {
        "AZURE_FORM_RECOGNIZER_ENDPOINT": "your-endpoint-here",
        "AZURE_FORM_RECOGNIZER_KEY": "your-key-here"
      }
    }
  }
}

Usage in Claude Desktop App 🎯

Once the installation is complete, and the Claude desktop app is configured, you must completely close and re-open the Claude desktop app to see the orion-vision-mcp server. You should see a hammer icon in the bottom left of the app, indicating available MCP tools.

Orion Vision Examples

  1. Analyze a Document:
Analyze the document at "https://example.com/document.pdf" using Azure Form Recognizer.
  1. Extract Form Data:
Extract data from the invoice at "https://example.com/invoice.pdf".
  1. Process ID Document:
Process the ID document at "https://example.com/id.pdf" and extract relevant information.

Troubleshooting 🛠️

Common Issues

  1. Server Not Found

    • Verify the npm installation by running npm --version
    • Check Claude Desktop configuration syntax
    • Ensure Node.js is properly installed by running node --version
  2. Azure Form Recognizer Credentials Issues

    • Confirm your Azure Form Recognizer endpoint and key are valid
    • Check the credentials are correctly set in the config
    • Verify no spaces or quotes around the credentials
  3. Document Processing Issues

    • Verify the document URL is accessible
    • Check the document format is supported
    • Ensure the document is not corrupted or password-protected

Acknowledgments ✨

  • Model Context Protocol for the MCP specification
  • Anthropic for Claude Desktop
  • Microsoft Azure for Form Recognizer / Document Intelligence

Alternatives

Related Skills

Browse all skills
azure-deployment-preflight

Performs comprehensive preflight validation of Bicep deployments to Azure, including template syntax validation, what-if analysis, and permission checks. Use this skill before any deployment to Azure to preview changes, identify potential issues, and ensure the deployment will succeed. Activate when users mention deploying to Azure, validating Bicep files, checking deployment permissions, previewing infrastructure changes, running what-if, or preparing for azd provision.

6
terraform-module-library

Build reusable Terraform modules for AWS, Azure, and GCP infrastructure following infrastructure-as-code best practices. Use when creating infrastructure modules, standardizing cloud provisioning, or implementing reusable IaC components.

5
azure-ai-formrecognizer-java

Build document analysis applications with Azure Document Intelligence (Form Recognizer) SDK for Java. Use when extracting text, tables, key-value pairs from documents, receipts, invoices, or building custom document models.

0
azure-ai-document-intelligence-dotnet

Azure AI Document Intelligence SDK for .NET. Extract text, tables, and structured data from documents using prebuilt and custom models. Use for invoice processing, receipt extraction, ID document analysis, and custom document models. Triggers: "Document Intelligence", "DocumentIntelligenceClient", "form recognizer", "invoice extraction", "receipt OCR", "document analysis .NET".

0
azure-devops-rest-api

Guide for working with Azure DevOps REST APIs and OpenAPI specifications. Use this skill when implementing new Azure DevOps API integrations, exploring API capabilities, understanding request/response formats, or referencing the official OpenAPI specifications from the vsts-rest-api-specs repository.

14
senior-computer-vision

World-class computer vision skill for image/video processing, object detection, segmentation, and visual AI systems. Expertise in PyTorch, OpenCV, YOLO, SAM, diffusion models, and vision transformers. Includes 3D vision, video analysis, real-time processing, and production deployment. Use when building vision AI systems, implementing object detection, training custom vision models, or optimizing inference pipelines.

11