Page/Image Scanner

Name: Page/Image Scanner
Rating: 4.7 (23 reviews)
Author: jacksenechal

Provides MCP tools to control SANE-compatible scanners for document digitization. Handles automatic device selection, batch scanning, duplex mode, and multi-page TIFF assembly.

Integrates with SANE-compatible scanners to provide document capture with ADF support, duplex scanning, batch processing, and automatic multi-page TIFF assembly for digitizing physical documents and building document management workflows.

5205 views4Local (stdio)

developer tools

GitHub

What it does

Discover available SANE scanners
Start scan jobs with ADF and duplex support
Batch process multiple pages automatically
Assemble multi-page TIFF documents
Configure scan settings like page size and resolution
Monitor scan job progress and events

Best for

Building document management workflowsDigitizing physical document archivesAutomating office document captureDevelopers integrating scanning into applications

Local-first privacy with stdio transportSmart device selection prefers ADF/duplexLinux SANE backend integration

About Page/Image Scanner

Page/Image Scanner is a community-built MCP server published by jacksenechal that provides AI assistants with tools and capabilities via the Model Context Protocol. Digitize documents with Page/Image Scanner—supports ADF, duplex, batch, and multi-page TIFF for seamless document scanni It is categorized under developer tools.

How to install

You can install Page/Image Scanner in your AI client of choice. Use the install panel on this page to get one-click setup for Cursor, Claude Desktop, VS Code, and other MCP-compatible clients. This server runs locally on your machine via the stdio transport.

License

Page/Image Scanner is released under the NOASSERTION license.

scan-mcp

node-current

Minimal MCP server for scanner capture (ADF/duplex/page-size), batching, and multipage assembly.

Features

Small, typed MCP server exposing tools for device discovery and scan jobs
JSON Schema–validated inputs with deterministic, typed outputs
Smart device selection (prefers ADF/duplex, avoids camera backends), robust defaults
Local-first transports: stdio by default to keep everything on-device, optional HTTP for your own network deployments

Note: This package targets Node 22 and Linux SANE backends (scanimage).

Quick Start (local stdio, default)

Add a server entry to your MCP client configuration:

{
  "mcpServers": {
    "scan": {
      "command": "npx",
      "args": [
        "-y",
        "scan-mcp"
      ],
      "env": {
        "INBOX_DIR": "~/Documents/scanned_documents/inbox"
      }
    }
  }
}

This invocation runs over stdio for a privacy-first, single-machine setup.
Call start_scan_job without a device_id to auto-select a scanner and begin scanning.
Artifacts are written under INBOX_DIR per job: job-*/page_*.tiff, doc_*.tiff, manifest.json, events.jsonl.

Streamable HTTP transport

Prefer to attach the scanner to another machine on your network? scan-mcp also supports the streamable HTTP transport:

scan-mcp --http

Default port is 3001; set MCP_HTTP_PORT to override (for example MCP_HTTP_PORT=3333 scan-mcp --http).
HTTP responses use server-sent events (SSE) for streaming tool output; clients such as Claude Desktop and Windsurf support this transport.
There is currently no authentication; this is intended for internal LAN networking

Install

Run with npx: npx scan-mcp (recommended)
- The CLI runs a quick preflight check for Node 22+ and required scanner/image tools and prints installation hints if anything is missing.
- See recommended server config above
Use npx scan-mcp --http to launch the streamable HTTP transport when running on another machine.
CLI help: scan-mcp --help
From source (for development):
- npm install
- npm run build
For Cline setup, and other automated agentic installation, see llms-install.md

System Requirements

Linux with SANE utilities: scanimage (and optionally scanadf)
TIFF tools: tiffcp (preferred) or ImageMagick convert

Environment Variables

SCAN_MOCK (default: false): mock SANE calls and generate fake TIFFs for testing.
INBOX_DIR (default: scanned_documents/inbox): base directory for job runs and artifacts.
SCANIMAGE_BIN / SCANADF_BIN (defaults: scanimage / scanadf): override binary paths.
TIFFCP_BIN / IM_CONVERT_BIN (defaults: tiffcp / convert): multipage assembly tools.
SCAN_EXCLUDE_BACKENDS (CSV): backends to exclude (e.g., v4l).
SCAN_PREFER_BACKENDS (CSV): preferred backends (e.g., epjitsu,epson2).
PERSIST_LAST_USED_DEVICE (default: true): persist and lightly prefer last used device.
MCP_HTTP_PORT (default: 3001): TCP port for the HTTP transport.

API

Tools

list_devices
- Discover connected scanners with backend details.
- Inputs: none.
get_device_options
- Get SANE options for a specific device.
- Inputs:
  - device_id (string): Target device identifier.
start_scan_job
- Begin a scanning job; omitting device_id triggers auto-selection and default options.
- Inputs (all optional unless noted):
  - device_id (string)
  - resolution_dpi (integer, 50–1200)
  - color_mode (Color | Gray | Lineart)
  - source (Flatbed | ADF | ADF Duplex)
  - duplex (boolean)
  - page_size (Letter | A4 | Legal | Custom)
  - custom_size_mm { width, height }
  - doc_break_policy { type, blank_threshold, page_count, timer_ms, barcode_values }
  - output_format (string, default tiff)
  - tmp_dir (string)
get_job_status
- Inspect job state and artifact counts.
- Inputs:
  - job_id (string)
cancel_job
- Request job cancellation; best effort during scan loops.
- Inputs:
  - job_id (string)
list_jobs
- List recent jobs from the inbox directory.
- Inputs (optional):
  - limit (integer, max 100)
  - state (running | completed | cancelled | error | unknown)
get_manifest
- Fetch a job's manifest.json.
- Inputs:
  - job_id (string)
get_events
- Retrieve a job's events.jsonl log.
- Inputs:
  - job_id (string)

See JSON Schemas in schemas/ for input shapes. Tests assert against these contracts.

How Selection and Defaults Work

Defaults aim for 300dpi, reasonable color mode, and ADF/duplex when available. Full details on scoring and fallbacks live in docs:

Selection and defaults: docs/SELECTION.md

Project Layout

src/mcp.ts — MCP server entry and tool registration
src/services/* — hardware interface and job orchestration
schemas/ — JSON Schemas used for validation and tests
docs/ — architecture, conventions, and deep dives

Development

npm run dev (stdio MCP server), npm run dev:http (HTTP transport)
make verify runs lint, typecheck, and tests
Conventions: docs/CONVENTIONS.md and architecture in docs/BLUEPRINT.md

Roadmap

Tracking ideas and future improvements are documented in docs/ROADMAP.md.

Alternatives

Chrome DevTools MCP

chromedevtools

28.1k

AI-driven control of live Chrome via Chrome DevTools: browser automation, debugging, performance analysis and network mo

OfficialPopular

50711

Chrome DevTools

chromedevtools

28.1k

Use Chrome DevTools for web site test speed, debugging, and performance analysis. The essential chrome developer tools f

OfficialPopular

3.9k172

GitHub

github

27.6k

Extend your developer tools with GitHub MCP Server for advanced automation, supporting GitHub Student and student packag

OfficialRemotePopular

4.5k232

Repomix

yamadashy

22.3k

Optimize your codebase for AI with Repomix—transform, compress, and secure repos for easier analysis with modern AI tool

OfficialPopular

1.0k7

Related Skills

Browse all skills

ui-design-system

UI design system toolkit for Senior UI Designer including design token generation, component documentation, responsive design calculations, and developer handoff tools. Use for creating design systems, maintaining visual consistency, and facilitating design-dev collaboration.

ai-sdk

Answer questions about the AI SDK and help build AI-powered features. Use when developers: (1) Ask about AI SDK functions like generateText, streamText, ToolLoopAgent, embed, or tools, (2) Want to build AI agents, chatbots, RAG systems, or text generation features, (3) Have questions about AI providers (OpenAI, Anthropic, Google, etc.), streaming, tool calling, structured output, or embeddings, (4) Use React hooks like useChat or useCompletion. Triggers on: "AI SDK", "Vercel AI SDK", "generateText", "streamText", "add AI to my app", "build an agent", "tool calling", "structured output", "useChat".

api-documenter

Master API documentation with OpenAPI 3.1, AI-powered tools, and modern developer experience practices. Create interactive docs, generate SDKs, and build comprehensive developer portals. Use PROACTIVELY for API documentation or developer portal creation.

openai-knowledge

Use when working with the OpenAI API (Responses API) or OpenAI platform features (tools, streaming, Realtime API, auth, models, rate limits, MCP) and you need authoritative, up-to-date documentation (schemas, examples, limits, edge cases). Prefer the OpenAI Developer Documentation MCP server tools when available; otherwise guide the user to enable `openaiDeveloperDocs`.

cli-builder

Guide for building TypeScript CLIs with Bun. Use when creating command-line tools, adding subcommands to existing CLIs, or building developer tooling. Covers argument parsing, subcommand patterns, output formatting, and distribution.

ydc-ai-sdk-integration

Integrate Vercel AI SDK applications with You.com tools (web search, AI agent, content extraction). Use when developer mentions AI SDK, Vercel AI SDK, generateText, streamText, or You.com integration with AI SDK.