feishu-doc-reader

25views

3installs

Read and extract content from Feishu (Lark) documents using the official Feishu Open API

Install

mkdir -p .claude/skills/feishu-doc-reader && curl -L -o skill.zip "https://mcp.directory/api/skills/download/2525" && unzip -o skill.zip -d .claude/skills/feishu-doc-reader && rm skill.zip

Installs to .claude/skills/feishu-doc-reader

About this skill

Feishu Document Reader

This skill enables reading and extracting content from Feishu (Lark) documents using the official Feishu Open API.

Configuration

Set Up the Skill

Create the configuration file at ./reference/feishu_config.json with your Feishu app credentials:

{
  "app_id": "your_feishu_app_id_here",
  "app_secret": "your_feishu_app_secret_here"
}

Make sure the scripts are executable:

chmod +x scripts/read_doc.sh
chmod +x scripts/get_blocks.sh

Security Note: The configuration file should be kept secure and not committed to version control. Consider using proper file permissions (chmod 600 ./reference/feishu_config.json).

Usage

Basic Document Reading

To read a Feishu document, you need the document token (found in the URL: https://example.feishu.cn/docx/DOC_TOKEN).

Using the shell script (recommended):

# Make sure environment variables are set first
./scripts/read_doc.sh "your_doc_token_here"

# Or specify document type explicitly
./scripts/read_doc.sh "docx_token" "doc"
./scripts/read_doc.sh "sheet_token" "sheet"

Get Detailed Document Blocks (NEW)

For complete document structure with all blocks, use the dedicated blocks script:

# Get full document blocks structure
./scripts/get_blocks.sh "docx_AbCdEfGhIjKlMnOpQrStUv"

# Get specific block by ID
./scripts/get_blocks.sh "docx_token" "block_id"

# Get blocks with children
./scripts/get_blocks.sh "docx_token" "" "true"

Using Python directly for blocks:

python scripts/get_feishu_doc_blocks.py --doc-token "your_doc_token_here"
python scripts/get_feishu_doc_blocks.py --doc-token "docx_token" --block-id "block_id"
python scripts/get_feishu_doc_blocks.py --doc-token "docx_token" --include-children

Supported Document Types

Docx documents (new Feishu docs): Full content extraction with blocks, metadata, and structure
Doc documents (legacy): Basic metadata and limited content
Sheets: Full spreadsheet data extraction with sheet navigation
Slides: Basic metadata (content extraction requires additional permissions)

Features

Enhanced Content Extraction

Structured output: Clean JSON with document metadata, content blocks, and hierarchy
Complete blocks access: Full access to all document blocks including text, tables, images, headings, lists, etc.
Block hierarchy: Proper parent-child relationships between blocks
Text extraction: Automatic text extraction from complex block structures
Table support: Proper table parsing with row/column structure
Image handling: Image URLs and metadata extraction
Link resolution: Internal and external link extraction

Block Types Supported

text: Plain text and rich text content
heading1/2/3: Document headings with proper hierarchy
bullet/ordered: List items with nesting support
table: Complete table structures with cells and formatting
image: Image blocks with tokens and metadata
quote: Block quotes
code: Code blocks with language detection
equation: Mathematical equations
divider: Horizontal dividers
page: Page breaks (in multi-page documents)

Error Handling & Diagnostics

Detailed error messages: Clear explanations for common issues
Permission validation: Checks required permissions before making requests
Token validation: Validates document tokens before processing
Retry logic: Automatic retries for transient network errors
Rate limiting: Handles API rate limits gracefully

Security Features

Secure credential storage: Supports both environment variables and secure file storage
No credential logging: Credentials never appear in logs or output
Minimal permissions: Uses only required API permissions
Access token caching: Efficient token reuse to minimize API calls

Command Line Options

Main Document Reader

# Python script options
python scripts/read_feishu_doc.py --help

# Shell script usage
./scripts/read_doc.sh <doc_token> [doc|sheet|slide]

Blocks Reader (NEW)

# Get full document blocks
./scripts/get_blocks.sh <doc_token>

# Get specific block
./scripts/get_blocks.sh <doc_token> <block_id>

# Include children blocks
./scripts/get_blocks.sh <doc_token> "" true

# Python options
python scripts/get_feishu_doc_blocks.py --help

API Permissions Required

Your Feishu app needs the following permissions:

docx:document:readonly - Read document content
doc:document:readonly - Read legacy document content
sheets:spreadsheet:readonly - Read spreadsheet content

Error Handling

Common errors and solutions:

403 Forbidden: Check app permissions and document sharing settings
404 Not Found: Verify document token is correct and document exists
Token expired: Access tokens are valid for 2 hours, refresh as needed
App ID/Secret invalid: Double-check your credentials in Feishu Open Platform
Insufficient permissions: Ensure your app has the required API permissions
99991663: Application doesn't have permission to access the document
99991664: Document doesn't exist or has been deleted
99991668: Token expired, need to refresh

Examples

Extract document with full structure

# Read document
./scripts/read_doc.sh "docx_AbCdEfGhIjKlMnOpQrStUv"

Get complete document blocks (NEW)

# Get all blocks with full structure
./scripts/get_blocks.sh "docx_AbCdEfGhIjKlMnOpQrStUv"

# Get specific block details
./scripts/get_blocks.sh "docx_AbCdEfGhIjKlMnOpQrStUv" "blk_xxxxxxxxxxxxxx"

Process spreadsheet data

./scripts/read_doc.sh "sheet_XyZ123AbCdEfGhIj" "sheet"

Extract only text content (Python script)

python scripts/read_feishu_doc.py --doc-token "docx_token" --extract-text-only

Security Notes

Never commit credentials: Keep app secrets out of version control
Use minimal permissions: Only request permissions your use case requires
Secure file permissions: Set proper file permissions on secret files (chmod 600)
Environment isolation: Use separate apps for development and production
Audit access: Regularly review which documents your app can access

Troubleshooting

Authentication Issues

Verify your App ID and App Secret in Feishu Open Platform
Ensure the app has been published with required permissions
Check that environment variables or config files are properly set
Test with the test_auth.py script to verify credentials

Document Access Issues

Ensure the document is shared with your app or in an accessible space
Verify the document token format (should start with docx_, doc_, or sheet_)
Check if the document requires additional sharing permissions

Network Issues

Ensure your server can reach open.feishu.cn
Check firewall rules if running in restricted environments
The script includes retry logic for transient network failures

Blocks-Specific Issues

Empty blocks response: Document might be empty or have no accessible blocks
Missing block types: Some block types require additional permissions
Incomplete hierarchy: Use --include-children flag for complete block tree

References

More by openclaw

View all skills by openclaw →

a-stock-analysis

openclaw

A股实时行情与分时量能分析。获取沪深股票实时价格、涨跌、成交量，分析分时量能分布（早盘/尾盘放量）、主力动向（抢筹/出货信号）、涨停封单。支持持仓管理和盈亏分析。Use when: (1) 查询A股实时行情, (2) 分析主力资金动向, (3) 查看分时成交量分布, (4) 管理股票持仓, (5) 分析持仓盈亏。

316125

Creates formal academic research papers following IEEE/ACM formatting standards with proper structure, citations, and scholarly writing style. Use when the user asks to write a research paper, academic paper, or conference paper on any topic.

4773

gog

openclaw

Google Workspace CLI for Gmail, Calendar, Drive, Contacts, Sheets, and Docs.

16470

seedream-image-gen

openclaw

Generate images via Seedream API (doubao-seedream models). Synchronous generation.

4062

weread

openclaw

WeChat Reading (微信读书) CLI tool for fetching notes and highlights. Use when: (1) user asks about weread/微信读书 notes or highlights, (2) fetching today's or recent reading notes, (3) exporting book highlights, (4) managing reading bookshelf, (5) any task involving reading notes from WeChat Reading.

5061

keyword-research

openclaw

Discovers high-value keywords with search intent analysis, difficulty assessment, and content opportunity mapping. Essential for starting any SEO or GEO content strategy.

27857

flutter-development

aj-geddes

Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.

1,6851,430

ui-ux-pro-max

nextlevelbuilder

"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."

1,2681,335

drawio-diagrams-enhanced

jgtolentino

Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.

1,5441,153

godot

bfollington

This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.

1,357809

nano-banana-pro

garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

1,264728

pdf-to-markdown

aliceisjustplaying

Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.

1,491684

Related MCP Servers

Browse all servers

Firecrawl

Unlock AI-ready web data with Firecrawl: scrape any website, handle dynamic content, and automate web scraping for resea

89,5930 tools

DeepWiki

DeepWiki converts deepwiki.com pages into clean Markdown, with fast, secure extraction—perfect as a PDF text, page, or i

1,2790 tools

Ref Tools

Boost AI coding agents with Ref Tools—efficient documentation access for faster, smarter code generation than GitHub Cop

1,0040 tools

Web Fetcher

Web Fetcher uses Playwright for reliable data web scraping and extraction from JavaScript-heavy websites, returning clea

1,0023 tools

Read Website Fast

Extract web content and convert to clean Markdown. Fast data extraction from web pages with caching, robots.txt support,

1351 tools

GetWeb

GetWeb offers reliable web scraping and content extraction. Scrape any website with advanced internet scraping and filte

130 tools

Install

mkdir -p .claude/skills/feishu-doc-reader && curl -L -o skill.zip "https://mcp.directory/api/skills/download/2525" && unzip -o skill.zip -d .claude/skills/feishu-doc-reader && rm skill.zip

Installs to .claude/skills/feishu-doc-reader

Stats

Views

Installs

Author

openclaw

7 skills published

Links

Source Code

feishu-doc-reader

Install

About this skill

Feishu Document Reader

Configuration

Set Up the Skill

Usage

Basic Document Reading

Get Detailed Document Blocks (NEW)

Supported Document Types

Features

Enhanced Content Extraction

Block Types Supported

Error Handling & Diagnostics

Security Features

Command Line Options

Main Document Reader

Blocks Reader (NEW)

API Permissions Required

Error Handling

Examples

Extract document with full structure

Get complete document blocks (NEW)

Process spreadsheet data

Extract only text content (Python script)

Security Notes

Troubleshooting

Authentication Issues

Document Access Issues

Network Issues

Blocks-Specific Issues

References

More by openclaw

a-stock-analysis

research-paper-writer

gog

seedream-image-gen

weread

keyword-research

You might also like

flutter-development

ui-ux-pro-max

drawio-diagrams-enhanced

godot

nano-banana-pro

pdf-to-markdown

Related MCP Servers