youtube-transcribe-skill

230views

30installs

Extract subtitles/transcripts from a YouTube video URL and save as a local file. Use when you need to extract subtitles from a YouTube video.

Install

mkdir -p .claude/skills/youtube-transcribe-skill && curl -L -o skill.zip "https://mcp.directory/api/skills/download/94" && unzip -o skill.zip -d .claude/skills/youtube-transcribe-skill && rm skill.zip

Installs to .claude/skills/youtube-transcribe-skill

About this skill

YouTube Transcript Extraction

Extract subtitles/transcripts from a YouTube video URL and save them as a local file.

Input YouTube URL: $ARGUMENTS

Step 1: Verify URL and Get Video Information

Verify URL Format: Confirm the input is a valid YouTube URL (supports youtube.com/watch?v= or youtu.be/ formats).
Get Video Information: Use WebFetch or firecrawl to fetch the page and extract the video title for subsequent file naming.

Step 2: CLI Quick Extraction (Priority Attempt)

Use command-line tools to quickly extract subtitles.

Check Tool Availability: Execute which yt-dlp.
- If yt-dlp is found, proceed to subtitle download.
- If yt-dlp is NOT found, skip immediately to Step 3.
Execute Subtitle Download (Only if yt-dlp is found):
- Tip: Always add --cookies-from-browser to avoid sign-in restrictions. Default to chrome.
- Retry Logic: If yt-dlp fails with a browser error (e.g., "Could not open Chrome"), ask the user to specify their available browser (e.g., firefox, safari, edge) and retry.
```
# Get the title first (try chrome first)
yt-dlp --cookies-from-browser=chrome --get-title "[VIDEO_URL]"

# Download subtitles
yt-dlp --cookies-from-browser=chrome --write-auto-sub --write-sub --sub-lang zh-Hans,zh-Hant,en --skip-download --output "<Video Title>.%(ext)s" "[VIDEO_URL]"
```
Verify Results:
- Check the command exit code.
- Exit code 0 (Success): Subtitles have been saved locally, task complete.
- Exit code non-0 (Failure):
  - If error is related to browser/cookies, ask user for correct browser and retry Step 2.
  - If other errors (e.g., video unavailable), proceed to Step 3.

Step 3: Browser Automation (Fallback)

When the CLI method fails or yt-dlp is missing, use browser UI automation to extract subtitles.

Check Tool Availability:
- Check if chrome-devtools-mcp tools (specifically mcp__plugin_claude-code-settings_chrome__new_page) are available.
- CRITICAL CHECK: If chrome-devtools-mcp is NOT available AND yt-dlp was NOT found in Step 2:
  - STOP execution.
  - Notify the User: "Unable to proceed. Please either install yt-dlp (for fast CLI extraction) OR configure chrome-devtools-mcp (for browser automation)."
Initialize Browser Session (If tools are available):

Call mcp__plugin_claude-code-settings_chrome__new_page to open the video URL.

3.2 Analyze Page State

Call mcp__plugin_claude-code-settings_chrome__take_snapshot to read the page accessibility tree.

3.3 Expand Video Description

Reason: The "Show transcript" button is usually hidden within the collapsed description area.

Search the snapshot for a button labeled "...more", "...更多", or "Show more" (usually located in the description block below the video title).
Call mcp__plugin_claude-code-settings_chrome__click to click that button.

3.4 Open Transcript Panel

Call mcp__plugin_claude-code-settings_chrome__take_snapshot to get the updated UI snapshot.
Search for a button labeled "Show transcript", "显示转录稿", or "内容转文字".
Call mcp__plugin_claude-code-settings_chrome__click to click that button.

3.5 Extract Content via DOM

Reason: Directly reading the accessibility tree for long lists is slow and consumes many tokens; DOM injection is more efficient.

Call mcp__plugin_claude-code-settings_chrome__evaluate_script to execute the following JavaScript:

() => {
  // Select all transcript segment containers
  const segments = document.querySelectorAll("ytd-transcript-segment-renderer");
  if (!segments.length) return "BUFFERING"; // Retry if empty

  // Iterate and format as "timestamp text"
  return Array.from(segments)
    .map((seg) => {
      const time = seg.querySelector(".segment-timestamp")?.innerText.trim();
      const text = seg.querySelector(".segment-text")?.innerText.trim();
      return `${time} ${text}`;
    })
    .join("\n");
};

If it returns "BUFFERING", wait a few seconds and retry.

3.6 Save and Cleanup

Use the Write tool to save the extracted text as a local file (e.g., <Video Title>.txt).
Call mcp__plugin_claude-code-settings_chrome__close_page to release resources.

Output Requirements

Save the subtitle file to the current working directory.
Filename format: <Video Title>.txt
File content format: Each line should be Timestamp Subtitle Text.
Report upon completion: File path, subtitle language, total number of lines.

More by feiskyer

View all skills by feiskyer →

codex-skill

feiskyer

Use when user asks to leverage codex, gpt-5, or gpt-5.1 to implement something (usually implement a plan or feature designed by Claude). Provides non-interactive automation mode for hands-off task execution without approval prompts.

1108

autonomous-skill

feiskyer

Use when user wants to execute long-running tasks that require multiple sessions to complete. This skill manages task decomposition, progress tracking, and autonomous execution using Claude Code headless mode with auto-continuation. Trigger phrases: "autonomous", "long-running task", "multi-session", "自主执行", "长时任务", "autonomous skill".

947

nanobanana-skill

feiskyer

Generate or edit images using Google Gemini API via nanobanana. Use when the user asks to create, generate, edit images with nanobanana, or mentions image generation/editing tasks.

1066

spec-kit-skill

feiskyer

GitHub Spec-Kit integration for constitution-based spec-driven development. 7-phase workflow. Triggers: "spec-kit", "speckit", "constitution", "specify", ".specify/", "规格驱动开发", "需求规格".

324

kiro-skill

feiskyer

Interactive feature development workflow from idea to implementation. Creates requirements (EARS format), design documents, and task lists. Triggers: "kiro", ".kiro/specs/", "feature spec", "需求文档", "设计文档", "实现计划".

213

flutter-development

aj-geddes

Build beautiful cross-platform mobile apps with Flutter and Dart. Covers widgets, state management with Provider/BLoC, navigation, API integration, and material design.

1,5751,370

ui-ux-pro-max

nextlevelbuilder

"UI/UX design intelligence. 50 styles, 21 palettes, 50 font pairings, 20 charts, 8 stacks (React, Next.js, Vue, Svelte, SwiftUI, React Native, Flutter, Tailwind). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app, .html, .tsx, .vue, .svelte. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient."

1,1181,192

drawio-diagrams-enhanced

jgtolentino

Create professional draw.io (diagrams.net) diagrams in XML format (.drawio files) with integrated PMP/PMBOK methodologies, extensive visual asset libraries, and industry-standard professional templates. Use this skill when users ask to create flowcharts, swimlane diagrams, cross-functional flowcharts, org charts, network diagrams, UML diagrams, BPMN, project management diagrams (WBS, Gantt, PERT, RACI), risk matrices, stakeholder maps, or any other visual diagram in draw.io format. This skill includes access to custom shape libraries for icons, clipart, and professional symbols.

1,4191,110

godot

bfollington

This skill should be used when working on Godot Engine projects. It provides specialized knowledge of Godot's file formats (.gd, .tscn, .tres), architecture patterns (component-based, signal-driven, resource-based), common pitfalls, validation tools, code templates, and CLI workflows. The `godot` command is available for running the game, validating scripts, importing resources, and exporting builds. Use this skill for tasks involving Godot game development, debugging scene/resource files, implementing game systems, or creating new Godot components.

1,200751

nano-banana-pro

garg-aayush

Generate and edit images using Google's Nano Banana Pro (Gemini 3 Pro Image) API. Use when the user asks to generate, create, edit, modify, change, alter, or update images. Also use when user references an existing image file and asks to modify it in any way (e.g., "modify this image", "change the background", "replace X with Y"). Supports both text-to-image generation and image-to-image editing with configurable resolution (1K default, 2K, or 4K for high resolution). DO NOT read the image file first - use this skill directly with the --input-image parameter.

1,159685

pdf-to-markdown

aliceisjustplaying

Convert entire PDF documents to clean, structured Markdown for full context loading. Use this skill when the user wants to extract ALL text from a PDF into context (not grep/search), when discussing or analyzing PDF content in full, when the user mentions "load the whole PDF", "bring the PDF into context", "read the entire PDF", or when partial extraction/grepping would miss important context. This is the preferred method for PDF text extraction over page-by-page or grep approaches.

1,329621

Related MCP Servers

Browse all servers

YouTube Transcripts

Extract and analyze YouTube transcripts in multiple languages. Use our YouTube transcriptor to easily transcribe for You

4881 tools

Fetch (Web Content & YouTube Transcripts)

Fetch is a web scraping tool that extracts web content and YouTube transcripts, converting HTML to Markdown with accurat

1572 tools

YouTube Transcript

Extract and format YouTube transcripts with language selection, paragraph formatting, and enriched metadata for analysis

301 tools

YouTube Data API

Unlock deep YouTube analytics: search videos, track channel stats, explore trends, and analyze high-performing YouTubers

140 tools

Video & Audio Text Extraction

Transcribe for YouTube and other platforms. Extract accurate transcript of a YouTube video for accessibility, analysis,

90 tools

Video Editor

AI-powered video editor that integrates Video Jungle for natural-language YouTube video search, automated clip generatio

2530 tools

Stay ahead of the MCP ecosystem

Get weekly updates on new skills and servers.

Install

mkdir -p .claude/skills/youtube-transcribe-skill && curl -L -o skill.zip "https://mcp.directory/api/skills/download/94" && unzip -o skill.zip -d .claude/skills/youtube-transcribe-skill && rm skill.zip

Installs to .claude/skills/youtube-transcribe-skill

Stats

Views

230

Installs

Author

feiskyer

6 skills published

Links

Source Code

youtube-transcribe-skill

Install

About this skill

YouTube Transcript Extraction

Step 1: Verify URL and Get Video Information

Step 2: CLI Quick Extraction (Priority Attempt)

Step 3: Browser Automation (Fallback)

3.2 Analyze Page State

3.3 Expand Video Description

3.4 Open Transcript Panel

3.5 Extract Content via DOM

3.6 Save and Cleanup

Output Requirements

More by feiskyer

codex-skill

autonomous-skill

nanobanana-skill

spec-kit-skill

kiro-skill

You might also like

flutter-development

ui-ux-pro-max

drawio-diagrams-enhanced

godot

nano-banana-pro

pdf-to-markdown

Related MCP Servers

Stay ahead of the MCP ecosystem