YouTube Transcriber

YouTube to LLM-ready transcript in one click. Runs locally, costs nothing.

Get Running in 60 Seconds

git clone https://github.com/lifesized/youtube-transcriber.git
cd youtube-transcriber
npm run setup
npm run dev

Open http://localhost:19720 — paste a YouTube URL, hit Extract, done.

npm run setup installs all dependencies (yt-dlp, ffmpeg, Whisper, MLX on Apple Silicon) and configures everything automatically. Requires Node.js 18+, Python 3.8+, and a package manager (Homebrew / apt / dnf / pacman).
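Before running setup, it can help to confirm the stated prerequisites are on your PATH. The following is a minimal sketch, not part of the project; the helper names `missing_tools` and `python_is_new_enough` are made up for illustration, and the tool names checked (`node`, `python3`, `git`) are an assumption about what the commands above need.

```python
import shutil
import sys
from typing import Callable, Iterable, List, Optional


def missing_tools(names: Iterable[str],
                  which: Callable[[str], Optional[str]] = shutil.which) -> List[str]:
    """Return the tool names that cannot be found on PATH."""
    return [name for name in names if which(name) is None]


def python_is_new_enough(version=sys.version_info, minimum=(3, 8)) -> bool:
    """Check the interpreter against the README's Python 3.8+ requirement."""
    return tuple(version[:2]) >= minimum


if __name__ == "__main__":
    # Hypothetical prerequisite list; adjust to your platform.
    print("missing:", missing_tools(["node", "python3", "git"]))
    print("python ok:", python_is_new_enough())
```

The Node.js 18+ requirement is not checked here; `node --version` shows it.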

Use as a Skill

Two options — pick the one that fits your setup:

Full Skill (all features, requires running service)

# Claude Code
cp -r contrib/claude-code ~/.claude/skills/youtube-transcriber

# OpenClaw
cp -r contrib/openclaw ~/.openclaw/skills/youtube-transcriber

Requires the app running at localhost:19720 (npm run dev). Gives you Whisper fallback, speaker diarization, persistent library, and all API features.

Lite Skill (zero setup, just yt-dlp)

mkdir -p ~/.claude/skills/youtube-transcriber
cp contrib/claude-code/SKILL-lite.md ~/.claude/skills/youtube-transcriber/SKILL.md

Works with just yt-dlp installed — no server needed. Extracts YouTube subtitles directly. Automatically upgrades to the full service when it detects it running.
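The lite/full switch described above can be pictured roughly as follows. This is a sketch under assumptions, not the skill's actual logic: it assumes "detects it running" means something is listening on localhost:19720, and the function names are hypothetical. The yt-dlp options used are real subtitle flags.

```python
import socket
from typing import List

SERVICE_HOST = "localhost"  # from the README
SERVICE_PORT = 19720        # from the README


def service_is_running(host: str = SERVICE_HOST, port: int = SERVICE_PORT,
                       timeout: float = 0.5) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def subtitle_command(url: str, lang: str = "en") -> List[str]:
    """Build a yt-dlp command that fetches subtitles without the video."""
    return [
        "yt-dlp",
        "--skip-download",    # do not download the media itself
        "--write-subs",       # uploader-provided captions
        "--write-auto-subs",  # auto-generated captions
        "--sub-langs", lang,
        "--sub-format", "vtt",
        url,
    ]


def choose_mode(full_service_up: bool) -> str:
    """Prefer the full service when it is reachable, else the lite path."""
    return "full" if full_service_up else "lite"
```

For example, `choose_mode(service_is_running())` returns `"lite"` when the dev server is not running, in which case the list from `subtitle_command(url)` can be passed to `subprocess.run`.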

Feature	Lite	Full
YouTube captions	Yes	Yes
Auto-generated subs	Yes	Yes
Whisper transcription	—	Yes
Speaker diarization	—	Yes
Persistent library	—	Yes
Setup	`yt-dlp` only	Node + Python + service

Triggers

"summarize https://youtube.com/watch?v=..."
"transcribe https://youtube.com/watch?v=..."
"s https://youtube.com/watch?v=..." / "t https://youtube.com/watch?v=..." / "ts https://youtube.com/watch?v=..."

Or just paste a YouTube URL — the skill auto-activates.

Use as an MCP Server

Works with Claude Desktop, Cursor, and any MCP-compatible client. During npm run setup, choose "yes" when prompted to install the MCP server.

npm run mcp:config   # prints config with your absolute path

Add the output to your client config (see the full setup guide):

Client	Config file
Claude Desktop	`~/Library/Application Support/Claude/claude_desktop_config.json`
Claude Code	`.claude/mcp.json` or `claude mcp add`
Cursor	`.cursor/mcp.json`

Available tools: transcribe, transcribe_and_summarize, list_transcripts, search_transcripts, get_transcript, delete_transcript, summarize_transcript
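If your client config already lists other servers, the printed entry has to be merged in rather than pasted over the file. The sketch below assumes the output of npm run mcp:config is a JSON object with a top-level `mcpServers` key, which is the convention Claude Desktop and Cursor use; the server name, command, and path shown are placeholders, not the real output.

```python
import json
from typing import Any, Dict


def merge_mcp_servers(existing: Dict[str, Any],
                      addition: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of `existing` with the servers from `addition` merged in."""
    merged = dict(existing)
    servers = dict(existing.get("mcpServers", {}))
    servers.update(addition.get("mcpServers", {}))
    merged["mcpServers"] = servers
    return merged


# Hypothetical example values; the real entry comes from `npm run mcp:config`.
existing_config = {"mcpServers": {"other-server": {"command": "other"}}}
printed_config = {
    "mcpServers": {
        "youtube-transcriber": {
            "command": "node",
            "args": ["/absolute/path/to/youtube-transcriber/mcp-server/index.js"],
        }
    }
}

print(json.dumps(merge_mcp_servers(existing_config, printed_config), indent=2))
```

Existing entries are kept, and an entry with the same name is replaced by the new one.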

How It Works

Paste a URL. The app grabs the transcript using the fastest method available on your system:

YouTube Captions: fetches official captions when they exist (under 5 sec)
Cloud Whisper: optional Groq or OpenAI API with your own key (10-30 sec for a 10-minute video)
MLX Whisper: local GPU transcription on Apple Silicon (30-60 sec for a 10-minute video)
OpenAI Whisper: local CPU fallback that works everywhere (2-5 min for a 10-minute video)

Works fully offline by default. Cloud Whisper is optional — bring your own API key to enable it.
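One way to picture this selection is a fallback chain that tries each method in the order listed and stops at the first that returns text. This is an illustrative sketch only; the function names are hypothetical stand-ins, and the app's real selection logic may differ.

```python
from typing import Callable, List, Optional, Tuple

# Each method is a (name, function) pair; the function returns transcript text,
# or None when that method is unavailable for this video or this machine.
Method = Tuple[str, Callable[[str], Optional[str]]]


def transcribe_with_fallback(url: str, methods: List[Method]) -> Tuple[str, str]:
    """Try each method in order and return (method_name, transcript)."""
    for name, run in methods:
        try:
            text = run(url)
        except Exception:
            text = None  # treat a failure like "not available" and move on
        if text:
            return name, text
    raise RuntimeError("no transcription method succeeded")


# Stand-in functions for illustration only.
def captions(url: str) -> Optional[str]:
    return None  # pretend the video has no captions


def cloud_whisper(url: str) -> Optional[str]:
    return None  # pretend no API key is configured


def mlx_whisper(url: str) -> Optional[str]:
    return "hello world"  # pretend local MLX succeeds


def openai_whisper(url: str) -> Optional[str]:
    return "hello world (cpu)"


order = [("captions", captions), ("cloud", cloud_whisper),
         ("mlx", mlx_whisper), ("openai", openai_whisper)]
print(transcribe_with_fallback("https://youtube.com/watch?v=...", order))
# prints ('mlx', 'hello world')
```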

Features

Local + cloud transcription — free local Whisper by default, optional cloud Whisper (Groq/OpenAI) for faster results with your own API key
Multi-language captions — request captions in any language YouTube supports (see Language Preference below)
Summarize with LLM — send any transcript straight to ChatGPT or Claude. ChatGPT opens with the prompt pre-filled; Claude copies it to your clipboard so you can paste (⌘V) into a new chat
Queue system — batch-process multiple videos
Search & filter your transcript library
Export as Markdown or copy to clipboard with timestamps
Duplicate detection — same video won't be saved twice
Speaker diarization — optional speaker identification with pyannote.audio
SQLite storage — all data stays on your machine
Fully offline-capable after initial setup
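Duplicate detection needs some stable key per video. The README does not say how the app does it; a common approach, sketched here as an assumption, is to reduce every URL form to its video ID and compare IDs. The helper names are hypothetical.

```python
from typing import Optional, Set
from urllib.parse import parse_qs, urlparse


def video_id(url: str) -> Optional[str]:
    """Extract the video ID from common YouTube URL shapes, else None."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    if host == "youtu.be":
        return parsed.path.lstrip("/").split("/")[0] or None
    if host in ("youtube.com", "m.youtube.com"):
        if parsed.path == "/watch":
            values = parse_qs(parsed.query).get("v")
            return values[0] if values else None
        for prefix in ("/shorts/", "/embed/", "/live/"):
            if parsed.path.startswith(prefix):
                return parsed.path[len(prefix):].split("/")[0] or None
    return None


def is_duplicate(url: str, saved_ids: Set[str]) -> bool:
    """True when the URL points at a video whose ID is already saved."""
    vid = video_id(url)
    return vid is not None and vid in saved_ids
```

With this, `https://youtu.be/<id>` and `https://www.youtube.com/watch?v=<id>&t=10s` map to the same key.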

Full REST API docs: docs/API.md | OpenAPI spec: docs/openapi.yaml

Language Preference

By default, the app fetches English captions. You can change this per-request or globally.

Per-request — pass lang in the API body:

curl -X POST http://localhost:19720/api/transcripts \
  -H 'Content-Type: application/json' \
  -d '{"url": "https://youtube.com/watch?v=...", "lang": "es"}'
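The same request can be made from a script. This sketch mirrors the curl call above using only the Python standard library; it assumes the response body is JSON and makes no claim about its fields (see docs/API.md for the actual schema). The function names are hypothetical.

```python
import json
import urllib.request
from typing import Any, Dict, Optional

API_URL = "http://localhost:19720/api/transcripts"  # endpoint from the curl example


def build_request(video_url: str, lang: Optional[str] = None) -> urllib.request.Request:
    """Build the POST request; `lang` is omitted from the body when not given."""
    payload: Dict[str, Any] = {"url": video_url}
    if lang:
        payload["lang"] = lang
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def request_transcript(video_url: str, lang: Optional[str] = None) -> Any:
    """Send the request and return the decoded JSON (shape not assumed here)."""
    with urllib.request.urlopen(build_request(video_url, lang)) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `request_transcript(url, "es")` requires the app to be running at localhost:19720.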

Multi-language priority — tries each language in order, falls back to first available:

-d '{"url": "...", "lang": "ja,en"}'   # Japanese preferred, English fallback
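The priority rule can be read as: walk the comma-separated list in order, take the first language the video actually has, and otherwise take whatever caption track comes first. A minimal sketch of that reading follows; `pick_caption_language` is a hypothetical name, and exact-match comparison of language codes is an assumption.

```python
from typing import List, Optional


def pick_caption_language(lang: str, available: List[str]) -> Optional[str]:
    """Pick a caption language from `available` using a priority string like "ja,en"."""
    preferred = [code.strip() for code in lang.split(",") if code.strip()]
    for code in preferred:
        if code in available:
            return code
    # None of the preferred languages exist: fall back to the first available.
    return available[0] if available else None


print(pick_caption_language("ja,en", ["en", "fr"]))
# prints en
```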

Global default — set in .env:

YTT_CAPTION_LANGS="zh-Hans,zh-Hant,en"

The MCP tools (transcribe, transcribe_and_summarize) also accept an optional lang parameter.
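Taken together, the settings above suggest a resolution order: a per-request lang wins, then YTT_CAPTION_LANGS, then English. The sketch below encodes that order; the precedence of the request over the environment is an assumption, and `resolve_langs` is a hypothetical helper rather than a function from the codebase.

```python
import os
from typing import List, Mapping, Optional

DEFAULT_LANGS = ["en"]  # the README states English is the default


def parse_langs(value: Optional[str]) -> List[str]:
    """Split a comma-separated language string, dropping empty entries."""
    if not value:
        return []
    return [code.strip() for code in value.split(",") if code.strip()]


def resolve_langs(request_lang: Optional[str] = None,
                  env: Mapping[str, str] = os.environ) -> List[str]:
    """Return the language priority list for one request."""
    return (parse_langs(request_lang)
            or parse_langs(env.get("YTT_CAPTION_LANGS"))
            or list(DEFAULT_LANGS))
```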

Manual Installation

If the automated setup doesn't work or you prefer to do it yourself:

git clone https://github.com/lifesized/youtube-transcriber.git
cd youtube-transcriber

# Install Node dependencies
npm install

# Set up Python virtual environment
python3 -m venv .venv
source .venv/bin/activate  # Windows: .venv\Scripts\activate

# Install Whisper
pip install openai-whisper

# Optional: MLX Whisper for Apple Silicon
pip install mlx-whisper

# Configure environment
cp .env.example .env
# Edit .env with your paths

Environment variables (.env):

DATABASE_URL="file:./dev.db"
WHISPER_CLI="/path/to/your/.venv/bin/whisper"
WHISPER_PYTHON_BIN="/path/to/your/.venv/bin/python3"

# Optional — local Whisper
# WHISPER_BACKEND="auto"    # auto, mlx, or openai
# WHISPER_DEVICE="auto"     # auto, cpu, mps
# WHISPER_TIMEOUT_MS="480000"

# Optional — cloud Whisper (BYOK)
# WHISPER_CLOUD_PROVIDER="groq"   # groq (default) or openai
# WHISPER_CLOUD_API_KEY="gsk_..."
# WHISPER_CLOUD_MODEL=""          # optional model override

Windows paths:

WHISPER_CLI="C:\\Users\\YourName\\project\\.venv\\Scripts\\whisper.exe"
WHISPER_PYTHON_BIN="C:\\Users\\YourName\\project\\.venv\\Scripts\\python.exe"

Troubleshooting

"spawn whisper ENOENT" error

Check that WHISPER_CLI and WHISPER_PYTHON_BIN paths in .env are correct
Use absolute paths, not relative paths
Restart the dev server after updating .env
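These checks can be scripted. The sketch below takes a mapping of variable names to values (for example `os.environ`, or values you have parsed from .env yourself) and reports which of the two Whisper paths are unset, relative, or missing on disk. `check_whisper_paths` is a hypothetical helper, not part of the project.

```python
import os
from typing import List, Mapping

REQUIRED_PATH_VARS = ("WHISPER_CLI", "WHISPER_PYTHON_BIN")


def check_whisper_paths(env: Mapping[str, str]) -> List[str]:
    """Return a list of human-readable problems; empty means the paths look fine."""
    problems = []
    for name in REQUIRED_PATH_VARS:
        value = env.get(name, "")
        if not value:
            problems.append(f"{name} is not set")
        elif not os.path.isabs(value):
            problems.append(f"{name} is not an absolute path: {value}")
        elif not os.path.isfile(value):
            problems.append(f"{name} does not point to a file: {value}")
    return problems


if __name__ == "__main__":
    for problem in check_whisper_paths(os.environ):
        print(problem)
```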

Slow transcription

Enable cloud Whisper for the fastest option: set WHISPER_CLOUD_API_KEY in .env (Groq free tier available)
On Apple Silicon, install mlx-whisper for 3-5x local speedup
Use smaller Whisper models (tiny, base) for faster local results
Set WHISPER_BACKEND="mlx" in .env to force MLX

Rate limiting / bot detection

The app automatically tries multiple InnerTube clients
Wait a few minutes and retry if YouTube blocks requests
Disable VPN if you're getting consistent 403 errors
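If you are calling the app from a script, "wait and retry" can be automated with exponential backoff. This is a generic sketch, not something the app provides; the default delays are arbitrary choices, and `retry_with_backoff` is a hypothetical name.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(action: Callable[[], T], attempts: int = 4,
                       first_delay: float = 60.0, factor: float = 2.0,
                       sleep: Callable[[float], None] = time.sleep) -> T:
    """Call `action` until it succeeds, waiting longer after each failure."""
    delay = first_delay
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except Exception:
            if attempt == attempts:
                raise  # out of attempts: surface the last error
            sleep(delay)
            delay *= factor
    raise ValueError("attempts must be at least 1")
```

With the defaults, a request that keeps failing is retried after 60, 120, and 240 seconds before the error is raised.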

Contributing

Contributions welcome — feel free to submit issues or pull requests.

License

GNU Affero General Public License v3.0

Credits

Built by lifesized.

Built with: Intent by Augment, Cursor, Codex, Claude Code, and Ghostty.
