Self-hosted podcast transcription and search
Add RSS feeds, transcribe episodes with Whisper, label speakers with pyannote, and search across all your transcripts — everything runs locally in Docker.
- Hybrid search — full-text keyword search with phrase matching (`"exact quotes"`, `OR`, `-exclude`) plus semantic vector search powered by pgvector
- Speaker diarization — automatic speaker labeling with per-episode renaming and AI-inferred speaker names
- Granular timestamps — sentence-level timestamps within speaker sections, clickable to play audio from any point
- Persistent audio player — click any timestamp to play; player continues across page navigation
- Search export — download search reports as Markdown, plain text, or print-friendly PDF
- Queue dashboard — live processing status, filter by stage, error classification with auto-retry
- Episode reprocessing — re-queue any episode from its page after model upgrades or config changes
- Dark mode — toggleable, remembers your preference
- No cloud dependencies — all data stays on your machine, no external API calls
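The keyword operators listed above map naturally onto a small tokenizer. As a hypothetical sketch (not the project's actual parser), a query string could be split into phrases, plain terms, exclusions, and an OR flag like this:

```python
import re

# Matches either a double-quoted phrase or a bare whitespace-delimited token.
TOKEN = re.compile(r'"([^"]+)"|(\S+)')

def parse_query(q: str) -> dict:
    """Split a search string using the operators above:
    "exact quotes" for phrases, OR for any-term matching,
    -word for exclusion. Illustrative only."""
    phrases, terms, excluded, use_or = [], [], [], False
    for quoted, bare in TOKEN.findall(q):
        if quoted:
            phrases.append(quoted)
        elif bare == "OR":
            use_or = True
        elif bare.startswith("-") and len(bare) > 1:
            excluded.append(bare[1:])
        else:
            terms.append(bare)
    return {"phrases": phrases, "terms": terms, "excluded": excluded, "any": use_or}
```

Worth noting: PostgreSQL's `websearch_to_tsquery` accepts quoted phrases, `OR`, and `-` exclusion natively, so a hand-rolled parser like this is only needed when the same terms also feed a separate vector query.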
```sh
# 1. Clone
git clone https://github.com/brlauuu/podlog.git
cd podlog

# 2. Configure (set POSTGRES_PASSWORD and HF_TOKEN)
cp .env.example .env
nano .env

# 3. Build and start
make build
make up
```

Open http://localhost:3000 — that's it.
First run: The worker downloads Whisper and pyannote model weights (~3 GB). Jobs are queued during this phase and start processing once models are cached.
- Docker with Compose V2
- A free HuggingFace account with an access token
- You must accept the pyannote license at pyannote/speaker-diarization-3.1 before diarization will work
```
                      ┌──────────────────────────────────────────────┐
Browser :3000 ───────>│ web (Next.js 14)                             │
                      │ Search, episodes, queue, audio player        │
                      │ Reads PostgreSQL directly for FTS/vector     │
                      │ Proxies to pipeline API for management       │
                      └──────────────┬───────────────────────────────┘
                                     │
                      ┌──────────────▼───────────────────────────────┐
Pipeline API :8000 ──>│ pipeline (FastAPI)                           │
                      │ Feed management, queue control, health       │
                      │ Embed API (MiniLM query embedding)           │
                      └──────────────┬───────────────────────────────┘
                                     │
                      ┌──────────────▼───────────────────────────────┐
                      │ worker (Python)                              │
                      │ download → transcribe → diarize → embed      │
                      │ → infer speakers → archive                   │
                      │ Sequential processing (concurrency=1)        │
                      │ Whisper + pyannote never in memory at once   │
                      └──────────────┬───────────────────────────────┘
                                     │
                      ┌──────────────▼───────────────────────────────┐
                      │ db (PostgreSQL 15 + pgvector)                │
                      │ Episodes, segments, speaker names            │
                      │ FTS via GIN index + vector HNSW index        │
                      │ Job queue with FOR UPDATE SKIP LOCKED        │
                      └──────────────────────────────────────────────┘
```
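The worker's sequential design — one stage at a time, with Whisper and pyannote never loaded together — can be sketched as a helper that frees each model before the next stage starts. The stage and loader names in the usage comment are hypothetical, not the project's actual API:

```python
import gc

def run_stage(load_model, stage, *args):
    """Load a model, run one pipeline stage with it, then drop the
    reference and collect garbage, so at most one large model
    (Whisper or pyannote) is resident at a time."""
    model = load_model()
    try:
        return stage(model, *args)
    finally:
        del model
        gc.collect()

# Hypothetical usage mirroring the pipeline order above:
#   segments = run_stage(load_whisper, transcribe, audio_path)
#   turns    = run_stage(load_pyannote, diarize, audio_path)
```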
5 containers. No Redis, no Celery — the job queue is PostgreSQL-backed.
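A PostgreSQL-backed queue typically claims jobs with `FOR UPDATE SKIP LOCKED`, which lets concurrent workers pull work without ever grabbing the same row. A sketch of such a claim query — table and column names are assumed here, not taken from the project's schema:

```python
# SQL for atomically claiming the oldest queued job. Any DB-API
# connection to PostgreSQL (e.g. psycopg) could execute it.
CLAIM_JOB_SQL = """
UPDATE jobs
   SET status = 'processing', started_at = now()
 WHERE id = (
         SELECT id
           FROM jobs
          WHERE status = 'queued'
          ORDER BY created_at
          LIMIT 1
          FOR UPDATE SKIP LOCKED
       )
RETURNING id, episode_id;
"""

def claim_next_job(conn):
    """Run the claim inside a transaction; returns a row or None."""
    with conn.cursor() as cur:
        cur.execute(CLAIM_JOB_SQL)
        return cur.fetchone()
```

Rows locked by one worker's `SELECT ... FOR UPDATE` are simply skipped by the next worker's claim, which is what makes Redis/Celery unnecessary here.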
Only two variables are required. Everything else has sensible defaults.
| Variable | Default | Description |
|---|---|---|
| `POSTGRES_PASSWORD` | (required) | PostgreSQL password |
| `HF_TOKEN` | (required) | HuggingFace access token for pyannote |
| `WHISPER_MODEL` | `large-v3-turbo` | Model size: `tiny`, `base`, `small`, `medium`, `large-v3`, `large-v3-turbo` |
| `WHISPER_COMPUTE_TYPE` | `int8` | `int8` (fast, recommended for CPU) or `float32` |
| `ARCHIVE_AUDIO` | `true` | Archive audio as compressed MP3 after transcription |
| `FEED_POLL_INTERVAL_HOURS` | `24` | How often to check feeds for new episodes |
See docs/configuration.md for the full list of environment variables.
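Putting the required and most commonly tuned variables together, a minimal `.env` for a CPU-only setup might look like this (both values shown are placeholders):

```env
POSTGRES_PASSWORD=change-me
HF_TOKEN=hf_xxxxxxxxxxxxxxxx
WHISPER_MODEL=large-v3-turbo
WHISPER_COMPUTE_TYPE=int8
```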
| Document | Description |
|---|---|
| Configuration | All environment variables with defaults and explanations |
| Hardware Guide | System requirements, processing benchmarks, tested machine specs |
| Development | Local development setup, running tests, project structure |
```sh
make up         # Start all services
make down       # Stop all services
make build      # Rebuild Docker images
make logs       # Follow logs for all services
make test-unit  # Run unit tests (91 tests, ~1 second)
make shell-db   # Open psql shell
make help       # List all available commands
```

| Layer | Technology | Role |
|---|---|---|
| WhisperX | Whisper large-v3-turbo + CTranslate2 | Speech-to-text transcription |
| faster-whisper | CTranslate2 backend | Fast CPU inference for Whisper |
| pyannote | Speaker diarization 3.1 | Speaker labeling and separation |
| sentence-transformers | all-MiniLM-L6-v2 | Semantic search embeddings (384-dim) |
| pgvector | PostgreSQL vector extension | Approximate nearest neighbor search |
| Next.js 14 | App Router, React Server Components | Web UI |
| Tailwind CSS + shadcn/ui | Utility-first CSS + components | Styling |
| FastAPI | Python async web framework | Pipeline API |
| PostgreSQL 15 | Relational database | Storage, FTS, job queue, vector search |
| Docker Compose | Container orchestration | Deployment |
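The pgvector lookup in the stack above reduces to ordering rows by a distance operator over the 384-dim MiniLM embeddings, served by the HNSW index. A sketch of such a semantic-search query — table and column names are assumed, not taken from the schema:

```python
# Nearest neighbours by cosine distance via pgvector's <=> operator.
# %(query_vec)s would be the MiniLM embedding of the user's query.
SEMANTIC_SEARCH_SQL = """
SELECT episode_id, text,
       embedding <=> %(query_vec)s AS distance
  FROM segments
 ORDER BY embedding <=> %(query_vec)s
 LIMIT %(k)s;
"""
```

pgvector also offers `<->` (L2) and `<#>` (negative inner product); `<=>` is cosine distance, the usual choice for sentence-transformer embeddings.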
Built by @brlauuu and Claude (Anthropic).
O'Saasy License. See LICENSE.
pyannote models are subject to their own license — you must accept this independently at huggingface.co/pyannote/speaker-diarization-3.1. Users are responsible for copyright compliance with podcast audio content.
