Developer Diary Audio Recorder

A cross-platform Electron desktop application for continuous, multi-hour audio recording sessions with AI-powered transcription and developer diary generation.

🎯 What is this?

This app helps developers document their workflow by:

Recording long audio sessions (hours if needed) of their work commentary
Transcribing the audio locally using Whisper (with automatic language detection)
Generating structured developer diary reports summarizing daily work

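Privacy-first and optimized for daily developer use: recording and transcription run locally and offline, and only diary generation sends the transcript to the OpenAI API.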

✨ Features

✅ Long-duration audio recording - Record for hours without memory issues
✅ Chunk-based persistence - Audio saved directly to disk in 60-second chunks
✅ Local transcription - Powered by Whisper CLI with automatic language detection
✅ Mixed-language support - Handles Portuguese, English, and other languages seamlessly
✅ Developer diary generation - AI-powered daily summaries via OpenAI API
✅ Offline-first - Recording and transcription work without internet
✅ Cross-platform - Linux, macOS, Windows support via Electron

📋 Requirements

Core Dependencies

Node.js 16+ and npm
Electron (installed via npm)
ffmpeg (optional for recording, where it is recommended for audio post-processing; required by Whisper for transcription)

For Transcription

Whisper CLI - Required for local transcription feature

For Diary Generation

OpenAI API key - Required for developer diary generation

🚀 Installation

1. Clone and Install

git clone <repository-url>
cd dev-diary-audio
npm install

2. Install ffmpeg

ffmpeg fixes WebM metadata to enable seeking in media players.

Ubuntu/Debian:

sudo apt install ffmpeg

macOS:

brew install ffmpeg

Windows: Download from ffmpeg.org and add to PATH.

3. Install Whisper CLI (Required for Transcription)

Whisper requires Python 3.8+.

Install Python (Ubuntu/Debian shown):

sudo apt install python3 python3-venv python3-pip

Create and activate a virtual environment (Linux/macOS shell):

mkdir whisper-local
cd whisper-local
python3 -m venv .venv
source .venv/bin/activate

Install via pip:

pip install --upgrade pip
pip install openai-whisper

Verify installation (with the virtual environment still active, since the whisper command is only on PATH while it is):

whisper --help

Note: Whisper also requires ffmpeg (see step 2 above).

For more details, see the official Whisper repository.
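
Before relying on transcription, it can help to confirm that both command-line tools are reachable from the process that will launch them. The sketch below is not taken from the app: `isOnPath` is a hypothetical helper that tries to spawn a command and reports whether the executable was found. On Windows, tools installed as `.cmd` shims may need a shell to launch, which this sketch does not handle.

```javascript
const { spawnSync } = require('node:child_process');

// Hypothetical helper: returns true if `command` can be spawned at all.
// A missing executable surfaces as result.error with code 'ENOENT'.
function isOnPath(command, args = ['--help']) {
  const result = spawnSync(command, args, { stdio: 'ignore', timeout: 15000 });
  return !(result.error && result.error.code === 'ENOENT');
}

for (const tool of ['whisper', 'ffmpeg']) {
  console.log(`${tool}: ${isOnPath(tool) ? 'found' : 'not found on PATH'}`);
}
```

If `whisper` is reported as missing even though it is installed, the virtual environment is probably not active in the shell that ran the check.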

4. Configure OpenAI API (Required for Diary Generation)

Create a .env file in the project root:

cp .env.example .env

Edit .env and add your OpenAI API key:

OPENAI_API_KEY=your-api-key-here
OPENAI_MODEL=gpt-4o-mini
OPENAI_MAX_TOKENS=4000
OPENAI_TEMPERATURE=0.7

Get an API key: platform.openai.com/api-keys
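
Values in .env are plain strings, so the numeric settings need converting before use. The following is a minimal sketch of that step, assuming the variables have already been loaded into the environment (for example by a package such as dotenv). `loadOpenAIConfig` is a hypothetical name, and the defaults simply repeat the example values above.

```javascript
// Hypothetical loader: turns the string values from .env into typed settings.
function loadOpenAIConfig(env = process.env) {
  const apiKey = env.OPENAI_API_KEY;
  // Treat the placeholder from .env.example as "not configured".
  if (!apiKey || apiKey === 'your-api-key-here') {
    throw new Error('OPENAI_API_KEY is not set');
  }
  return {
    apiKey,
    model: env.OPENAI_MODEL || 'gpt-4o-mini',
    maxTokens: Number.parseInt(env.OPENAI_MAX_TOKENS || '4000', 10),
    temperature: Number.parseFloat(env.OPENAI_TEMPERATURE || '0.7'),
  };
}

const config = loadOpenAIConfig({ OPENAI_API_KEY: 'sk-example', OPENAI_MAX_TOKENS: '4000' });
console.log(config.model, config.maxTokens, config.temperature);
```

Failing early on a missing key gives a clearer error than a rejected API request later.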

🎮 Usage

Running the App

npm run start

Recording Audio

Click "Start Recording" button
Grant microphone permissions when prompted
Speak naturally - the app handles long sessions automatically
Click "Stop Recording" when done

Audio is saved to: ~/.config/dev-diary-audio/recordings/YYYY-MM-DD/session-HH-MM-SS/ (Linux path; see File Locations for macOS and Windows)

Transcribing Recordings

Browse to a date with recordings
Click "Generate Transcription for [date]"
Wait for Whisper to process all audio files (this may take time for long recordings)
Transcript appears automatically when complete

Transcripts are saved as: recordings/YYYY-MM-DD/transcript-YYYY-MM-DD.txt

Generating Developer Diary

Ensure a transcript exists for the day
Click "Generate Developer Diary"
Wait for OpenAI API to generate the diary
Diary appears automatically when complete

Diaries are saved as: recordings/YYYY-MM-DD/diary.txt

📦 Building Distributables

Package without installer

npm run package

Create platform-specific installers

npm run make

This creates .deb, .rpm, .zip, or Windows installers depending on your platform.

📁 File Locations

Development

Recordings: ~/.config/dev-diary-audio/recordings/ (Linux)
Recordings: ~/Library/Application Support/dev-diary-audio/recordings/ (macOS)
Recordings: %APPDATA%/dev-diary-audio/recordings/ (Windows)
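
These three locations match what Electron's `app.getPath('userData')` typically resolves to for an app named dev-diary-audio, although whether the app obtains the path that way is an assumption. The sketch below spells out the same mapping as a pure function. `recordingsDir` is a hypothetical helper, not code from the project.

```javascript
const os = require('node:os');
const path = require('node:path');

// Hypothetical helper mirroring the three locations listed above.
// `platform` uses Node's process.platform values ('linux', 'darwin', 'win32').
function recordingsDir(platform, home, appData) {
  switch (platform) {
    case 'darwin':
      return path.posix.join(home, 'Library', 'Application Support', 'dev-diary-audio', 'recordings');
    case 'win32':
      return path.win32.join(appData, 'dev-diary-audio', 'recordings');
    default:
      return path.posix.join(home, '.config', 'dev-diary-audio', 'recordings');
  }
}

console.log(recordingsDir(process.platform, os.homedir(), process.env.APPDATA || ''));
```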

Directory Structure

recordings/
  └── 2026-01-24/
      ├── session-09-30-15/
      │   ├── recording.webm
      │   └── recording.json (Whisper output)
      ├── session-14-20-30/
      │   ├── recording.webm
      │   └── recording.json
      ├── transcript-2026-01-24.txt
      └── diary.txt
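
The naming scheme in the tree above can be derived from a session's start time. This is an illustrative sketch only: the helper names are hypothetical, and using local time rather than UTC is an assumption.

```javascript
const path = require('node:path');

const pad = (n) => String(n).padStart(2, '0');

// Hypothetical helpers: build the folder names from a Date (local time).
function dayFolder(date) {
  return `${date.getFullYear()}-${pad(date.getMonth() + 1)}-${pad(date.getDate())}`;
}

function sessionFolder(date) {
  return `session-${pad(date.getHours())}-${pad(date.getMinutes())}-${pad(date.getSeconds())}`;
}

// Returns every file path associated with one recording session.
function sessionPaths(baseDir, startedAt) {
  const day = dayFolder(startedAt);
  const dayDir = path.join(baseDir, day);
  const sessionDir = path.join(dayDir, sessionFolder(startedAt));
  return {
    audio: path.join(sessionDir, 'recording.webm'),
    whisperJson: path.join(sessionDir, 'recording.json'),
    transcript: path.join(dayDir, `transcript-${day}.txt`),
    diary: path.join(dayDir, 'diary.txt'),
  };
}

console.log(sessionPaths('recordings', new Date(2026, 0, 24, 9, 30, 15)));
```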

🏗️ Architecture

Audio Recording Pipeline

Renderer process: MediaRecorder API captures from microphone
60-second chunks: Automatic flushing prevents memory buildup
IPC to main: Chunks sent via append-audio-chunk handler
Main process: Immediately appends to disk using fs.appendFileSync()
Post-processing: ffmpeg remuxes WebM for better compatibility
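
The main-process half of this pipeline can be sketched as follows. `appendAudioChunk` and `remuxArgs` are hypothetical names. In the app, the append step would presumably sit inside the `append-audio-chunk` IPC handler, and the exact ffmpeg command it runs is not documented here. A stream-copy remux (`-c copy`) is shown as one common way to rewrite WebM container metadata without re-encoding.

```javascript
const fs = require('node:fs');
const os = require('node:os');
const path = require('node:path');

// Append each chunk to the session file as soon as it arrives,
// so audio never accumulates in memory.
function appendAudioChunk(filePath, chunk) {
  fs.mkdirSync(path.dirname(filePath), { recursive: true });
  fs.appendFileSync(filePath, Buffer.from(chunk));
  return fs.statSync(filePath).size; // bytes on disk so far
}

// Arguments for a stream-copy remux: the audio is not re-encoded,
// ffmpeg only rewrites the container around it.
function remuxArgs(inputPath, outputPath) {
  return ['-y', '-i', inputPath, '-c', 'copy', outputPath];
}

// Demo with fake chunk data in a temporary directory.
const demoDir = fs.mkdtempSync(path.join(os.tmpdir(), 'dev-diary-'));
const demoFile = path.join(demoDir, 'recording.webm');
appendAudioChunk(demoFile, new Uint8Array([1, 2, 3]));
console.log('bytes written:', appendAudioChunk(demoFile, new Uint8Array([4, 5])));
console.log(['ffmpeg', ...remuxArgs(demoFile, path.join(demoDir, 'fixed.webm'))].join(' '));
```

The synchronous append keeps chunks in arrival order. With one small write per minute, blocking the main process briefly is unlikely to matter.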

Audio Settings

Format: WebM (Opus codec)
Channels: Mono (sufficient for speech)
Sample Rate: 44.1 kHz
Enhancements: Echo cancellation, noise suppression enabled
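
Expressed as Web API options, these settings might look like the sketch below. The constant names and the `onChunk` callback are hypothetical. Note that `getUserMedia` constraints are requests, so the browser engine may not honor the sample rate exactly.

```javascript
// Hypothetical constants expressing the settings above as Web API options.
const AUDIO_CONSTRAINTS = {
  audio: {
    channelCount: 1, // mono
    sampleRate: 44100,
    echoCancellation: true,
    noiseSuppression: true,
  },
};
const RECORDER_OPTIONS = { mimeType: 'audio/webm;codecs=opus' };
const CHUNK_MS = 60_000;

// Renderer-side sketch; `onChunk` is a hypothetical callback that would
// forward each chunk to the main process.
async function startRecording(onChunk) {
  const stream = await navigator.mediaDevices.getUserMedia(AUDIO_CONSTRAINTS);
  const recorder = new MediaRecorder(stream, RECORDER_OPTIONS);
  recorder.ondataavailable = async (event) => {
    if (event.data.size > 0) onChunk(await event.data.arrayBuffer());
  };
  recorder.start(CHUNK_MS); // emit a dataavailable event every 60 seconds
  return recorder;
}
```

Passing a timeslice to `start()` is what produces the 60-second chunks: the recorder hands over its buffered data at each interval.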

Transcription Flow

Whisper processes each recording independently
Outputs JSON with timestamped segments
App merges all segments chronologically
Single daily transcript file created
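
A rough sketch of this flow, under stated assumptions: the Whisper flags shown (`--model`, `--output_format`, `--output_dir`) are real CLI options, and omitting `--language` leaves detection to Whisper. The function names, the use of the session folder name as a time offset, and the `[HH:MM:SS]` line format are all hypothetical.

```javascript
// Whisper CLI arguments for one recording (model 'small', JSON output).
function whisperArgs(audioPath, outputDir) {
  return [audioPath, '--model', 'small', '--output_format', 'json', '--output_dir', outputDir];
}

// 'session-09-30-15' -> seconds since midnight
function sessionStartSeconds(folderName) {
  const [h, m, s] = folderName.replace('session-', '').split('-').map(Number);
  return h * 3600 + m * 60 + s;
}

// seconds since midnight -> 'HH:MM:SS'
function clock(totalSeconds) {
  const t = Math.floor(totalSeconds);
  return [Math.floor(t / 3600), Math.floor((t % 3600) / 60), t % 60]
    .map((n) => String(n).padStart(2, '0'))
    .join(':');
}

// sessions: [{ folder, whisper }], where `whisper` is the parsed recording.json.
// Each Whisper segment carries `start` (seconds into the file) and `text`.
function mergeTranscripts(sessions) {
  return sessions
    .flatMap(({ folder, whisper }) =>
      whisper.segments.map((seg) => ({
        at: sessionStartSeconds(folder) + seg.start,
        text: seg.text.trim(),
      })))
    .sort((a, b) => a.at - b.at)
    .map(({ at, text }) => `[${clock(at)}] ${text}`)
    .join('\n');
}

const demo = mergeTranscripts([
  { folder: 'session-14-20-30', whisper: { segments: [{ start: 0, text: ' Fixed the build.' }] } },
  { folder: 'session-09-30-15', whisper: { segments: [{ start: 0, text: ' Starting the day.' }] } },
]);
console.log(demo);
```

Sorting on absolute time of day rather than file order keeps the transcript chronological even if sessions are processed out of order.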

Diary Generation Flow

Reads daily transcript
Sends to OpenAI API with specialized prompt
Returns structured summary of developer's day
Saves locally as plain text
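
The request side of this flow might look like the sketch below. It assumes the Chat Completions endpoint and the global `fetch` available in Node 18+, and the app may well use a different endpoint or the official SDK instead. The prompt text is a placeholder, since the app's actual prompt is not shown in this README, and `config` is a hypothetical object holding the .env settings.

```javascript
// Placeholder prompt; the app's real "specialized prompt" is not documented here.
const SYSTEM_PROMPT =
  "You summarize a developer's spoken work log into a structured daily diary.";

// Builds the JSON body from the transcript and the .env-derived settings.
function buildDiaryRequest(transcript, config) {
  return {
    model: config.model,
    max_tokens: config.maxTokens,
    temperature: config.temperature,
    messages: [
      { role: 'system', content: SYSTEM_PROMPT },
      { role: 'user', content: transcript },
    ],
  };
}

// Sends the request and returns the generated diary text.
async function generateDiary(transcript, config) {
  const response = await fetch('https://api.openai.com/v1/chat/completions', {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      Authorization: `Bearer ${config.apiKey}`,
    },
    body: JSON.stringify(buildDiaryRequest(transcript, config)),
  });
  if (!response.ok) throw new Error(`OpenAI API error: ${response.status}`);
  const data = await response.json();
  return data.choices[0].message.content;
}
```

Keeping the body construction separate from the network call makes the request easy to inspect or test without spending API credits.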

Key Files

main.js - IPC handlers, file operations, external process execution
renderer.js - MediaRecorder logic, UI state management
preload.js - Secure communication bridge between main and renderer

📖 Documentation

See the docs/ADR/ directory for detailed architectural decision records

⚠️ Known Limitations

Transcription cannot be cancelled once started
No real-time transcription progress updates
Whisper model hardcoded to 'small' (balance of speed/accuracy)
Diary generation requires internet connection
No versioning or history of diary entries

🤝 Contributing

This is currently a personal developer tool. Future contributions may be welcome as the project evolves.

📄 License

MIT

🙏 Acknowledgments

Whisper by OpenAI for excellent speech-to-text
Electron for cross-platform desktop app framework
ffmpeg for audio processing utilities
