speech2text

Desktop speech-to-text powered by Groq's Whisper v3 API.
Record your voice, get the transcription pasted directly into whatever you're typing.

Install

winget install ozas.speech2text

Or download the latest installer from the releases page.

How it works

Set your Groq API key in settings
Hold your keybind (default Ctrl+Shift) or click the mic button
Speak
Release the keys (or click again) — transcription gets pasted into the active text field

A Dynamic Island-style overlay appears at the top of your screen showing recording/transcribing/done status.

Features

Push-to-talk with configurable keybind (supports any key combo including modifier-only)
Real-time audio visualizer
Transcript history
Language selection (24 languages or auto-detect)
System tray with minimize-to-tray
Lightweight native app (~5MB)

Stack

Backend: Rust via Tauri v2 — Groq API, clipboard, raw Win32 keyboard hook, keystroke simulation
Frontend: React + Vite with Web Audio API visualizer
API: Groq Whisper Large v3

Building

Requires Rust and Node.js.

npm install
npm run tauri dev

Release build:

npx tauri build

Produces a standalone NSIS installer in src-tauri/target/release/bundle/nsis/.

Getting a Groq API key

Sign up at console.groq.com, create an API key, and paste it into the app's settings panel.

Author

Built by ozas.

License

AGPL-3.0 — see LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.claude		.claude
manifests/o/ozas/speech2text		manifests/o/ozas/speech2text
src-tauri		src-tauri
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
index.html		index.html
latest.json		latest.json
logo.svg		logo.svg
overlay.html		overlay.html
package-lock.json		package-lock.json
package.json		package.json
vite.config.js		vite.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

speech2text

Install

How it works

Features

Stack

Building

Getting a Groq API key

Author

License

About

Uh oh!

Releases 5

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

speech2text

Install

How it works

Features

Stack

Building

Getting a Groq API key

Author

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages