clearcut: Ultra-Precise Audio Segmentation

clearcut is an advanced audio processing tool designed for ultra-clean audio segmentation. Whether you're working on speech recognition, transcription alignment, or other audio-based applications, ClearCut ensures your segments are precise, clean, and never clip the first or last word of a segment.

Orchestrates the entire pipeline:

Standardizes audio (resampling, volume normalization)
Optionally separates vocals from instrumentals
Performs ASR (using WhisperX) or loads user transcripts if provided (.txt files)
Detects breath intervals using respiro.py
Aligns transcripts and breath intervals
Segments audio based on threshold crossing & minima detection
Writes out training text files and optional TextGrid annotations

Installation

Clone this repository:

git clone https://github.com/your-username/audio-segmentation-alignment.git
cd audio-segmentation-alignment

download models:

https://github.com/ydqmkkx/Respiro-en/blob/main/respiro-en.pt to your models directory https://github.com/TRvlvr/model_repo/releases/download/all_public_uvr_models/UVR-MDX-NET-Inst_HQ_3.onnx to your models directory

Create and activate a Python virtual environment (recommended):

python3 -m venv venv
source venv/bin/activate
# or on Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Some of these libraries (e.g., onnxruntime-gpu, torch) might require specialized wheels if you plan to run on GPU. Please consult the official PyTorch and onnxruntime documentation for platform-specific instructions.

After installing, you can run the entire pipeline via:

python main.py --config config/config.yaml

Acknowledgement 🔔

This project would no be possible without the work by these excellent developers!

ASR (WhisperX): WhisperX
Source Separation: UVR-MDX-NET-Inst_HQ_3
Breath Detection: respiro.py
Emelia: https://github.com/open-mmlab/Amphion/blob/main/preprocessors/Emilia/README.md
fairseq: https://github.com/facebookresearch/fairseq/blob/main/examples/mms/data_prep/README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

clearcut: Ultra-Precise Audio Segmentation

Installation

Acknowledgement 🔔

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
config		config
models		models
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

clearcut: Ultra-Precise Audio Segmentation

Installation

Acknowledgement 🔔

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages