Skip to content

oysterlanguage/clearcut

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

12 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

clearcut: Ultra-Precise Audio Segmentation

clearcut is an advanced audio processing tool designed for ultra-clean audio segmentation. Whether you're working on speech recognition, transcription alignment, or other audio-based applications, ClearCut ensures your segments are precise, clean, and never clip the first or last word of a segment.

Orchestrates the entire pipeline:

  1. Standardizes audio (resampling, volume normalization)
  2. Optionally separates vocals from instrumentals
  3. Performs ASR (using WhisperX) or loads user transcripts if provided (.txt files)
  4. Detects breath intervals using respiro.py
  5. Aligns transcripts and breath intervals
  6. Segments audio based on threshold crossing & minima detection
  7. Writes out training text files and optional TextGrid annotations

Installation

Clone this repository:

git clone https://github.com/your-username/audio-segmentation-alignment.git
cd audio-segmentation-alignment

download models:

https://github.com/ydqmkkx/Respiro-en/blob/main/respiro-en.pt to your models directory https://github.com/TRvlvr/model_repo/releases/download/all_public_uvr_models/UVR-MDX-NET-Inst_HQ_3.onnx to your models directory

Create and activate a Python virtual environment (recommended):

python3 -m venv venv
source venv/bin/activate
# or on Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Some of these libraries (e.g., onnxruntime-gpu, torch) might require specialized wheels if you plan to run on GPU. Please consult the official PyTorch and onnxruntime documentation for platform-specific instructions.

After installing, you can run the entire pipeline via:

python main.py --config config/config.yaml

Acknowledgement ๐Ÿ””

This project would no be possible without the work by these excellent developers!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages