
DPDFNet: Boosting DeepFilterNet2 via Dual-Path RNN



Links: Project Page · arXiv Paper · Hugging Face Models · Hugging Face Dataset · Hugging Face Space

Official repository for the DPDFNet paper.

[Figure: noisy → enhanced spectrogram slideshow]

Install the PyPI Package

For CPU-only ONNX inference using the packaged CLI and Python API:

pip install dpdfnet

CLI Example

# Enhance one file
dpdfnet enhance noisy.wav enhanced.wav --model dpdfnet4

# Enhance a directory
dpdfnet enhance-dir ./noisy_wavs ./enhanced_wavs --model dpdfnet2

# Download models
dpdfnet download
dpdfnet download dpdfnet8
dpdfnet download dpdfnet4 --force

Python API Example

import soundfile as sf
import dpdfnet

# In-memory enhancement:
audio, sr = sf.read("noisy.wav")
enhanced = dpdfnet.enhance(audio, sample_rate=sr, model="dpdfnet4")
sf.write("enhanced.wav", enhanced, sr)

# Enhance one file:
out_path = dpdfnet.enhance_file("noisy.wav", model="dpdfnet2")
print(out_path)

# Model listing:
for row in dpdfnet.available_models():
    print(row["name"], row["ready"], row["cached"])

# Download models:
dpdfnet.download()				# All models
dpdfnet.download("dpdfnet4")	# Specific model

Run From Source

1) Install dependencies

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

2) Download models

Model files are not bundled in this repository.
Download PyTorch checkpoints, TFLite, and ONNX models from Hugging Face:

pip install -U "huggingface_hub[cli]"

# create target dirs
mkdir -p model_zoo/{checkpoints,onnx,tflite}

# PyTorch checkpoints (HF path: checkpoints/* -> local: model_zoo/checkpoints/*)
hf download Ceva-IP/DPDFNet \
  --include "checkpoints/*.pth" \
  --local-dir model_zoo

# ONNX models (& states) (HF path: onnx/* -> local: model_zoo/onnx/*)
hf download Ceva-IP/DPDFNet \
  --include "onnx/*.onnx" \
  --local-dir model_zoo

hf download Ceva-IP/DPDFNet \
  --include "onnx/*.npz" \
  --local-dir model_zoo

# TFLite models (HF path: *.tflite at repo root -> local: model_zoo/tflite/*)
hf download Ceva-IP/DPDFNet \
  --include "*.tflite" \
  --local-dir model_zoo/tflite
3) Run offline enhancement

Put one or more *.wav files in ./noisy_wavs, then choose one:

Option A: TFLite

python -m tflite_model.infer_dpdfnet_tflite \
  --noisy_dir ./noisy_wavs \
  --enhanced_dir ./enhanced_wavs \
  --model_name dpdfnet4

Option B: ONNX

python -m onnx_model.infer_dpdfnet_onnx \
  --noisy_dir ./noisy_wavs \
  --enhanced_dir ./enhanced_wavs \
  --model_name dpdfnet4

Enhanced files are written as:

<original_stem>_<model_name>.wav
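For reference, the naming rule above can be reproduced with pathlib; `enhanced_name` is a hypothetical helper for illustration, not part of the repo:

```python
from pathlib import Path

def enhanced_name(noisy_path: str, model_name: str) -> str:
    """Build the output filename <stem>_<model_name>.wav from an input path."""
    stem = Path(noisy_path).stem  # filename without directory or extension
    return f"{stem}_{model_name}.wav"

print(enhanced_name("noisy_wavs/street_01.wav", "dpdfnet4"))
# → street_01_dpdfnet4.wav
```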

Audio Samples & Demo

Real-Time Demo

[Figure: real-time DPDFNet demo screenshot]

Run:

python -m real_time_demo

How it works:

  • Captures microphone audio in streaming hops.
  • Enhances each hop frame-by-frame with ONNX.
  • Displays live noisy vs enhanced spectrograms.
  • Lets you control the noise-reduction level during playback: 0 plays the raw stream, 1 the fully enhanced stream.
  • Lets you enable automatic gain control (AGC) during playback.

To change the model, edit MODEL_NAME near the top of real_time_demo.py.
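The noise-reduction slider behaves like a dry/wet mix between the raw and enhanced streams. A minimal sketch of that idea in plain Python (`mix_hop` is a hypothetical helper, not the demo's actual code):

```python
def mix_hop(raw, enhanced, level):
    """Blend one audio hop: level 0.0 -> raw passthrough, 1.0 -> fully enhanced."""
    if not 0.0 <= level <= 1.0:
        raise ValueError("level must be in [0, 1]")
    return [(1.0 - level) * r + level * e for r, e in zip(raw, enhanced)]

raw_hop = [0.5, -0.2, 0.1]
enhanced_hop = [0.4, -0.1, 0.0]
print(mix_hop(raw_hop, enhanced_hop, 0.0))  # identical to raw_hop
print(mix_hop(raw_hop, enhanced_hop, 1.0))  # identical to enhanced_hop
```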

Model Profile

16 kHz models

| Model    | Params [M] | MACs [G] | TFLite Size [MB] | ONNX Size [MB] | Intended Use                    |
|----------|------------|----------|------------------|----------------|---------------------------------|
| baseline | 2.31       | 0.36     | 8.5              | 8.5            | Fastest / lowest resource usage |
| dpdfnet2 | 2.49       | 1.35     | 10.7             | 9.9            | Real-time / embedded devices    |
| dpdfnet4 | 2.84       | 2.36     | 12.9             | 11.2           | Balanced performance            |
| dpdfnet8 | 3.54       | 4.37     | 17.2             | 14.1           | Best enhancement quality        |

48 kHz model

| Model             | Params [M] | MACs [G] | TFLite Size [MB] | ONNX Size [MB] | Intended Use                 |
|-------------------|------------|----------|------------------|----------------|------------------------------|
| dpdfnet2_48khz_hr | 2.58       | 2.42     | 11.6             | 10.3           | High-resolution 48 kHz audio |

Troubleshooting / FAQ

Q: Model files are missing (TFLite / ONNX / checkpoints)

  • Run the Hugging Face download commands from the Run From Source section.
  • Confirm files are in:
    • model_zoo/tflite/
    • model_zoo/onnx/
    • model_zoo/checkpoints/

Q: No .wav files found

  • Both offline scripts scan only the exact folder given by --noisy_dir (non-recursive).
  • Ensure input files use .wav extension.
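The flat scan can be illustrated with pathlib: glob matches only the given folder, while rglob would descend into subfolders (`list_wavs` is an illustrative helper, not repo code):

```python
import tempfile
from pathlib import Path

def list_wavs(noisy_dir, recursive=False):
    """List .wav filenames; the offline scripts behave like recursive=False."""
    root = Path(noisy_dir)
    files = root.rglob("*.wav") if recursive else root.glob("*.wav")
    return sorted(p.name for p in files)

# Demonstrate on a throwaway tree: one top-level wav, one nested wav.
with tempfile.TemporaryDirectory() as d:
    (Path(d) / "top.wav").touch()
    (Path(d) / "sub").mkdir()
    (Path(d) / "sub" / "deep.wav").touch()
    print(list_wavs(d))                  # flat scan finds only ['top.wav']
    print(list_wavs(d, recursive=True))  # recursive scan finds both files
```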

Q: Real-time demo has audio device errors

  • Check microphone permissions and default input/output device settings.
  • Install host audio dependencies for sounddevice (PortAudio packages on your OS).

Q: Real-time GUI does not open

  • Ensure Qt dependencies from requirements.txt installed successfully.
  • On headless servers, run offline enhancement instead.

Q: I get import/module errors when running commands

  • Run from repo root and use module form exactly as documented (python -m ...).
  • Activate your virtual environment before running commands.

Q: CPU is too slow for my target

  • Try smaller models (baseline, dpdfnet2).
  • Benchmark with ONNX Runtime using python -m onnx_model.infer_dpdfnet_onnx ... and compare the real-time factor (RTF).
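RTF here is processing time divided by audio duration, so values below 1.0 mean faster than real time. A trivial helper for comparing benchmark runs (hypothetical, not part of the repo):

```python
def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF = processing time / audio duration; < 1.0 means real-time capable."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return processing_seconds / audio_seconds

# Hypothetical numbers: 2.5 s to enhance a 10 s clip.
print(real_time_factor(2.5, 10.0))  # → 0.25
```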

Evaluation Metrics

To compute intrusive and non-intrusive metrics on our DPDFNet EvalSet, we use the tools listed below. For aggregate quality reporting, we rely on PRISM, the scale‑normalized composite metric introduced in the DPDFNet paper.

Intrusive metrics: PESQ, STOI, SI-SNR

We provide a dedicated script, pesq_stoi_sisnr_calc.py, which computes PESQ, STOI, and SI-SNR for paired reference and enhanced audio. The script includes a built-in auto-alignment step that corrects small start-time offsets and drift between the reference and the enhanced signals before scoring, to ensure fair comparisons.
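For reference, SI-SNR follows the standard scale-invariant definition; a pure-Python sketch is below (the repo's pesq_stoi_sisnr_calc.py may differ in details such as the alignment step):

```python
import math

def si_snr(reference, estimate, eps=1e-8):
    """Scale-invariant SNR in dB between a reference and an estimate (zero-mean)."""
    mr = sum(reference) / len(reference)
    me = sum(estimate) / len(estimate)
    s = [a - mr for a in reference]                 # zero-mean reference
    x = [b - me for b in estimate]                  # zero-mean estimate
    scale = sum(a * b for a, b in zip(s, x)) / (sum(a * a for a in s) + eps)
    target = [scale * a for a in s]                 # projection of x onto s
    noise = [b - t for b, t in zip(x, target)]
    t_energy = sum(t * t for t in target)
    n_energy = sum(n * n for n in noise) + eps
    return 10.0 * math.log10(t_energy / n_energy + eps)

clean = [0.1, -0.2, 0.3, -0.1]
# A merely rescaled copy scores near-perfectly: SI-SNR ignores overall gain.
print(si_snr(clean, [2.0 * v for v in clean]))
```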

Non-intrusive metrics

  • DNSMOS (P.835 & P.808) - We use the official DNSMOS local inference script from the DNS Challenge repository: dnsmos_local.py. Please follow their installation and model download instructions in that project before running.
  • NISQA v2 - We use the official NISQA project: https://github.com/gabrielmittag/NISQA. Refer to their README for environment setup, pretrained model weights, and inference commands (e.g., running nisqa_predict.py on a folder of WAVs).

Citation

@article{rika2025dpdfnet,
 title = {DPDFNet: Boosting DeepFilterNet2 via Dual-Path RNN},
 author = {Rika, Daniel and Sapir, Nino and Gus, Ido},
 year = {2025},
}

License

Apache License 2.0. See LICENSE.

About

DPDFNet: causal single-channel speech enhancement that boosts DeepFilterNet2 with dual-path RNN blocks for stronger long-range temporal and cross-band modeling. Repo includes PyTorch implementation + checkpoints, ONNX & TFLite models with inference code, and a real-time demo.
