QuartzNet ASR

Description

Implement Nvidia's QuartzNet neural net for the task of Automatic Speech Recognition (ASR) in Tensorflow 2. QuartzNet comes from the Deep Automatic Speech Recognition with 1D Time-Channel Separable Convolutions paper and was trained with CTC loss on the LibriSpeech dataset to achieve state-of-the-art (SOTA) accuracy with a Word Error Rate (WER) in the range of 4.19 to 10.98%. QuartzNet can be found as part of Nvidia's NeMo repository on Github however, this implementation was based off of Jaco-Assistant GitLab repository and is trained on the LJSpeech dataset from Keith Ito. Training setup is taken from the Automatic Speech Recognition using CTC example from the Keras examples page.

Scripts

load_model.py

Initializes 3 QuartzNet models (5x5, 10x5, 15x5) as well as testing the custom StringMap class in quartznet.py used as a functional replacement for tf.keras.layers.StringLookup for those running versions Tensorflow below 2.6.0.

asr_ctc_quartznet.py

A spinoff of the ASR using CTC Keras example that replaces the DeepSpeech2 model with QuartzNet 15x5 and trains on the LJSpeech dataset.

Dockerfile

Dockerfile for running asr_ctc_quartznet.py in a docker container.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
__pycache__		__pycache__
Dockerfile		Dockerfile
README.md		README.md
asr_ctc_quartznet.py		asr_ctc_quartznet.py
asr_ctc_quartznet_loss_test.py		asr_ctc_quartznet_loss_test.py
config.py		config.py
load_model.py		load_model.py
quartznet.py		quartznet.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

QuartzNet ASR

Description

Scripts

Sources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

QuartzNet ASR

Description

Scripts

Sources

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages