LEAP Submission to CHiME-6 ASR Challenge}

Anirudh Sreeram; Anurenjan Purushothaman; Rohit Kumar; Sriram; Ganapathy

arXiv:2005.11258·eess.AS·May 25, 2020

LEAP Submission to CHiME-6 ASR Challenge}

Anirudh Sreeram, Anurenjan Purushothaman, Rohit Kumar, Sriram, Ganapathy

PDF

TL;DR

This paper presents LEAP's ASR system for the CHiME-6 challenge, utilizing data augmentation and advanced neural architectures to improve speech recognition in challenging noisy home environments.

Contribution

The paper introduces a novel combination of data augmentation and a hybrid TDNN-LSTM neural architecture for robust speech recognition in noisy conditions.

Findings

01

2% relative WER improvement over baseline

02

Effective use of data augmentation techniques

03

Hybrid TDNN-LSTM architecture enhances recognition accuracy

Abstract

This paper reports the LEAP submission to the CHiME-6 challenge. The CHiME-6 Automatic Speech Recognition (ASR) challenge Track 1 involved the recognition of speech in noisy and reverberant acoustic conditions in home environments with multiple-party interactions. For the challenge submission, the LEAP system used extensive data augmentation and a factorized time-delay neural network (TDNN) architecture. We also explored a neural architecture that interleaved the TDNN layers with LSTM layers. The submitted system improved the Kaldi recipe by 2% in terms of relative word-error-rate improvements.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.