Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition
D. S. Pavan Kumar, N. Vishnu Prasad, Vikas Joshi, S. Umesh

TL;DR
This paper introduces a modified SPLICE algorithm based on feature correlations for noise-robust speech recognition, extending it to non-stereo data and incorporating an efficient MLLR-based runtime adaptation, resulting in significant performance improvements.
Contribution
It proposes a novel modification to SPLICE using feature correlations, extends the method to non-stereo datasets, and introduces an efficient runtime noise adaptation technique.
Findings
8.6% absolute improvement over SPLICE on Aurora-2 Test C
10.37% improvement over baseline on Aurora-2 with non-stereo data
9.89% absolute improvement with run-time adaptation
Abstract
In this paper, a modification to the training process of the popular SPLICE algorithm has been proposed for noise robust speech recognition. The modification is based on feature correlations, and enables this stereo-based algorithm to improve the performance in all noise conditions, especially in unseen cases. Further, the modified framework is extended to work for non-stereo datasets where clean and noisy training utterances, but not stereo counterparts, are required. Finally, an MLLR-based computationally efficient run-time noise adaptation method in SPLICE framework has been proposed. The modified SPLICE shows 8.6% absolute improvement over SPLICE in Test C of Aurora-2 database, and 2.93% overall. Non-stereo method shows 10.37% and 6.93% absolute improvements over Aurora-2 and Aurora-4 baseline models respectively. Run-time adaptation shows 9.89% absolute improvement in modified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
