VACE-WPE: Virtual Acoustic Channel Expansion Based On Neural Networks   for Weighted Prediction Error-Based Speech Dereverberation

Joon-Young Yang; Joon-Hyuk Chang

arXiv:2102.05259·eess.AS·February 11, 2021·IEEE ACM Trans. Audio Speech Lang. Process.

VACE-WPE: Virtual Acoustic Channel Expansion Based On Neural Networks for Weighted Prediction Error-Based Speech Dereverberation

Joon-Young Yang, Joon-Hyuk Chang

PDF

Open Access

TL;DR

This paper introduces VACE-WPE, a neural network-based framework that enhances single-microphone speech dereverberation by virtually expanding acoustic channels, significantly improving speech quality in noisy reverberant environments.

Contribution

The study extends the WPE algorithm with a neural network-based VACE framework, enabling effective dereverberation in single-microphone scenarios and simplifying the neural network training process.

Findings

01

VACE-WPE outperforms single-channel WPE in speech quality.

02

VACE-WPE is effective in noisy reverberant environments.

03

VACE-WPE enhances automatic speech recognition performance.

Abstract

Speech dereverberation is an important issue for many real-world speech processing applications. Among the techniques developed, the weighted prediction error (WPE) algorithm has been widely adopted and advanced over the last decade, which blindly cancels out the late reverberation component from the reverberant mixture of microphone signals. In this study, we extend the neural-network-based virtual acoustic channel expansion (VACE) framework for the WPE-based speech dereverberation, a variant of the WPE that we recently proposed to enable the use of dual-channel WPE algorithm in a single-microphone speech dereverberation scenario. Based on the previous study, some ablation studies are conducted regarding the constituents of the VACE-WPE in an offline processing scenario. These studies help understand the dynamics of the system, thereby simplifying the architecture and leading to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Acoustic Wave Phenomena Research