SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation using Optimally Smoothed Spectral Mapping
Vinay Kothapally, Wei Xia, Shahram Ghorbani, John H.L. Hansen, Wei, Xue, Jing Huang

TL;DR
This paper introduces SkipConvNet, a novel speech dereverberation neural network that replaces skip connections with convolutional modules and uses spectral smoothing, leading to improved speech quality in reverberant environments.
Contribution
The study proposes SkipConvNet with convolutional skip modules and spectral smoothing, enhancing feature learning and dereverberation performance over existing FCN-based methods.
Findings
SkipConvNet outperforms other approaches in speech quality metrics.
Spectral smoothing improves network efficiency and dereverberation results.
The method benefits downstream speech recognition and verification tasks.
Abstract
The reliability of using fully convolutional networks (FCNs) has been successfully demonstrated by recent studies in many speech applications. One of the most popular variants of these FCNs is the `U-Net', which is an encoder-decoder network with skip connections. In this study, we propose `SkipConvNet' where we replace each skip connection with multiple convolutional modules to provide decoder with intuitive feature maps rather than encoder's output to improve the learning capacity of the network. We also propose the use of optimal smoothing of power spectral density (PSD) as a pre-processing step, which helps to further enhance the efficiency of the network. To evaluate our proposed system, we use the REVERB challenge corpus to assess the performance of various enhancement approaches under the same conditions. We focus solely on monitoring improvements in speech quality and their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
