Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source
Sania Gul, Muhammad Salman Khan, Syed Waqar Shah

TL;DR
This paper introduces a novel binaural dereverberation method that preserves spatial cues using a deep learning approach, outperforming classical and existing deep models in speech clarity and reverberation reduction.
Contribution
It presents a new deep learning-based dereverberation model that maintains spatial cues and requires less training data than previous deep models.
Findings
Outperforms classical weighted prediction error model in multiple metrics.
Achieves better cepstral distance with smaller training dataset.
Maintains performance under unseen acoustic conditions.
Abstract
Reverberations are unavoidable in enclosures, resulting in reduced intelligibility for hearing impaired and non native listeners and even for the normal hearing listeners in noisy circumstances. It also degrades the performance of machine listening applications. In this paper, we propose a novel approach of binaural dereverberation of a single speech source, using the differences in the interaural cues of the direct path signal and the reverberations. Two beamformers, spaced at an interaural distance, are used to extract the reverberations from the reverberant speech. The interaural cues generated by these reverberations and those generated by the direct path signal act as a two class dataset, used for the training of U-Net (a deep convolutional neural network). After its training, the beamformers are removed and the trained U-Net along with the maximum likelihood estimation (MLE)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Underwater Acoustics Research · Indoor and Outdoor Localization Technologies
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Convolution · Max Pooling · Network On Network · U-Net
