SkipConvGAN: Monaural Speech Dereverberation using Generative   Adversarial Networks via Complex Time-Frequency Masking

Vinay Kothapally; J. H. L. Hansen

arXiv:2211.12623·eess.AS·November 24, 2022

SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking

Vinay Kothapally, J. H. L. Hansen

PDF

TL;DR

This paper introduces SkipConvGAN, a novel generative adversarial network designed for monaural speech dereverberation, which effectively restores formant structures in reverberant speech using complex time-frequency masking.

Contribution

It presents a new GAN architecture that estimates complex masks to recover lost formant structures, improving dereverberation performance over existing methods.

Findings

01

Consistent improvement on simulated reverberant speech

02

Outperforms other deep learning-based GAN frameworks

03

Effective in multiple room configurations

Abstract

With the advancements in deep learning approaches, the performance of speech enhancing systems in the presence of background noise have shown significant improvements. However, improving the system's robustness against reverberation is still a work in progress, as reverberation tends to cause loss of formant structure due to smearing effects in time and frequency. A wide range of deep learning-based systems either enhance the magnitude response and reuse the distorted phase or enhance complex spectrogram using a complex time-frequency mask. Though these approaches have demonstrated satisfactory performance, they do not directly address the lost formant structure caused by reverberation. We believe that retrieving the formant structure can help improve the efficiency of existing systems. In this study, we propose SkipConvGAN - an extension of our prior work SkipConvNet. The proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.