Magnitude or Phase? A Two Stage Algorithm for Dereverberation
Ayal Schwartz, Sharon Gannot, Shlomo E. Chazan

TL;DR
This paper introduces a two-stage single-microphone speech dereverberation algorithm that separately estimates magnitude and phase, leading to improved speech quality by leveraging their distinct contributions.
Contribution
The paper proposes a novel two-model architecture that estimates clean magnitude and phase separately, enhancing dereverberation performance over existing methods.
Findings
Consistent improvements in dereverberation metrics
Effective separation of magnitude and phase estimation
Superior performance on REVERB challenge data
Abstract
In this work we present a new single-microphone speech dereverberation algorithm. First, a performance analysis is presented to interpret that algorithms focused on improving solely magnitude or phase are not good enough. Furthermore, we demonstrate that few objective measurements have high correlation with the clean magnitude while others with the clean phase. Consequently ,we propose a new architecture which consists of two sub-models, each of which is responsible for a different task. The first model estimates the clean magnitude given the noisy input. The enhanced magnitude together with the noisy-input phase are then used as inputs to the second model to estimate the real and imaginary portions of the dereverberated signal. A training scheme including pre-training and fine-tuning is presented in the paper. We evaluate our proposed approach using data from the REVERB challenge and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Speech Recognition and Synthesis
