Magnitude or Phase? A Two Stage Algorithm for Dereverberation

Ayal Schwartz; Sharon Gannot; Shlomo E. Chazan

arXiv:2211.00607·cs.SD·November 2, 2022

Magnitude or Phase? A Two Stage Algorithm for Dereverberation

Ayal Schwartz, Sharon Gannot, Shlomo E. Chazan

PDF

Open Access

TL;DR

This paper introduces a two-stage single-microphone speech dereverberation algorithm that separately estimates magnitude and phase, leading to improved speech quality by leveraging their distinct contributions.

Contribution

The paper proposes a novel two-model architecture that estimates clean magnitude and phase separately, enhancing dereverberation performance over existing methods.

Findings

01

Consistent improvements in dereverberation metrics

02

Effective separation of magnitude and phase estimation

03

Superior performance on REVERB challenge data

Abstract

In this work we present a new single-microphone speech dereverberation algorithm. First, a performance analysis is presented to interpret that algorithms focused on improving solely magnitude or phase are not good enough. Furthermore, we demonstrate that few objective measurements have high correlation with the clean magnitude while others with the clean phase. Consequently ,we propose a new architecture which consists of two sub-models, each of which is responsible for a different task. The first model estimates the clean magnitude given the noisy input. The enhanced magnitude together with the noisy-input phase are then used as inputs to the second model to estimate the real and imaginary portions of the dereverberated signal. A training scheme including pre-training and fine-tuning is presented in the paper. We evaluate our proposed approach using data from the REVERB challenge and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Speech Recognition and Synthesis