Single Channel Blind Dereverberation of Speech Signals
Dhruv Nigam

TL;DR
This paper explores various non-negative matrix factor deconvolution techniques for single-channel speech dereverberation, proposing novel methods and comparing their effectiveness using objective speech quality metrics.
Contribution
It introduces a new NMFD-based approach for dereverberation and extends existing methods with temporal modeling, providing a comparative analysis of techniques on standard datasets.
Findings
The proposed NMFD-based method shows improved PESQ scores.
Temporal extensions enhance dereverberation performance.
Exact results remain inconsistent despite quantitative improvements.
Abstract
Dereverberation of recorded speech signals is one of the most pertinent problems in speech processing. In the present work, the objective is to understand and implement dereverberation techniques that aim at enhancing the magnitude spectrogram of reverberant speech signals to remove the reverberant effects introduced. An approach to estimate a clean speech spectrogram from the reverberant speech spectrogram is proposed. This is achieved through non-negative matrix factor deconvolution(NMFD). Further, this approach is extended using the NMF representation for speech magnitude spectrograms. To exploit temporal dependencies, a convolutive NMF-based representation and a frame-stacked model are incorporated into the NMFD framework for speech. A novel approach for dereverberation by applying NMFD to the activation matrix of the reverberated magnitude spectrogram is also proposed. Finally, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Speech Recognition and Synthesis
