Speech Dereverberation Using Nonnegative Convolutive Transfer Function and Spectro temporal Modeling

Nasser Mohammadiha; Simon Doclo

arXiv:1709.05557 · cs.SD · September 19, 2017

TL;DR

This paper introduces two single-channel speech dereverberation methods that combine a nonnegative convolutive transfer function (NCTF) model of the room acoustics with spectro-temporal modeling of the speech signal, improving speech quality in reverberant environments.

Contribution

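The paper proposes two methods that combine an NCTF model of the room acoustics with a nonnegative matrix factorization (NMF) model of the speech spectrogram, together with an extension that exploits temporal dependencies. It reports better performance than a baseline NCTF method and a spectral enhancement method.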

Findings

1. The integrated method outperforms the baseline NCTF method and a spectral enhancement method.

2. The weighted method can yield better quality measures, depending on the room acoustics.

3. Modeling temporal dependencies helps in highly reverberant conditions.

Abstract

This paper presents two single channel speech dereverberation methods to enhance the quality of speech signals that have been recorded in an enclosed space. For both methods, the room acoustics are modeled using a nonnegative approximation of the convolutive transfer function (NCTF), and to additionally exploit the spectral properties of the speech signal, such as the low rank nature of the speech spectrogram, the speech spectrogram is modeled using nonnegative matrix factorization (NMF). Two methods are described to combine the NCTF and NMF models. In the first method, referred to as the integrated method, a cost function is constructed by directly integrating the speech NMF model into the NCTF model, while in the second method, referred to as the weighted method, the NCTF and NMF based cost functions are weighted and summed. Efficient update rules are derived to solve both…
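
The abstract is cut off before the cost functions are written out, so the following is only a rough sketch of how the two combinations it describes could look, not the authors' formulation. It assumes the NCTF model is a per-frequency nonnegative convolution along time, Y[k, t] ≈ Σ_τ H[k, τ] S[k, t − τ], that the NMF model is S ≈ W X, and that the fit is measured with a generalized Kullback-Leibler divergence. The divergence choice, the weight `lam`, and all function and variable names are assumptions made for illustration.

```python
import numpy as np

def nctf_forward(H, S):
    """Nonnegative convolution along time, done separately per frequency bin.

    H: (K, L) nonnegative filter taps (assumed room model).
    S: (K, T) nonnegative clean-speech spectrogram.
    Returns Y_hat with Y_hat[k, t] = sum_tau H[k, tau] * S[k, t - tau].
    """
    K, L = H.shape
    T = S.shape[1]
    Y_hat = np.zeros((K, T))
    for tau in range(min(L, T)):
        # Shift S by tau frames and scale each bin by its tap H[k, tau].
        Y_hat[:, tau:] += H[:, tau:tau + 1] * S[:, :T - tau]
    return Y_hat

def kl_div(A, B, eps=1e-12):
    """Generalized KL divergence D(A || B), summed over all entries.

    Assumption: the source does not say which divergence is used.
    """
    A = A + eps
    B = B + eps
    return float(np.sum(A * np.log(A / B) - A + B))

def integrated_cost(Y, H, W, X):
    """Integrated variant: the NMF model W @ X replaces S inside the NCTF model."""
    return kl_div(Y, nctf_forward(H, W @ X))

def weighted_cost(Y, H, S, W, X, lam):
    """Weighted variant: NCTF fit plus lam times the NMF fit of S.

    lam is a hypothetical weighting parameter.
    """
    return kl_div(Y, nctf_forward(H, S)) + lam * kl_div(S, W @ X)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    K, T, L, R = 6, 40, 4, 3          # bins, frames, filter taps, NMF rank
    W = rng.random((K, R))            # spectral basis vectors
    X = rng.random((R, T))            # activations over time
    H = rng.random((K, L))            # nonnegative room filter
    S = W @ X                         # synthetic "clean" spectrogram
    Y = nctf_forward(H, S)            # synthetic "reverberant" spectrogram
    print("integrated cost:", integrated_cost(Y, H, W, X))
    print("weighted cost:  ", weighted_cost(Y, H, S, W, X, lam=0.5))
```

In the integrated form the only unknowns are H, W, and X, whereas the weighted form keeps S as a separate variable and lets `lam` trade off fidelity to the room model against fidelity to the speech model. The update rules mentioned in the abstract are not reproduced here.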
