Nonnegative HMM for Babble Noise Derived from Speech HMM: Application to Speech Enhancement
Nasser Mohammadiha, Arne Leijon

TL;DR
This paper introduces a novel gamma nonnegative HMM for babble noise, leveraging speech models to improve noise reduction in speech enhancement, with significant performance gains demonstrated through evaluations.
Contribution
It develops a gamma nonnegative HMM for babble noise based on speech HMMs, enabling more effective noise reduction in speech processing.
Findings
Significant improvement over conventional noise reduction methods.
Effective modeling of babble noise using speech basis matrices.
Enhanced subjective and objective speech quality.
Abstract
Deriving a good model for multitalker babble noise can facilitate different speech processing algorithms, e.g. noise reduction, to reduce the so-called cocktail party difficulty. In the available systems, the fact that the babble waveform is generated as a sum of N different speech waveforms is not exploited explicitly. In this paper, first we develop a gamma hidden Markov model for power spectra of the speech signal, and then formulate it as a sparse nonnegative matrix factorization (NMF). Second, the sparse NMF is extended by relaxing the sparsity constraint, and a novel model for babble noise (gamma nonnegative HMM) is proposed in which the babble basis matrix is the same as the speech basis matrix, and only the activation factors (weights) of the basis vectors are different for the two signals over time. Finally, a noise reduction algorithm is proposed using the derived speech and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
