Nonnegative HMM for Babble Noise Derived from Speech HMM: Application to Speech Enhancement

Nasser Mohammadiha; Arne Leijon

arXiv:1709.05559·cs.SD·September 19, 2017

TL;DR

This paper proposes a gamma nonnegative HMM for multitalker babble noise that reuses the basis matrix of a speech HMM, and applies the speech and babble models to noise reduction for speech enhancement, with evaluations showing gains over conventional methods.

Contribution

It develops a gamma nonnegative HMM for babble noise based on speech HMMs, enabling more effective noise reduction in speech processing.

Findings

01. Significant improvement over conventional noise reduction methods.

02. Effective modeling of babble noise using speech basis matrices.

03. Enhanced subjective and objective speech quality.

Abstract

Deriving a good model for multitalker babble noise can facilitate different speech processing algorithms, e.g. noise reduction, to reduce the so-called cocktail party difficulty. In the available systems, the fact that the babble waveform is generated as a sum of N different speech waveforms is not exploited explicitly. In this paper, first we develop a gamma hidden Markov model for power spectra of the speech signal, and then formulate it as a sparse nonnegative matrix factorization (NMF). Second, the sparse NMF is extended by relaxing the sparsity constraint, and a novel model for babble noise (gamma nonnegative HMM) is proposed in which the babble basis matrix is the same as the speech basis matrix, and only the activation factors (weights) of the basis vectors are different for the two signals over time. Finally, a noise reduction algorithm is proposed using the derived speech and…
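The shared-basis idea in the abstract can be illustrated with a minimal sketch. It assumes that power spectra add across talkers, that each talker has unit gain, and that states are drawn independently per frame instead of from a Markov chain; it models only the mean spectra, not the gamma distribution. The basis matrix `B`, the helper names, and all sizes are hypothetical and are not taken from the paper.

```python
import numpy as np


def one_hot_activations(states, num_states):
    """Sparse activations for one talker: exactly one active state per frame."""
    num_frames = len(states)
    activations = np.zeros((num_states, num_frames))
    activations[states, np.arange(num_frames)] = 1.0
    return activations


def babble_activations(num_talkers, num_states, num_frames, rng):
    """Sum the one-hot activations of several talkers that share one basis.

    The result is nonnegative but no longer one-hot, which mirrors the
    relaxed sparsity constraint described for the babble model.
    """
    total = np.zeros((num_states, num_frames))
    for _ in range(num_talkers):
        # Simplification: i.i.d. states instead of a Markov chain.
        states = rng.integers(0, num_states, size=num_frames)
        total += one_hot_activations(states, num_states)
    return total


rng = np.random.default_rng(0)
num_bins, num_states, num_frames, num_talkers = 8, 5, 20, 6

# Hypothetical nonnegative basis: one mean power spectrum per speech state.
B = rng.gamma(shape=2.0, scale=1.0, size=(num_bins, num_states))

speech_V = one_hot_activations(
    rng.integers(0, num_states, size=num_frames), num_states
)
babble_V = babble_activations(num_talkers, num_states, num_frames, rng)

# Same basis for both signals; only the activations differ.
speech_mean = B @ speech_V
babble_mean = B @ babble_V
```

Each column of `speech_V` selects a single basis vector, while each column of `babble_V` counts how many talkers occupy each state, so the babble mean spectrum is a nonnegative combination of the same speech basis vectors.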
