TL;DR
This paper introduces novel supervised and unsupervised speech enhancement algorithms based on Bayesian nonnegative matrix factorization, which outperform existing methods and do not require prior noise models.
Contribution
It proposes a Bayesian NMF framework for speech enhancement, including a noise model learning scheme and a hybrid HMM approach to address noise mismatch issues.
Findings
BNMF-based methods outperform state-of-the-art algorithms
The hybrid BNMF-HMM approach effectively handles unknown noise types
Online noise model learning enables unsupervised speech enhancement
Abstract
Reducing the interference noise in a monaural noisy speech signal has been a challenging task for many years. Compared to traditional unsupervised speech enhancement methods, e.g., Wiener filtering, supervised approaches, such as algorithms based on hidden Markov models (HMM), lead to higher-quality enhanced speech signals. However, the main practical difficulty of these approaches is that for each noise type a model is required to be trained a priori. In this paper, we investigate a new class of supervised speech denoising algorithms using nonnegative matrix factorization (NMF). We propose a novel speech enhancement method that is based on a Bayesian formulation of NMF (BNMF). To circumvent the mismatch problem between the training and testing stages, we propose two solutions. First, we use an HMM in combination with BNMF (BNMF-HMM) to derive a minimum mean square error (MMSE)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
