Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation
Mathieu Fontaine (LTCI, RIKEN AIP), Kouhei Sekiguchi (RIKEN AIP),, Aditya Nugraha (RIKEN AIP), Yoshiaki Bando (AIST, RIKEN AIP), Kazuyoshi, Yoshii (RIKEN AIP)

TL;DR
This paper introduces a generalized heavy-tailed extension of FastMNMF using Gaussian scale mixtures, improving blind source separation performance in speech tasks.
Contribution
It proposes GSM-FastMNMF, a unified framework that encompasses existing heavy-tailed FastMNMF variants and introduces a new instance based on the generalized hyperbolic distribution.
Findings
Normal-inverse Gaussian FastMNMF outperforms existing methods.
The framework unifies various heavy-tailed distributions under one model.
Experiments show improved speech separation and enhancement results.
Abstract
This paper describes heavy-tailed extensions of a state-of-the-art versatile blind source separation method called fast multichannel nonnegative matrix factorization (FastMNMF) from a unified point of view. The common way of deriving such an extension is to replace the multivariate complex Gaussian distribution in the likelihood function with its heavy-tailed generalization, e.g., the multivariate complex Student's t and leptokurtic generalized Gaussian distributions, and tailor-make the corresponding parameter optimization algorithm. Using a wider class of heavy-tailed distributions called a Gaussian scale mixture (GSM), i.e., a mixture of Gaussian distributions whose variances are perturbed by positive random scalars called impulse variables, we propose GSM-FastMNMF and develop an expectationmaximization algorithm that works even when the probability density function of the impulse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
