HeMPPCAT: Mixtures of Probabilistic Principal Component Analysers for Data with Heteroscedastic Noise
Alec S. Xu, Laura Balzano, Jeffrey A. Fessler

TL;DR
This paper introduces HeMPPCAT, a novel mixture model that effectively handles heteroscedastic noise in data, improving factor estimation and clustering over traditional MPPCA.
Contribution
HeMPPCAT extends MPPCA by modeling heteroscedastic noise and employs a GEM algorithm for joint estimation, addressing a key limitation of existing methods.
Findings
HeMPPCAT outperforms MPPCA in factor estimation accuracy.
HeMPPCAT achieves higher clustering accuracy in heterogeneous noise scenarios.
Simulation results validate the effectiveness of the proposed method.
Abstract
Mixtures of probabilistic principal component analysis (MPPCA) is a well-known mixture model extension of principal component analysis (PCA). Similar to PCA, MPPCA assumes the data samples in each mixture contain homoscedastic noise. However, datasets with heterogeneous noise across samples are becoming increasingly common, as larger datasets are generated by collecting samples from several sources with varying noise profiles. The performance of MPPCA is suboptimal for data with heteroscedastic noise across samples. This paper proposes a heteroscedastic mixtures of probabilistic PCA technique (HeMPPCAT) that uses a generalized expectation-maximization (GEM) algorithm to jointly estimate the unknown underlying factors, means, and noise variances under a heteroscedastic noise setting. Simulation results illustrate the improved factor estimates and clustering accuracies of HeMPPCAT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy and Chemometric Analyses · Bayesian Methods and Mixture Models · Advanced Chemical Sensor Technologies
MethodsPrincipal Components Analysis
