Bayesian nonparametric Principal Component Analysis
Cl\'ement Elvira, Pierre Chainais, Nicolas Dobigeon

TL;DR
This paper introduces BNP-PCA, a Bayesian nonparametric approach that infers the number of principal components automatically, providing a fully Bayesian framework for dimension reduction and its integration with clustering or latent factor models.
Contribution
It proposes a novel Bayesian nonparametric PCA model using Indian buffet process and Stiefel manifold priors, with new estimators for subspace dimension and significance testing.
Findings
Effective in synthetic examples
Automatically infers the number of components
Integrates with clustering and latent factor models
Abstract
Principal component analysis (PCA) is very popular to perform dimension reduction. The selection of the number of significant components is essential but often based on some practical heuristics depending on the application. Only few works have proposed a probabilistic approach able to infer the number of significant components. To this purpose, this paper introduces a Bayesian nonparametric principal component analysis (BNP-PCA). The proposed model projects observations onto a random orthogonal basis which is assigned a prior distribution defined on the Stiefel manifold. The prior on factor scores involves an Indian buffet process to model the uncertainty related to the number of components. The parameters of interest as well as the nuisance parameters are finally inferred within a fully Bayesian framework via Monte Carlo sampling. A study of the (in-)consistence of the marginal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Spectroscopy and Chemometric Analyses · Bayesian Methods and Mixture Models
