Bayesian nonparametric Principal Component Analysis

Cl\'ement Elvira; Pierre Chainais; Nicolas Dobigeon

arXiv:1709.05667·stat.ML·September 19, 2017·5 cites

Bayesian nonparametric Principal Component Analysis

Cl\'ement Elvira, Pierre Chainais, Nicolas Dobigeon

PDF

Open Access

TL;DR

This paper introduces BNP-PCA, a Bayesian nonparametric approach that infers the number of principal components automatically, providing a fully Bayesian framework for dimension reduction and its integration with clustering or latent factor models.

Contribution

It proposes a novel Bayesian nonparametric PCA model using Indian buffet process and Stiefel manifold priors, with new estimators for subspace dimension and significance testing.

Findings

01

Effective in synthetic examples

02

Automatically infers the number of components

03

Integrates with clustering and latent factor models

Abstract

Principal component analysis (PCA) is very popular to perform dimension reduction. The selection of the number of significant components is essential but often based on some practical heuristics depending on the application. Only few works have proposed a probabilistic approach able to infer the number of significant components. To this purpose, this paper introduces a Bayesian nonparametric principal component analysis (BNP-PCA). The proposed model projects observations onto a random orthogonal basis which is assigned a prior distribution defined on the Stiefel manifold. The prior on factor scores involves an Indian buffet process to model the uncertainty related to the number of components. The parameters of interest as well as the nuisance parameters are finally inferred within a fully Bayesian framework via Monte Carlo sampling. A study of the (in-)consistence of the marginal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote-Sensing Image Classification · Spectroscopy and Chemometric Analyses · Bayesian Methods and Mixture Models