Sparse Generalized Principal Component Analysis for Large-scale   Applications beyond Gaussianity

Qiaoya Zhang; Yiyuan She

arXiv:1512.03883·stat.CO·January 29, 2016

Sparse Generalized Principal Component Analysis for Large-scale Applications beyond Gaussianity

Qiaoya Zhang, Yiyuan She

PDF

Open Access

TL;DR

This paper introduces a scalable, sparse generalized PCA method that extends traditional PCA to exponential family distributions, effectively handling missing data and large-scale high-dimensional datasets.

Contribution

It generalizes sparse PCA to exponential family distributions with built-in missing data handling and proposes efficient, scalable algorithms with improved regularization and convergence properties.

Findings

01

Demonstrates efficiency in high-dimensional simulations

02

Shows effectiveness on real large-scale datasets

03

Achieves faster convergence with accelerated gradient

Abstract

Principal Component Analysis (PCA) is a dimension reduction technique. It produces inconsistent estimators when the dimensionality is moderate to high, which is often the problem in modern large-scale applications where algorithm scalability and model interpretability are difficult to achieve, not to mention the prevalence of missing values. While existing sparse PCA methods alleviate inconsistency, they are constrained to the Gaussian assumption of classical PCA and fail to address algorithm scalability issues. We generalize sparse PCA to the broad exponential family distributions under high-dimensional setup, with built-in treatment for missing values. Meanwhile we propose a family of iterative sparse generalized PCA (SG-PCA) algorithms such that despite the non-convexity and non-smoothness of the optimization task, the loss function decreases in every iteration. In terms of ease and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Face and Expression Recognition · Blind Source Separation Techniques

MethodsInterpretability · Principal Components Analysis