Automatic sparse PCA for high-dimensional data

Kazuyoshi Yata; Makoto Aoshima

arXiv:2209.14891·stat.ME·May 29, 2023

Automatic sparse PCA for high-dimensional data

Kazuyoshi Yata, Makoto Aoshima

PDF

Open Access

TL;DR

This paper introduces a novel thresholding estimator for sparse PCA that improves accuracy and efficiency in high-dimensional data analysis by reducing dependence on threshold selection.

Contribution

It proposes a noise-reduction based thresholding estimator for sparse PCA that is consistent, threshold-independent, and computationally efficient.

Findings

01

The new estimator achieves accurate principal component directions.

02

It performs well in clustering high-dimensional data.

03

The method is faster and more reliable than existing approaches.

Abstract

Sparse principal component analysis (SPCA) methods have proven to efficiently analyze high-dimensional data. Among them, threshold-based SPCA (TSPCA) is computationally more cost-effective than regularized SPCA, based on L1 penalties. We herein present an investigation of the efficacy of TSPCA for high-dimensional data settings and illustrate that, for a suitable threshold value, TSPCA achieves satisfactory performance for high-dimensional data. Thus, the performance of the TSPCA depends heavily on the selected threshold value. To this end, we propose a novel thresholding estimator to obtain the principal component (PC) directions using a customized noise-reduction methodology. The proposed technique is consistent under mild conditions, unaffected by threshold values, and therefore yields more accurate results quickly at a lower computational cost. Furthermore, we explore the shrinkage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Sparse and Compressive Sensing Techniques · Face and Expression Recognition