# Sparse Equisigned PCA: Algorithms and Performance Bounds in the Noisy   Rank-1 Setting

**Authors:** Arvind Prasadan, Raj Rao Nadakuditi, Debashis Paul

arXiv: 1905.09369 · 2019-12-17

## TL;DR

This paper introduces algorithms for sparse equisigned PCA that outperform traditional SVD in noisy, high-dimensional settings, providing theoretical bounds and practical validation for detecting sparse principal components.

## Contribution

It proposes new algorithms exploiting sparsity and equisignedness, with theoretical performance bounds and empirical validation in noisy, high-dimensional regimes.

## Key findings

- Sparse PCA algorithms recover signals below SVD's eigen-SNR threshold.
- Coordinate selection based on sum-type statistic outperforms norm-based schemes.
- Lower bounds on detectable coordinate size and worst-case risk are established.

## Abstract

Singular value decomposition (SVD) based principal component analysis (PCA) breaks down in the high-dimensional and limited sample size regime below a certain critical eigen-SNR that depends on the dimensionality of the system and the number of samples. Below this critical eigen-SNR, the estimates returned by the SVD are asymptotically uncorrelated with the latent principal components. We consider a setting where the left singular vector of the underlying rank one signal matrix is assumed to be sparse and the right singular vector is assumed to be equisigned, that is, having either only nonnegative or only nonpositive entries. We consider six different algorithms for estimating the sparse principal component based on different statistical criteria and prove that by exploiting sparsity, we recover consistent estimates in the low eigen-SNR regime where the SVD fails. Our analysis reveals conditions under which a coordinate selection scheme based on a \textit{sum-type decision statistic} outperforms schemes that utilize the $\ell_1$ and $\ell_2$ norm-based statistics. We derive lower bounds on the size of detectable coordinates of the principal left singular vector and utilize these lower bounds to derive lower bounds on the worst-case risk. Finally, we verify our findings with numerical simulations and illustrate the performance with a video data example, where the interest is in identifying objects.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.09369/full.md

## Figures

22 figures with captions in the complete paper: https://tomesphere.com/paper/1905.09369/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1905.09369/full.md

---
Source: https://tomesphere.com/paper/1905.09369