PCA in Data-Dependent Noise (Correlated-PCA): Nearly Optimal Finite Sample Guarantees
Namrata Vaswani, Praneeth Narayanamurthy

TL;DR
This paper analyzes PCA performance when noise depends on the data, providing nearly optimal finite-sample guarantees under weaker correlation assumptions than previous work.
Contribution
It introduces a nearly optimal sample complexity bound for correlated-PCA with weaker assumptions on data-noise correlation.
Findings
Improved sample complexity bounds for correlated-PCA.
Weaker assumptions on data-noise correlation compared to prior work.
Nearly optimal guarantees for PCA with data-dependent noise.
Abstract
We study Principal Component Analysis (PCA) in a setting where a part of the corrupting noise is data-dependent and, as a result, the noise and the true data are correlated. Under a bounded-ness assumption on the true data and the noise, and a simple assumption on data-noise correlation, we obtain a nearly optimal sample complexity bound for the most commonly used PCA solution, singular value decomposition (SVD). This bound is a significant improvement over the bound obtained by Vaswani and Guo in recent work (NIPS 2016) where this "correlated-PCA" problem was first studied; and it holds under a significantly weaker data-noise correlation assumption than the one used for this earlier result.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Blind Source Separation Techniques · Distributed Sensor Networks and Detection Algorithms
MethodsPrincipal Components Analysis
