Nonconvex Statistical Optimization: Minimax-Optimal Sparse PCA in Polynomial Time
Zhaoran Wang, Huanran Lu, Han Liu

TL;DR
This paper introduces a two-stage framework combining convex relaxation and a novel iterative refinement algorithm to achieve minimax-optimal sparse PCA solutions efficiently in polynomial time, even under non-Gaussian and dependent data.
Contribution
It proposes a new two-stage method with theoretical guarantees for sparse PCA, including a novel iterative algorithm and analysis of its convergence and complexity.
Findings
The initial estimator from convex relaxation falls into the basin of attraction.
SOAP algorithm converges geometrically to the optimal sparse principal subspace.
Larger sample sizes reduce the total iteration complexity.
Abstract
Sparse principal component analysis (PCA) involves nonconvex optimization for which the global solution is hard to obtain. To address this issue, one popular approach is convex relaxation. However, such an approach may produce suboptimal estimators due to the relaxation effect. To optimally estimate sparse principal subspaces, we propose a two-stage computational framework named "tighten after relax": Within the 'relax' stage, we approximately solve a convex relaxation of sparse PCA with early stopping to obtain a desired initial estimator; For the 'tighten' stage, we propose a novel algorithm called sparse orthogonal iteration pursuit (SOAP), which iteratively refines the initial estimator by directly solving the underlying nonconvex problem. A key concept of this two-stage framework is the basin of attraction. It represents a local region within which the `tighten' stage has desired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Blind Source Separation Techniques
MethodsEarly Stopping · Principal Components Analysis
