Exact and Approximation Algorithms for Sparse PCA
Yongchun Li, Weijun Xie

TL;DR
This paper develops exact and approximate algorithms for Sparse PCA, improving solution quality and scalability, and extends the methods to related problems like Sparse SVD and Fair PCA.
Contribution
Introduces two exact MISDP formulations for Sparse PCA with stronger relaxations, and proposes scalable MILP and approximation algorithms with proven ratios.
Findings
Continuous relaxations are close to optimal solutions.
MILP can solve small to medium instances optimally.
Approximation algorithms perform well across instances.
Abstract
Sparse PCA (SPCA) is a fundamental model in machine learning and data analytics, which has witnessed a variety of application areas such as finance, manufacturing, biology, healthcare. To select a prespecified-size principal submatrix from a covariance matrix to maximize its largest eigenvalue for the better interpretability purpose, SPCA advances the conventional PCA with both feature selection and dimensionality reduction. This paper proposes two exact mixed-integer SDPs (MISDPs) by exploiting the spectral decomposition of the covariance matrix and the properties of the largest eigenvalues. We then analyze the theoretical optimality gaps of their continuous relaxation values and prove that they are stronger than that of the state-of-art one. We further show that the continuous relaxations of two MISDPs can be recast as saddle point problems without involving semi-definite cones, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Machine Learning and Algorithms
MethodsFeature Selection · Interpretability · Principal Components Analysis
