Worst-case low-rank approximations
Anya Fries, Markus Reichstein, David Blei, Jonas Peters

TL;DR
This paper introduces wcPCA, a unified framework for worst-case low-rank approximation across heterogeneous domains, providing theoretical guarantees and demonstrating improved robustness in real-world applications.
Contribution
It develops a comprehensive framework for worst-case PCA, introduces new estimators like norm-minPCA and norm-maxregret, and extends the approach to matrix completion with theoretical optimality guarantees.
Findings
Improved worst-case performance in simulations and real-world data
Theoretical guarantees for estimators over source and target domains
Extensions to matrix completion with approximate worst-case optimality
Abstract
Real-world data in health, economics, and environmental sciences are often collected across heterogeneous domains (such as hospitals, regions, or time periods). In such settings, distributional shifts can make standard PCA unreliable, in that, for example, the leading principal components may explain substantially less variance in unseen domains than in the training domains. Existing approaches (such as FairPCA) have proposed to consider worst-case (rather than average) performance across multiple domains. This work develops a unified framework, called wcPCA, applies it to other objectives (resulting in the novel estimators such as norm-minPCA and norm-maxregret, which are better suited for applications with heterogeneous total variance) and analyzes their relationship. We prove that for all objectives, the estimators are worst-case optimal not only over the observed source domains but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Remote-Sensing Image Classification · Statistical and numerical algorithms
