Input Sparsity and Hardness for Robust Subspace Approximation
Kenneth L. Clarkson, David P. Woodruff

TL;DR
This paper studies robust subspace approximation problems, providing algorithms, hardness results, and reductions for minimizing distance-based loss functions, especially for p in [1,2), with applications to robust regression.
Contribution
It introduces new algorithms and hardness results for robust subspace approximation, extending analysis to p in [1,2) and broad classes of loss functions, including M-Estimators.
Findings
Developed an efficient algorithm with O(nnz(A) + (n+d)poly(k/eps) + exp(poly(k/eps))) runtime.
Proved NP-hardness of the problem for p in [1,2), even approximately.
Provided problem-size reduction techniques and bicriteria solutions for M-Estimator loss functions.
Abstract
In the subspace approximation problem, we seek a k-dimensional subspace F of R^d that minimizes the sum of p-th powers of Euclidean distances to a given set of n points a_1, ..., a_n in R^d, for p >= 1. More generally than minimizing sum_i dist(a_i,F)^p,we may wish to minimize sum_i M(dist(a_i,F)) for some loss function M(), for example, M-Estimators, which include the Huber and Tukey loss functions. Such subspaces provide alternatives to the singular value decomposition (SVD), which is the p=2 case, finding such an F that minimizes the sum of squares of distances. For p in [1,2), and for typical M-Estimators, the minimizing gives a solution that is more robust to outliers than that provided by the SVD. We give several algorithmic and hardness results for these robust subspace approximation problems. We think of the n points as forming an n x d matrix A, and letting nnz(A) denote…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Computational Geometry and Mesh Generation · Complexity and Algorithms in Graphs
