Subspace approximation with outliers
Amit Deshpande, Rameshwar Pratap

TL;DR
This paper introduces an efficient algorithm for subspace approximation with outliers, achieving near-optimal solutions under certain error assumptions, even with a high fraction of outliers, by extending dimension reduction and sampling techniques.
Contribution
It extends dimension reduction and sampling methods to handle outliers in subspace approximation, overcoming the SSE-hardness of robust subspace recovery under specific error conditions.
Findings
Provides a polynomial-time algorithm with linear dependence on n and d.
Achieves a (1+ε)-approximation for the optimal subspace.
Works even with large outlier fractions under certain error assumptions.
Abstract
The subspace approximation problem with outliers, for given points in dimensions , an integer , and an outlier parameter , is to find a -dimensional linear subspace of that minimizes the sum of squared distances to its nearest points. More generally, the subspace approximation problem with outliers minimizes the sum of -th powers of distances instead of the sum of squared distances. Even the case of robust PCA is non-trivial, and previous work requires additional assumptions on the input. Any multiplicative approximation algorithm for the subspace approximation problem with outliers must solve the robust subspace recovery problem, a special case in which the inliers in the optimal solution are promised to lie exactly on a -dimensional linear subspace.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPrincipal Components Analysis
