lp-Recovery of the Most Significant Subspace among Multiple Subspaces   with Outliers

Gilad Lerman; Teng Zhang

arXiv:1012.4116·stat.ML·July 24, 2015

lp-Recovery of the Most Significant Subspace among Multiple Subspaces with Outliers

Gilad Lerman, Teng Zhang

PDF

TL;DR

This paper investigates the problem of recovering the most significant subspace from data with multiple subspaces and outliers by using lp minimization, showing conditions under which recovery is possible or impossible.

Contribution

It introduces a novel analysis of lp minimization for subspace recovery, demonstrating its effectiveness for 0<p<=1 and limitations for p>1.

Findings

01

For 0<p<=1, the most significant subspace can be recovered with high probability despite outliers.

02

Adding small noise allows near recovery of the subspace proportional to noise level.

03

For p>1, the most significant subspace cannot be reliably recovered when multiple subspaces are present.

Abstract

We assume data sampled from a mixture of d-dimensional linear subspaces with spherically symmetric distributions within each subspace and an additional outlier component with spherically symmetric distribution within the ambient space (for simplicity we may assume that all distributions are uniform on their corresponding unit spheres). We also assume mixture weights for the different components. We say that one of the underlying subspaces of the model is most significant if its mixture weight is higher than the sum of the mixture weights of all other subspaces. We study the recovery of the most significant subspace by minimizing the lp-averaged distances of data points from d-dimensional subspaces, where p>0. Unlike other lp minimization problems, this minimization is non-convex for all p>0 and thus requires different methods for its analysis. We show that if 0<p<=1, then for any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.