Theoretical Analysis of Sparse Subspace Clustering with Missing Entries
Manolis C. Tsakiris, Rene Vidal

TL;DR
This paper provides theoretical guarantees for Sparse Subspace Clustering (SSC) when applied to data with missing entries, showing that projecting zero-filled data improves clustering performance.
Contribution
It offers the first theoretical analysis of SSC with incomplete data and demonstrates the benefits of projecting onto observation patterns for better clustering accuracy.
Findings
Projection improves SSC performance with missing data
Projected data behave as complete points in projected subspaces
The analysis extends to self-expressive methods in general
Abstract
Sparse Subspace Clustering (SSC) is a popular unsupervised machine learning method for clustering data lying close to an unknown union of low-dimensional linear subspaces; a problem with numerous applications in pattern recognition and computer vision. Even though the behavior of SSC for complete data is by now well-understood, little is known about its theoretical properties when applied to data with missing entries. In this paper we give theoretical guarantees for SSC with incomplete data, and analytically establish that projecting the zero-filled data onto the observation pattern of the point being expressed leads to a substantial improvement in performance. The main insight that stems from our analysis is that even though the projection induces additional missing entries, this is counterbalanced by the fact that the projected and zero-filled data are in effect incomplete points…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Face and Expression Recognition · Anomaly Detection Techniques and Applications
