Subspace Clustering on Incomplete Data with Self-Supervised Contrastive Learning
Huanran Li, Daniel Pimentel-Alarc\'on

TL;DR
This paper introduces a self-supervised contrastive learning framework for subspace clustering that effectively handles incomplete data with missing entries, outperforming existing methods in robustness and scalability.
Contribution
It proposes Contrastive Subspace Clustering (CSC), a novel framework that leverages masked views and contrastive learning to improve clustering on incomplete datasets.
Findings
CSC outperforms classical and deep learning baselines.
Demonstrates robustness to missing data.
Scales effectively to large datasets.
Abstract
Subspace clustering aims to group data points that lie in a union of low-dimensional subspaces and finds wide application in computer vision, hyperspectral imaging, and recommendation systems. However, most existing methods assume fully observed data, limiting their effectiveness in real-world scenarios with missing entries. In this paper, we propose a contrastive self-supervised framework, Contrastive Subspace Clustering (CSC), designed for clustering incomplete data. CSC generates masked views of partially observed inputs and trains a deep neural network using a SimCLR-style contrastive loss to learn invariant embeddings. These embeddings are then clustered using sparse subspace clustering. Experiments on six benchmark datasets show that CSC consistently outperforms both classical and deep learning baselines, demonstrating strong robustness to missing data and scalability to large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification
