Subspace Clustering on Incomplete Data with Self-Supervised Contrastive Learning

Huanran Li; Daniel Pimentel-Alarc\'on

arXiv:2602.00262·cs.CV·February 3, 2026

Subspace Clustering on Incomplete Data with Self-Supervised Contrastive Learning

Huanran Li, Daniel Pimentel-Alarc\'on

PDF

Open Access

TL;DR

This paper introduces a self-supervised contrastive learning framework for subspace clustering that effectively handles incomplete data with missing entries, outperforming existing methods in robustness and scalability.

Contribution

It proposes Contrastive Subspace Clustering (CSC), a novel framework that leverages masked views and contrastive learning to improve clustering on incomplete datasets.

Findings

01

CSC outperforms classical and deep learning baselines.

02

Demonstrates robustness to missing data.

03

Scales effectively to large datasets.

Abstract

Subspace clustering aims to group data points that lie in a union of low-dimensional subspaces and finds wide application in computer vision, hyperspectral imaging, and recommendation systems. However, most existing methods assume fully observed data, limiting their effectiveness in real-world scenarios with missing entries. In this paper, we propose a contrastive self-supervised framework, Contrastive Subspace Clustering (CSC), designed for clustering incomplete data. CSC generates masked views of partially observed inputs and trains a deep neural network using a SimCLR-style contrastive loss to learn invariant embeddings. These embeddings are then clustered using sparse subspace clustering. Experiments on six benchmark datasets show that CSC consistently outperforms both classical and deep learning baselines, demonstrating strong robustness to missing data and scalability to large…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification