Large-Scale Subspace Clustering via k-Factorization

Jicong Fan

arXiv:2012.04345·cs.LG·June 1, 2021·5 cites

Large-Scale Subspace Clustering via k-Factorization

Jicong Fan

PDF

Open Access 1 Repo

TL;DR

This paper introduces k-Factorization Subspace Clustering (k-FSC), a scalable method that directly factorizes data into subspaces, reducing computational complexity and effectively handling noise, outliers, and missing data in large datasets.

Contribution

The paper proposes k-FSC, a novel large-scale subspace clustering method that avoids affinity matrix learning, offers theoretical guarantees, and extends to streaming data.

Findings

01

k-FSC achieves linear time and space complexity.

02

It outperforms state-of-the-art methods on large datasets.

03

Handles noise, outliers, and missing data effectively.

Abstract

Subspace clustering (SC) aims to cluster data lying in a union of low-dimensional subspaces. Usually, SC learns an affinity matrix and then performs spectral clustering. Both steps suffer from high time and space complexity, which leads to difficulty in clustering large datasets. This paper presents a method called k-Factorization Subspace Clustering (k-FSC) for large-scale subspace clustering. K-FSC directly factorizes the data into k groups via pursuing structured sparsity in the matrix factorization model. Thus, k-FSC avoids learning affinity matrix and performing eigenvalue decomposition, and has low (linear) time and space complexity on large datasets. This paper proves the effectiveness of the k-FSC model theoretically. An efficient algorithm with convergence guarantee is proposed to solve the optimization of k-FSC. In addition, k-FSC is able to handle sparse noise, outliers, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jicongfan/K-Factorization-Subspace-Clustering
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition · Speech and Audio Processing · Blind Source Separation Techniques