Factor Adjusted Spectral Clustering for Mixture Models
Shange Tang, Soham Jana, Jianqing Fan

TL;DR
This paper introduces the Factor Adjusted Spectral Clustering (FASC) algorithm, which effectively clusters high-dimensional, correlated data by removing latent factor influences, outperforming traditional methods especially in complex dependence structures.
Contribution
The paper proposes a novel FASC algorithm that denoises data by eliminating factor components, achieving low misclassification rates and computational efficiency in high-dimensional clustering tasks.
Findings
FASC achieves exponentially low mislabeling rates under general assumptions.
FASC outperforms traditional spectral clustering on correlated high-dimensional data.
The method is computationally efficient with near-linear complexity.
Abstract
This paper studies a factor modeling-based approach for clustering high-dimensional data generated from a mixture of strongly correlated variables. Statistical modeling with correlated structures pervades modern applications in economics, finance, genomics, wireless sensing, etc., with factor modeling being one of the popular techniques for explaining the common dependence. Standard techniques for clustering high-dimensional data, e.g., naive spectral clustering, often fail to yield insightful results as their performances heavily depend on the mixture components having a weakly correlated structure. To address the clustering problem in the presence of a latent factor model, we propose the Factor Adjusted Spectral Clustering (FASC) algorithm, which uses an additional data denoising step via eliminating the factor component to cope with the data dependency. We prove this method achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models
MethodsSparse Evolutionary Training · Spectral Clustering
