Structure Learning of Latent Factors via Clique Search on Correlation Thresholded Graphs
Dale S. Kim, Qing Zhou

TL;DR
This paper introduces a fast correlation thresholding algorithm that learns the number of latent factors and the model structure in factor analysis, with theoretical guarantees and scalability to high-dimensional data.
Contribution
The paper proposes a novel clique search-based method that simultaneously determines the number of factors and identifies a rotationally identifiable structure, overcoming key limitations of existing methods.
Findings
The CT algorithm accurately recovers factor structures in simulations.
It scales efficiently to thousands of variables.
The method is robust to assumption violations.
Abstract
Despite the widespread application of latent factor analysis, existing methods suffer from the following weaknesses: requiring the number of factors to be known, lack of theoretical guarantees for learning the model structure, and nonidentifiability of the parameters due to rotation invariance properties of the likelihood. We address these concerns by proposing a fast correlation thresholding (CT) algorithm that simultaneously learns the number of latent factors and a rotationally identifiable model structure. Our novel approach translates this structure learning problem into the search for so-called independent maximal cliques in a thresholded correlation graph that can be easily constructed from the observed data. Our clique analysis technique scales well up to thousands of variables, while competing methods are not applicable in a reasonable amount of running time. We establish a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Bioinformatics and Genomic Networks · Data Visualization and Analytics
