Latent Association Mining in Binary Data
Carson Mosso, Kelly Bodwin, Suman Chakraborty, Kai Zhang, and Andrew, B. Nobel

TL;DR
The paper introduces LAMB, a novel method for identifying stable, mutually associated feature sets in high-dimensional binary data by modeling observed data as thresholded latent continuous variables, effectively uncovering meaningful associations.
Contribution
The paper presents LAMB, a new iterative testing method that detects stable associations in binary data based on a latent continuous model, improving over existing approaches.
Findings
LAMB effectively identifies meaningful feature associations in binary datasets.
LAMB's performance is competitive with methods using full count data.
The method works well on both artificial and real datasets.
Abstract
We consider the problem of identifying stable sets of mutually associated features in moderate or high-dimensional binary data. In this context we develop and investigate a method called Latent Association Mining for Binary Data (LAMB). The LAMB method is based on a simple threshold model in which the observed binary values represent a random thresholding of a latent continuous vector that may have a complex association structure. We consider a measure of latent association that quantifies association in the latent continuous vector without bias due to the random thresholding. The LAMB method uses an iterative testing based search procedure to identify stable sets of mutually associated features. We compare the LAMB method with several competing methods on artificial binary-valued datasets and two real count-valued datasets. The LAMB method detects meaningful associations in these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Data Management and Algorithms
