A Bayesian Approach to Restricted Latent Class Models for Scientifically-Structured Clustering of Multivariate Binary Outcomes
Zhenke Wu, Livia Casciola-Rosen, Antony Rosen, Scott L. Zeger

TL;DR
This paper introduces a Bayesian framework for clustering multivariate binary data with scientific restrictions, incorporating prior knowledge and ensuring parameter identifiability, demonstrated through an application to auto-antibody data.
Contribution
It develops a novel Bayesian method that combines evidence of varying quality, respects scientific context restrictions, and estimates the number of clusters and latent structures simultaneously.
Findings
Effective clustering of biomedical binary data demonstrated
Method identifies meaningful latent classes with sparse patterns
Algorithm incorporates external prior knowledge
Abstract
In this paper, we propose a general framework for combining evidence of varying quality to estimate underlying binary latent variables in the presence of restrictions imposed to respect the scientific context. The resulting algorithms cluster the multivariate binary data in a manner partly guided by prior knowledge. The primary model assumptions are that 1) subjects belong to classes defined by unobserved binary states, such as the true presence or absence of pathogens in epidemiology, or of antibodies in medicine, or the "ability" to correctly answer test questions in psychology, 2) a binary design matrix specifies relevant features in each class, and 3) measurements are independent given the latent class but can have different error rates. Conditions ensuring parameter identifiability from the likelihood function are discussed and inform the design of a novel posterior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
