Predictor-Informed Bayesian Nonparametric Clustering
Md Yasin Ali Parh, Jeremy T. Gaskins

TL;DR
This paper introduces a novel Bayesian nonparametric clustering method that incorporates predictor information through a pyramid group model, improving clustering accuracy and interpretability in complex data.
Contribution
It extends the Common Atoms Model by treating group membership as a latent variable with a nonparametric structure, and proposes a pyramid model for flexible predictor space partitioning.
Findings
Effective in simulation studies.
Successfully applied to health data predicting hospital stays.
Improves clustering coherence based on predictors.
Abstract
In this project we are interested in performing clustering of observations such that the cluster membership is influenced by a set of predictors. To that end, we employ the Bayesian nonparameteric Common Atoms Model, which is a nested clustering algorithm that utilizes a (fixed) group membership for each observation to encourage more similar clustering of members of the same group. CAM operates by assuming each group has its own vector of cluster probabilities, which are themselves clustered to allow similar clustering for some groups. We extend this approach by treating the group membership as an unknown latent variable determined as a flexible nonparametric form of the covariate vector. Consequently, observations with similar predictor values will be in the same latent group and are more likely to be clustered together than observations with disparate predictors. We propose a pyramid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
