Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data
Arhit Chakrabarti, Yang Ni, Ellen Ruth A. Morris, Michael L. Salinas,, Robert S. Chapkin, Bani K. Mallick

TL;DR
This paper introduces a Bayesian nonparametric model called the graphical Dirichlet process for clustering dependent, non-exchangeable grouped data structured by a directed acyclic graph, enabling shared clusters across groups.
Contribution
It proposes a novel graphical Dirichlet process that models dependencies among groups via a DAG, with new representations and an efficient inference algorithm.
Findings
Successfully models dependencies in non-exchangeable groups
Demonstrates effective clustering on simulated data
Applies method to real single-cell dataset
Abstract
We consider the problem of clustering grouped data with possibly non-exchangeable groups whose dependencies can be characterized by a known directed acyclic graph. To allow the sharing of clusters among the non-exchangeable groups, we propose a Bayesian nonparametric approach, termed graphical Dirichlet process, that jointly models the dependent group-specific random measures by assuming each random measure to be distributed as a Dirichlet process whose concentration parameter and base probability measure depend on those of its parent groups. The resulting joint stochastic process respects the Markov property of the directed acyclic graph that links the groups. We characterize the graphical Dirichlet process using a novel hypergraph representation as well as the stick-breaking representation, the restaurant-type representation, and the representation as a limit of a finite mixture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
MethodsBalanced Selection
