Dirichlet-tree multinomial mixtures for clustering microbiome compositions
Jialiang Mao, Li Ma

TL;DR
This paper introduces a Bayesian clustering model called Dirichlet-tree multinomial mixtures (DTMM) that effectively captures complex within-cluster variation in microbiome composition data, improving clustering accuracy in microbiome studies.
Contribution
The paper proposes DTMM, a flexible Bayesian model utilizing phylogenetic information for clustering microbiome compositions, addressing limitations of existing methods.
Findings
DTMM outperforms existing clustering methods in simulations.
DTMM identifies meaningful microbiome clusters in AGP data.
DTMM reveals signature taxa distinguishing clusters.
Abstract
Studying the human microbiome has gained substantial interest in recent years, and a common task in the analysis of these data is to cluster microbiome compositions into subtypes. This subdivision of samples into subgroups serves as an intermediary step in achieving personalized diagnosis and treatment. In applying existing clustering methods to modern microbiome studies including the American Gut Project (AGP) data, we found that this seemingly standard task, however, is very challenging in the microbiome composition context due to several key features of such data. Standard distance-based clustering algorithms generally do not produce reliable results as they do not take into account the heterogeneity of the cross-sample variability among the bacterial taxa, while existing model-based approaches do not allow sufficient flexibility for the identification of complex within-cluster…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGut microbiota and health · Bayesian Methods and Mixture Models · Colorectal Cancer Screening and Detection
