TARO: tree-aggregated factor regression for microbiome data integration
Aditya K Mishra, Iqbal Mahmud, Philip L Lorenzi, Robert R Jenq, Jennifer A Wargo, Nadim J Ajami, Christine B Peterson

TL;DR
TARO is a method that integrates microbiome and metabolomic data to characterize how gut microbes relate to metabolites, illustrated on colorectal cancer screening data.
Contribution
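TARO introduces a tree-aggregated factor regression approach that uses the taxonomic tree structure to flexibly aggregate rare features, addressing the high dimensionality, compositionality, and rare features of microbiome profiling data.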
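To make the idea of aggregating rare features along a taxonomy concrete, here is a minimal Python sketch. It is not TARO's procedure, which learns the aggregation as part of the regression. It only shows a fixed-rule version under stated assumptions: the function name `aggregate_rare_taxa`, the `min_prevalence` cutoff, and the one-level parent mapping are all hypothetical choices made for illustration.

```python
import numpy as np

def aggregate_rare_taxa(counts, parents, min_prevalence=0.5):
    """Collapse rare taxa into their parent node; keep common taxa as-is.

    counts: (n_samples, n_taxa) array of non-negative counts.
    parents: parent label (e.g. genus) for each taxon column.
    min_prevalence: hypothetical cutoff, the fraction of samples in which
        a taxon must be nonzero to be kept as its own feature.
    Returns (aggregated_counts, labels).
    """
    counts = np.asarray(counts, dtype=float)
    # Fraction of samples in which each taxon is observed.
    prevalence = (counts > 0).mean(axis=0)
    columns = {}  # label -> summed column, in insertion order
    for j, parent in enumerate(parents):
        if prevalence[j] >= min_prevalence:
            label = f"taxon_{j}"       # common taxon stays separate
        else:
            label = f"{parent}_rare"   # rare taxa pooled under their parent
        columns[label] = columns.get(label, 0) + counts[:, j]
    labels = list(columns)
    return np.column_stack([columns[k] for k in labels]), labels
```

Pooling by summation keeps each sample's total count unchanged, so the aggregated table can still be converted to compositions afterwards.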
Findings
TARO accurately recovers low-rank coefficient matrices in simulations.
TARO identifies relevant features in microbiome-metabolomic associations.
TARO was applied to colorectal cancer screening data to explore gut microbe-metabolite relationships.
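As background for the first finding, the sketch below shows what recovering a low-rank coefficient matrix means in the simplest setting. It uses classical reduced-rank regression (a least-squares fit followed by a rank-constrained projection), not TARO's estimator, and it ignores sparsity, compositionality, and the taxonomic tree. The function name `reduced_rank_regression` and its arguments are assumptions made for illustration.

```python
import numpy as np

def reduced_rank_regression(X, Y, rank):
    """Classical reduced-rank regression estimate of C in Y ~ X @ C.

    X: (n, p) predictors, Y: (n, q) responses, rank: target rank of C.
    Returns a (p, q) coefficient matrix of rank at most `rank`.
    """
    # Unconstrained least-squares coefficients.
    C_ols = np.linalg.lstsq(X, Y, rcond=None)[0]
    # Right singular vectors of the fitted values span the leading
    # response directions.
    _, _, Vt = np.linalg.svd(X @ C_ols, full_matrices=False)
    V_r = Vt[:rank].T
    # Project the least-squares coefficients onto those directions.
    return C_ols @ V_r @ V_r.T
```

In a simulation one would generate `Y` from a known low-rank `C` plus noise and compare the estimate against it, which is the kind of recovery check the finding refers to.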
Abstract
Although the human microbiome plays a key role in health and disease, the biological mechanisms underlying the interaction between the microbiome and its host are incompletely understood. Integration with other molecular profiling data offers an opportunity to characterize the role of the microbiome and elucidate therapeutic targets. However, this remains challenging due to the high dimensionality, compositionality, and rare features found in microbiome profiling data. These challenges necessitate the use of methods that can achieve structured sparsity in learning cross-platform association patterns. We propose Tree-Aggregated factor RegressiOn (TARO) for the integration of microbiome and metabolomic data. We leverage information on the taxonomic tree structure to flexibly aggregate rare features. We demonstrate through simulation studies that TARO accurately recovers a low-rank…
Taxonomy
Topics: Metabolomics and Mass Spectrometry Studies · Bioinformatics and Genomic Networks · Gut microbiota and health
