It's All Relative: New Regression Paradigm for Microbiome Compositional Data
Gen Li, Yan Li, Kun Chen

TL;DR
This paper introduces a novel regression framework for microbiome compositional data that directly models compositions, improving interpretability and feature selection across taxonomic levels, with proven efficacy in simulations and real data.
Contribution
A new relative-shift regression paradigm for microbiome data that handles zeros and taxonomy, with regularization and efficient algorithms for feature selection and interpretation.
Findings
Effective in identifying relevant microbes at different taxonomic levels.
Outperforms existing methods in simulation studies.
Reveals novel microbiome-neurodevelopment associations.
Abstract
Microbiome data are complex in nature, involving high dimensionality, compositionally, zero inflation, and taxonomic hierarchy. Compositional data reside in a simplex that does not admit the standard Euclidean geometry. Most existing compositional regression methods rely on transformations that are inadequate or even inappropriate in modeling data with excessive zeros and taxonomic structure. We develop a novel relative-shift regression framework that directly uses compositions as predictors. The new framework provides a paradigm shift for compositional regression and offers a superior biological interpretation. New equi-sparsity and taxonomy-guided regularization methods and an efficient smoothing proximal gradient algorithm are developed to facilitate feature aggregation and dimension reduction in regression. As a result, the framework can automatically identify clinically relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeochemistry and Geologic Mapping · Oral microbiology and periodontitis research · Metabolomics and Mass Spectrometry Studies
