TL;DR
This paper introduces two new QIIME 2 plugins, q2-classo and q2-gglasso, for penalized regression, classification, and microbial network estimation tailored to high-dimensional, compositional microbiome data.
Contribution
The paper presents novel plugins for QIIME 2 that enable sparse regression, classification, and network modeling specifically designed for microbial compositional data.
Findings
Demonstrated robust model selection and classification on Atacama soil microbiome data.
Enabled estimation of microbial association networks using sparse graphical models.
Decomposed taxon-taxon associations into direct interactions and latent factors.
Abstract
Motivation: Statistical analysis of microbial count data derived from 16S rRNA or metagenomics sequencing poses unique challenges due to the sparse, compositional, and high-dimensional nature of the data. While QIIME 2 already provides many tools for data pre-processing and analysis, plugins for statistical regression, classification, and microbial network estimation tailored to compositional count data are relatively scarce. Results: We present q2-classo and q2-gglasso, two novel QIIME 2 plugins that implement penalized regression, classification, and graphical modeling approaches for microbial compositional data. q2-classo enables the prediction of a continuous or binary outcome of interest using compositional microbiome data as predictors. Both sparse log-contrast regression and classification, as well as tree-aggregated log-contrast models are available. q2-gglasso enables the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
