Identification of Separable OTUs for Multinomial Classification in Compositional Data Analysis
R. Alberich, N.A. Cruz, R. Fern\'andez, I. Garc\'ia Mosquera, A. Mir, F. Rossell\'o

TL;DR
This paper introduces a novel multinomial classification method for microbiome compositional data that uses penalized log-ratio regression and pairwise separability screening to identify discriminant OTUs with confidence intervals.
Contribution
It develops an interpretable, probabilistic framework combining log-ratio modeling, covariate adjustment, and uncertainty estimation for OTU selection in microbiome analysis.
Findings
Recovered known discriminant taxa in colorectal adenoma dataset
Revealed additional relevant OTUs after covariate adjustment
Improved separation index precision with demographic covariates
Abstract
High-throughput sequencing has transformed microbiome research, but it also produces inherently compositional data that challenge standard statistical and machine learning methods. In this work, we propose a multinomial classification framework for compositional microbiome data based on penalized log-ratio regression and pairwise separability screening. The method quantifies the discriminative ability of each OTU through the area under the receiver operating characteristic curve () for all pairwise log-ratios and aggregates these values into a global separability index , yielding interpretable rankings of taxa together with confidence intervals. We illustrate the approach by reanalyzing the Baxter colorectal adenoma dataset and comparing our results with Greenacre's ordination-based analysis using Correspondence Analysis and Canonical Correspondence Analysis. Our models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeochemistry and Geologic Mapping · Oral microbiology and periodontitis research · Hydrocarbon exploration and reservoir analysis
