Principal Balances of Compositional Data for Regression and Classification using Partial Least Squares
V. Nesrstová (1), I. Wilms (2), J. Palarea-Albaladejo (3), P. Filzmoser (4), J. A. Martín-Fernández (3), D. Friedecký (5), K. Hron (1) ((1) Department of Mathematical Analysis and Applications of Mathematics, Palacký University Olomouc, Faculty of Science

TL;DR
This paper introduces a novel PLS-based method for constructing principal balances from high-dimensional compositional data. The balances maximize the explained variability of the response in regression and classification tasks and are easier to interpret than ordinary PLS components.
Contribution
It develops a new PLS approach that constructs multiple orthonormal log-contrast balances, improving interpretability and performance over traditional methods.
Findings
Effective in capturing variability in compositional data
Improves interpretability in regression and classification
Performs well on simulated and real datasets
Abstract
High-dimensional compositional data are commonplace in the modern omics sciences, among others. Analysis of compositional data requires a proper choice of orthonormal coordinate representation, as their relative nature is not compatible with the direct use of standard statistical methods. Principal balances, a specific class of log-ratio coordinates, are well suited to this context since they are constructed in such a way that the first few coordinates capture most of the variability in the original data. Focusing on regression and classification problems in high dimensions, we propose a novel Partial Least Squares (PLS) based procedure to construct principal balances that maximize the explained variability of the response variable and notably facilitate interpretability when compared to the ordinary PLS formulation. The proposed PLS principal balance approach can be understood as a…
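For readers unfamiliar with balances, the sketch below illustrates the standard definition of a single balance coordinate as a normalized log-contrast between two disjoint groups of parts. It is not the PLS principal balance procedure proposed in the paper; the function names `balance` and `balance_loadings` and the example composition are hypothetical and chosen only for illustration.

```python
import numpy as np

def balance(x, num, den):
    """Balance between two disjoint groups of parts of a composition x.

    Standard definition: sqrt(r*s/(r+s)) * ln(gm(x[num]) / gm(x[den])),
    where gm is the geometric mean and r, s are the group sizes.
    """
    x = np.asarray(x, dtype=float)
    r, s = len(num), len(den)
    # The log of a geometric mean equals the mean of the logs.
    log_gm_num = np.mean(np.log(x[num]))
    log_gm_den = np.mean(np.log(x[den]))
    return np.sqrt(r * s / (r + s)) * (log_gm_num - log_gm_den)

def balance_loadings(n_parts, num, den):
    """Log-contrast coefficients of the same balance (illustrative helper).

    The coefficients sum to zero and have unit Euclidean norm, so that
    balance(x, num, den) == balance_loadings(...) @ log(x).
    """
    r, s = len(num), len(den)
    w = np.zeros(n_parts)
    w[num] = np.sqrt(s / (r * (r + s)))
    w[den] = -np.sqrt(r / (s * (r + s)))
    return w

# Hypothetical 4-part composition; parts 0 and 1 are contrasted with 2 and 3.
x = np.array([0.1, 0.2, 0.3, 0.4])
num, den = [0, 1], [2, 3]
b = balance(x, num, den)
w = balance_loadings(len(x), num, den)
```

Because the coefficients sum to zero, the balance depends only on ratios between parts: rescaling `x` by any positive constant leaves `b` unchanged, which is why such coordinates suit data that carry only relative information.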
Taxonomy
Topics: Geochemistry and Geologic Mapping · Metabolomics and Mass Spectrometry Studies
