# Orthonormal pairwise logratio selection (OPALS) algorithm for compositional data analysis in high dimensions

**Authors:** Paulína Jašková, Javier Palarea-Albaladejo, Karel Hron, Dominik Lachman, Matthias Templ, Magali Berland

PMC · DOI: 10.1093/bioadv/vbaf229 · Bioinformatics Advances · 2025-10-01

## TL;DR

The paper introduces the OPALS algorithm to efficiently analyze high-dimensional compositional data using pairwise logratios.

## Contribution

The novel contribution is an algorithm that computes orthonormal pairwise logratios efficiently using Latin squares theory.

## Key findings

- OPALS reduces computational burden for high-dimensional compositional data analysis.
- The algorithm is demonstrated on molecular biology data examples.
- Relationships between orthonormal pairwise logratios and pivot coordinates are explored in regression and classification.

## Abstract

In the analysis of compositional data, the most fundamental information is conveyed by the pairwise logratios between components. While logratio coordinate representations, such as balances and pivot coordinates, are widely used to aggregate such information into higher-level relationships, there are instances where a fine-grained representation using all pairwise logratios can be advantageous. Performing this within an orthonormal (or orthogonal) logratio coordinate framework becomes particularly challenging for high-dimensional compositions, since a composition with D parts results in D(D−1)/2 pairwise logratios (excluding reciprocals). This work presents an efficient algorithm (OPALS) based on Latin squares theory to obtain all orthonormal pairwise logratios from just D−1 logratio coordinate systems. Thus, the computational burden associated with using such representation for data analysis and modelling in high dimensions is notably alleviated, or even made feasible. Moreover, the relationship between estimates from orthonormal pairwise logratios and ordinary pivot coordinates is discussed in the context of regression and classification analysis.

The OPALS algorithm is described in detail in this article and can be implemented directly from the provided methodology. The performance and properties of the method are illustrated through two examples using contemporary molecular biology data.

## Full-text entities

- **Diseases:** liver diseases (MESH:D008107), Liver cirrhosis (MESH:D008103), CoDA (MESH:D058617)
- **Chemicals:** PropionateCH3.1 (-), amino acids (MESH:D000596), butyrate (MESH:D002087), fatty acids (MESH:D005227), propionate (MESH:D011422), acetate (MESH:D000085), glucose (MESH:D005947), hypoxanthine (MESH:D019271), CH4 (MESH:D008697), uracil (MESH:D014498), tyrosine (MESH:D014443)
- **Species:** Homo sapiens (human, species) [taxon 9606], Bos taurus (bovine, species) [taxon 9913], gut metagenome (species) [taxon 749906]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12641611/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12641611/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12641611/full.md

---
Source: https://tomesphere.com/paper/PMC12641611