# EXPLANA: a user-friendly workflow for EXPLoratory ANAlysis and feature selection in cross-sectional and longitudinal microbiome studies

**Authors:** Jennifer Fouquier, Maggie Stanislawski, John O’Connor, Ashley Scadden, Catherine Lozupone

PMC · DOI: 10.1093/bioinformatics/btaf658 · Bioinformatics · 2025-12-19

## TL;DR

EXPLANA is a user-friendly feature selection workflow for microbiome studies. It identifies features related to an outcome of interest in both cross-sectional and longitudinal designs and summarizes its methods and results in an interactive report.

## Contribution

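EXPLANA introduces a workflow that combines machine learning with different types of change calculations and downstream interpretation methods for feature selection in longitudinal microbiome studies. It supports numerical and categorical data and also accommodates cross-sectional studies.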

## Key findings

- On simulated longitudinal data, EXPLANA achieved a balanced accuracy of 0.91 (range: 0.79–1.00, SD = 0.05).
- In a direct comparison with an existing tool, QIIME 2 feature-volatility, EXPLANA performed better (balanced accuracy: 0.95 versus 0.56).
- The tool identified novel order-dependent categorical feature changes (e.g. a different effect for A_B versus B_A).
- EXPLANA generates an interactive report that textually and graphically summarizes methods and results.
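
To make the idea of order-dependent categorical changes concrete, here is a minimal sketch of how consecutive category values could be encoded as `from_to` labels, so that A_B and B_A remain distinct features. The function name `categorical_changes` and the example data are hypothetical, and this is an illustration of the labeling idea only, not EXPLANA's actual implementation.

```python
def categorical_changes(states):
    """Label each transition between consecutive timepoints as 'from_to'.

    `states` is one subject's category at each timepoint, in time order.
    Keeping the direction in the label means A_B and B_A are separate
    values, so a downstream model could assign them different effects.
    """
    return [f"{previous}_{current}" for previous, current in zip(states, states[1:])]


# Hypothetical subject whose category switches A -> B -> A, then stays at A.
transitions = categorical_changes(["A", "B", "A", "A"])
print(transitions)
```

An unordered encoding (for example, a flag for "category changed") would collapse A_B and B_A into one value and hide any difference in their effects.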

## Abstract

Longitudinal microbiome studies (LMS) are increasingly common but have analytic challenges including nonindependent data requiring mixed-effects models. Furthermore, large amounts of data motivate exploratory analysis to identify factors related to outcome variables. Although change analysis (i.e. calculating feature changes between timepoints) can be powerful, how to best conduct these analyses is often unclear. For example, observational LMS measurements show natural fluctuations, so baseline might not be a reference of primary interest, whereas for interventional LMS, baseline is typically a key reference point, often indicating the start of treatment.
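
The choice of reference point described above can be sketched in a few lines. The example below computes per-subject changes for one numerical feature, either against the previous timepoint or against baseline. The function name `feature_changes`, the `reference` parameter, and the sample records are assumptions made for illustration; they are not taken from the EXPLANA software.

```python
from collections import defaultdict


def feature_changes(records, reference="previous"):
    """Compute per-subject changes in one numerical feature.

    records   : iterable of (subject, timepoint, value) tuples
    reference : "previous" compares each timepoint to the one before it;
                "baseline" compares each timepoint to the subject's first one.
    Returns {subject: [(reference_timepoint, timepoint, delta), ...]}.
    """
    if reference not in ("previous", "baseline"):
        raise ValueError("reference must be 'previous' or 'baseline'")

    by_subject = defaultdict(list)
    for subject, timepoint, value in records:
        by_subject[subject].append((timepoint, value))

    changes = {}
    for subject, series in by_subject.items():
        series.sort()  # order each subject's measurements by timepoint
        deltas = []
        for i in range(1, len(series)):
            ref_tp, ref_val = series[0] if reference == "baseline" else series[i - 1]
            tp, val = series[i]
            deltas.append((ref_tp, tp, val - ref_val))
        changes[subject] = deltas
    return changes


# Hypothetical measurements: (subject, timepoint, feature value).
records = [
    ("s1", 0, 10.0), ("s1", 1, 12.0), ("s1", 2, 9.0),
    ("s2", 0, 5.0), ("s2", 1, 5.5),
]
from_baseline = feature_changes(records, reference="baseline")
from_previous = feature_changes(records, reference="previous")
```

For subject `s1`, the two references agree on the first change but differ on the second: the value at timepoint 2 is compared with timepoint 0 under `"baseline"` and with timepoint 1 under `"previous"`. That difference is why the reference point matters for interventional versus observational designs.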

To address these challenges, a feature selection workflow, called EXPLANA (EXPLoratory ANAlysis), was developed for LMS that supports numerical and categorical data, and also accommodates cross-sectional studies. Machine learning methods were combined with different types of change calculations and downstream interpretation methods to identify statistically meaningful variables and explain their relationship to outcomes. EXPLANA generates an interactive report that textually and graphically summarizes methods and results. EXPLANA had good performance on simulated longitudinal data, with a balanced accuracy score of 0.91 (range: 0.79–1.00, SD = 0.05), outperformed an existing tool, QIIME 2 feature-volatility (balanced accuracy: 0.95 versus 0.56) and identified novel order-dependent categorical feature changes (e.g. different effect for A_B versus B_A). EXPLANA is broadly applicable and simplifies analytics for identifying features related to outcomes of interest.
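
For readers unfamiliar with the metric reported above, balanced accuracy for a binary decision is the mean of sensitivity and specificity. The sketch below assumes, purely for illustration, that each simulated feature is labeled as truly related to the outcome (1) or not (0) and that the workflow either selects it (1) or does not (0). The function and the example labels are hypothetical and do not reproduce the paper's evaluation.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity (true positive rate) and specificity (true negative rate)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# Hypothetical example: 4 truly related features, 6 unrelated ones.
truth = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
selected = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
score = balanced_accuracy(truth, selected)
```

Because the metric averages the two class-wise rates, it is not inflated when unrelated features greatly outnumber related ones, which is the usual situation in feature selection.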

Software is available at https://github.com/JTFouquier/explana and https://zenodo.org/records/17478745 (DOI: 10.5281/zenodo.17478744). Documentation and demos are available at www.explana.io.

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, F5 (coagulation factor V) [NCBI Gene 2153] {aka FVL, PCCF, RPRGL1, THPH2, fV}
- **Diseases:** ML (MESH:D007859), gastrointestinal distress (MESH:D012128), Inflammatory Disease (MESH:D007249), heart arrhythmia (MESH:D001145), FV (OMIM:600512), depression (MESH:D003866), cardiovascular disease (MESH:D002318), HIV (MESH:D015658), cancer (MESH:D009369), heartbeat (MESH:D005117), ASD (MESH:D000067877), obesity (MESH:D009765), Clostridioides difficile infection (MESH:D003015), LMS (MESH:D017887)
- **Chemicals:** amiodarone (MESH:D000638), quinidine (MESH:D011802)
- **Species:** Ruminococcus (genus) [taxon 1263], Homo sapiens (human, species) [taxon 9606], gut metagenome (species) [taxon 749906], Lactococcus (lactic streptococci, genus) [taxon 1357], Allobaculum (genus) [taxon 174708], Anaerostipes (genus) [taxon 207244], Blautia (genus) [taxon 572511], Canis lupus familiaris (dog, subspecies) [taxon 9615], Veillonella (genus) [taxon 29465], Roseburia (genus) [taxon 841], Bifidobacterium (genus) [taxon 1678], Paracoccus (genus) [taxon 249411], Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12766912/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12766912/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/PMC12766912/full.md

---
Source: https://tomesphere.com/paper/PMC12766912