Multidimensional scaling informed by $F$-statistic: Visualizing grouped microbiome data with inference
Hyungseok Kim, Soobin Kim, Jeffrey A. Kimbrel, Megan M. Morris, Xavier, Mayali, Cullen R. Buie

TL;DR
This paper introduces an $F$-statistic informed multidimensional scaling method for microbiome data visualization, which is robust, statistically sound, and comparable to existing techniques in preserving data structure.
Contribution
The study presents a novel MDS-based ordination method that integrates $F$-statistics for improved, assumption-free visualization of microbiome data.
Findings
Robustness to hyperparameter selection
Maintains statistical significance in ordination
Comparable to state-of-the-art methods in data structure preservation
Abstract
Multidimensional scaling (MDS) is a dimensionality reduction technique for microbial ecology data analysis that represents the multivariate structure while preserving pairwise distances between samples. While its improvement has enhanced the ability to reveal data patterns by sample groups, these MDS-based methods require prior assumptions for inference, limiting their application in general microbiome analysis. In this study, we introduce a new MDS-based ordination, -informed MDS, which configures the data distribution based on the -statistic, the ratio of dispersion between groups sharing common and different characteristics. Using simulated compositional datasets, we demonstrate that the proposed method is robust to hyperparameter selection while maintaining statistical significance throughout the ordination process. Various quality metrics for evaluating dimensionality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genomics and Phylogenetic Studies · Metabolomics and Mass Spectrometry Studies
