Quantitative Comparison of Statistical Methods for Analyzing Human Metabolomics Data
Brian L. Claggett, Joseph Antonelli, Mir Henglin, Jeramie D. Watrous,, Kim A. Lehmann, Gabriel Musso, Andrew Correia, Sivani Jonnalagadda, Olga V., Demler, Ramachandran S. Vasan, Martin G. Larson, Mohit Jain, Susan Cheng

TL;DR
This study compares traditional and modern statistical methods for analyzing complex human metabolomics data, highlighting the advantages of sparse multivariate models in high-dimensional, small-cohort scenarios.
Contribution
It provides a comprehensive comparison of statistical approaches, emphasizing the effectiveness of sparse multivariate models for high-dimensional metabolomics data analysis.
Findings
Multivariate methods outperform univariate in high-correlation scenarios.
Sparse multivariate models show greater selectivity in large metabolite datasets.
In small cohorts, sparse models offer more robust statistical power.
Abstract
Background. Emerging technologies now allow for mass spectrometry based profiling of up to thousands of small molecule metabolites (metabolomics) in an increasing number of biosamples. While offering great promise for revealing insight into the pathogenesis of human disease, standard approaches have yet to be established for statistically analyzing increasingly complex, high-dimensional human metabolomics data in relation to clinical phenotypes including disease outcomes. To determine optimal statistical approaches for metabolomics analysis, we sought to formally compare traditional statistical as well as newer statistical learning methods across a range of metabolomics dataset types. Results. In simulated and experimental metabolomics data derived from large population-based human cohorts, we observed that with an increasing number of study subjects, univariate compared to multivariate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Gene expression and cancer classification
