Statistical Methods and Workflow for Analyzing Human Metabolomics Data
Joseph Antonelli, Brian Claggett, Mir Henglin, Jeramie D. Watrous, Kim, A. Lehmann, Pavel Hushcha, Olga Demler, Samia Mora, Teemu Niiranen, Alexandre, C. Pereira, Mohit Jain, Susan Cheng

TL;DR
This paper reviews statistical methods for analyzing large-scale human metabolomics data, proposing a step-by-step workflow to improve data interpretation and address analytical challenges in the field.
Contribution
It introduces a comprehensive, standardized framework for statistical analysis of human metabolomics data, integrating current approaches and guiding future research.
Findings
Proposes a detailed workflow for metabolomics data analysis.
Highlights key analytical challenges and potential solutions.
Emphasizes the need for standardization in the field.
Abstract
High-throughput metabolomics investigations, when conducted in large human cohorts, represent a potentially powerful tool for elucidating the biochemical diversity and mechanisms underlying human health and disease. Large-scale metabolomics data, generated using targeted or nontargeted platforms, are increasingly more common. Appropriate statistical analysis of these complex high-dimensional data is critical for extracting meaningful results from such large-scale human metabolomics studies. Herein, we consider the main statistical analytical approaches that have been employed in human metabolomics studies. Based on the lessons learned and collective experience to date in the field, we propose a step-by-step framework for pursuing statistical analyses of human metabolomics data. We discuss the range of options and potential approaches that may be employed at each stage of data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolomics and Mass Spectrometry Studies · Gene expression and cancer classification · Bioinformatics and Genomic Networks
