Multiple two-sample testing under arbitrary covariance dependency with an application in imaging mass spectrometry
Vladimir Vutov, Thorsten Dickhaus

TL;DR
This paper develops a new statistical method for large-scale two-sample testing that accounts for complex dependencies among tests, specifically applied to imaging mass spectrometry data in cancer research.
Contribution
It introduces a marginal model-based inference procedure that estimates the correlation structure among test statistics for improved false discovery control.
Findings
Method effectively controls false discovery proportion in dependent data
Application to MALDI IMS data reveals significant molecular associations
Provides a practical workflow for high-dimensional dependent hypothesis testing
Abstract
Large-scale hypothesis testing has become a ubiquitous problem in high-dimensional statistical inference, with broad applications in various scienfitic disciplines. One relevant application is constituted by imaging mass spectrometry (IMS) association studies, where a large number of tests are performed simultaneously in order to identify molecular masses that are associated with a particular phenotype, e. g., a cancer subtype. Mass spectra obtained from Matrix-assisted laser desorption/ionization (MALDI) experiments are dependent, when considered as statistical quantities. False discovery proportion (FDP) control under arbitrary dependency structure among test statistics is an active topic in modern multiple testing research. In this context, we are concerned with the evaluation of associations between the binary outcome variable (describing the phenotype) and multiple predictors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPesticide Residue Analysis and Safety · Statistical Methods and Inference · Spectroscopy and Chemometric Analyses
