AdapDISCOM: An Adaptive Sparse Regression Method for High-Dimensional Multimodal Data With Block-Wise Missingness and Measurement Errors
Maimouna Bald\'e, Abdoul O. Diakit\'e, Claudia Moreau, Gleb Bezgin, Nikhil Bhagwat, Pedro Rosa-Neto, Jean-Baptiste Poline, Simon Girard, Amadou Barry (and for the Alzheimers Disease Neuroimaging Initiative)

TL;DR
AdapDISCOM is a new adaptive sparse regression method designed to handle high-dimensional multimodal data with block-wise missingness and measurement errors, improving prediction and feature selection in biomedical applications.
Contribution
It introduces modality-specific weighting schemes and robust variants, with theoretical guarantees and superior performance over existing methods in challenging data scenarios.
Findings
Outperforms existing methods like DISCOM, SCOM, and CoCoLasso in simulations.
Provides reliable biomarker selection and prediction in ADNI data.
Demonstrates robustness under heavy-tailed distributions and data contamination.
Abstract
Multimodal high-dimensional data are increasingly prevalent in biomedical research, yet they are often compromised by block-wise missingness and measurement errors, posing significant challenges for statistical inference and prediction. We propose AdapDISCOM, a novel adaptive direct sparse regression method that simultaneously addresses these two pervasive issues. Building on the DISCOM framework, AdapDISCOM introduces modality-specific weighting schemes to account for heterogeneity in data structures and error magnitudes across modalities. We establish the theoretical properties of AdapDISCOM, including model selection consistency and convergence rates under sub-Gaussian and heavy-tailed settings, and develop robust and computationally efficient variants (AdapDISCOM-Huber and Fast-AdapDISCOM). Extensive simulations demonstrate that AdapDISCOM consistently outperforms existing methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference
