Robust nonparametric integrative analysis to decipher heterogeneity and commonality across subgroups using sparse boosting
Zihan Li, Ziye Luo, Yifan Sun

TL;DR
This paper introduces a robust nonparametric integrative analysis method using sparse boosting to identify heterogeneity and commonality across subgroups in biomedical data, accommodating nonlinear effects and data contamination.
Contribution
It develops a novel robust, nonparametric, sparse boosting approach that handles data contamination and nonlinear effects for integrative subgroup analysis.
Findings
Effective in simulations for heterogeneous data
Identifies biologically meaningful patterns in cancer data
Provides satisfactory prediction performance
Abstract
In many biomedical problems, data are often heterogeneous, with samples spanning multiple patient subgroups, where different subgroups may have different disease subtypes, stages, or other medical contexts. These subgroups may be related, but they are also expected to have differences with respect to the underlying biology. The heterogeneous data presents a precious opportunity to explore the heterogeneities and commonalities between related subgroups. Unfortunately, effective statistical analysis methods are still lacking. Recently, several novel methods based on integrative analysis have been proposed to tackle this challenging problem. Despite promising results, the existing studies are still limited by ignoring data contamination and making strict assumptions of linear effects of covariates on response. As such, we develop a robust nonparametric integrative analysis approach to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
