Sparse Variable Selection on High Dimensional Heterogeneous Data with Tree Structured Responses
Hui Liu, Xiang Liu, Jing Diao, Wenting Ye, Xueling Liu, Dehui Wei

TL;DR
This paper introduces a tree-guided sparse linear mixed model for variable selection in high-dimensional, heterogeneous data with complex response dependencies, improving accuracy over existing methods.
Contribution
The proposed model leverages response variable dependencies via a tree structure to enhance variable selection in heterogeneous data, addressing confounder effects.
Findings
Outperforms existing methods in synthetic data tests.
Achieves highest ROC area in real data experiments.
Effectively utilizes response dependencies for better variable selection.
Abstract
We consider the problem of sparse variable selection on high dimension heterogeneous data sets, which has been taking on renewed interest recently due to the growth of biological and medical data sets with complex, non-i.i.d. structures and huge quantities of response variables. The heterogeneity is likely to confound the association between explanatory variables and responses, resulting in enormous false discoveries when Lasso or its variants are na\"ively applied. Therefore, developing effective confounder correction methods is a growing heat point among researchers. However, ordinarily employing recent confounder correction methods will result in undesirable performance due to the ignorance of the convoluted interdependency among response variables. To fully improve current variable selection methods, we introduce a model, the tree-guided sparse linear mixed model, that can utilize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Gene expression and cancer classification
