Marginal and Interactive Feature Screening of Ultra-high Dimensional Feature Spaces with Multivariate Response
Randall Reese

TL;DR
This paper introduces GenCorr, a novel feature screening method for ultra-high dimensional data with multivariate responses, improving robustness and performance over existing methods, especially in complex traits analysis.
Contribution
The paper proposes GenCorr, a new framework for marginal and interactive feature screening applicable to multivariate responses, addressing limitations of existing univariate methods.
Findings
GenCorr demonstrates strong sure screening properties.
GenCorr outperforms existing methods in numerical simulations.
Real data analysis on GWAS data validates effectiveness.
Abstract
When the number of features exponentially outnumbers the number of samples, feature screening plays a pivotal role in reducing the dimension of the feature space and developing models based on such data. While most extant feature screening approaches are only applicable to data having univariate response, we propose a new method (GenCorr) that admits a multivariate response. Such an approach allows us to more appropriately model multiple responses as a single unit, rather than as unrelated entities, which avails more robust analyses in relation to complex traits embedded in the covariance structure of multiple responses. The GenCorr framework allows for the screening of both marginal as well as interactive features. It is demonstrated that GenCorr possesses the desirable property of strong sure screening. In the marginal case, we examine the superior numerical performance of GenCorr in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gene expression and cancer classification · Statistical Methods and Bayesian Inference
