TL;DR
This paper introduces a new method for constructing knockoffs called MRC that improves power by reducing feature reconstructability, outperforming traditional MAC-minimizing knockoffs especially in correlated Gaussian models.
Contribution
The paper proposes the MRC approach for generating knockoffs, which enhances feature selection power by minimizing reconstructability, and demonstrates its effectiveness through theoretical analysis and extensive simulations.
Findings
MRC knockoffs outperform MAC knockoffs in correlated Gaussian models.
MRC knockoffs are computationally efficient and robust.
The new method minimizes estimation error in Gaussian linear models.
Abstract
Model-X knockoffs allows analysts to perform feature selection using almost any machine learning algorithm while still provably controlling the expected proportion of false discoveries. To apply model-X knockoffs, one must construct synthetic variables, called knockoffs, which effectively act as controls during feature selection. The gold standard for constructing knockoffs has been to minimize the mean absolute correlation (MAC) between features and their knockoffs, but, surprisingly, we prove this procedure can be powerless in extremely easy settings, including Gaussian linear models with correlated exchangeable features. The key problem is that minimizing the MAC creates strong joint dependencies between the features and knockoffs, which allow machine learning algorithms to partially or fully reconstruct the effect of the features on the response using the knockoffs. To improve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
