Derandomizing Knockoffs
Zhimei Ren, Yuting Wei, Emmanuel Cand\`es

TL;DR
This paper introduces a derandomization technique for Model-X knockoffs, enhancing stability and power in variable selection while maintaining rigorous error control, demonstrated through simulations and genome-wide association studies.
Contribution
It proposes a flexible derandomization method for knockoffs that improves stability and power without losing statistical guarantees.
Findings
Derandomized knockoffs control PFER and k-FWER.
Enhanced power compared to existing algorithms.
Successfully applied to genome-wide association studies.
Abstract
Model-X knockoffs is a general procedure that can leverage any feature importance measure to produce a variable selection algorithm, which discovers true effects while rigorously controlling the number or fraction of false positives. Model-X knockoffs is a randomized procedure which relies on the one-time construction of synthetic (random) variables. This paper introduces a derandomization method by aggregating the selection results across multiple runs of the knockoffs algorithm. The derandomization step is designed to be flexible and can be adapted to any variable selection base procedure to yield stable decisions without compromising statistical power. When applied to the base procedure of Janson et al. (2016), we prove that derandomized knockoffs controls both the per family error rate (PFER) and the k family-wise error rate (k-FWER). Further, we carry out extensive numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Statistical Methods and Inference · Statistical Methods in Clinical Trials
