Derandomizing Knockoffs

Zhimei Ren; Yuting Wei; Emmanuel Cand\`es

arXiv:2012.02717·stat.ME·December 7, 2020·6 cites

Derandomizing Knockoffs

Zhimei Ren, Yuting Wei, Emmanuel Cand\`es

PDF

Open Access 2 Repos

TL;DR

This paper introduces a derandomization technique for Model-X knockoffs, enhancing stability and power in variable selection while maintaining rigorous error control, demonstrated through simulations and genome-wide association studies.

Contribution

It proposes a flexible derandomization method for knockoffs that improves stability and power without losing statistical guarantees.

Findings

01

Derandomized knockoffs control PFER and k-FWER.

02

Enhanced power compared to existing algorithms.

03

Successfully applied to genome-wide association studies.

Abstract

Model-X knockoffs is a general procedure that can leverage any feature importance measure to produce a variable selection algorithm, which discovers true effects while rigorously controlling the number or fraction of false positives. Model-X knockoffs is a randomized procedure which relies on the one-time construction of synthetic (random) variables. This paper introduces a derandomization method by aggregating the selection results across multiple runs of the knockoffs algorithm. The derandomization step is designed to be flexible and can be adapted to any variable selection base procedure to yield stable decisions without compromising statistical power. When applied to the base procedure of Janson et al. (2016), we prove that derandomized knockoffs controls both the per family error rate (PFER) and the k family-wise error rate (k-FWER). Further, we carry out extensive numerical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenetic Associations and Epidemiology · Statistical Methods and Inference · Statistical Methods in Clinical Trials