A knockoff filter for high-dimensional selective inference
Rina Foygel Barber, Emmanuel J. Candes

TL;DR
This paper introduces a high-dimensional inference framework using knockoff filters that controls false discoveries and signs of effects, demonstrated through simulations and a genome-wide association study.
Contribution
It develops a novel high-dimensional inference method combining data splitting and knockoff filters to control directional FDR in complex models.
Findings
Controls directional false discovery rate in high-dimensional settings
Achieves higher power than existing methods in simulations
Successfully applied to genome-wide association data
Abstract
This paper develops a framework for testing for associations in a possibly high-dimensional linear model where the number of features/variables may far exceed the number of observational units. In this framework, the observations are split into two groups, where the first group is used to screen for a set of potentially relevant variables, whereas the second is used for inference over this reduced set of variables; we also develop strategies for leveraging information from the first part of the data at the inference step for greater power. In our work, the inferential step is carried out by applying the recently introduced knockoff filter, which creates a knockoff copy-a fake variable serving as a control-for each screened variable. We prove that this procedure controls the directional false discovery rate (FDR) in the reduced model controlling for all screened variables; this says that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
