Adjusting the Benjamini-Hochberg method for controlling the false discovery rate in knockoff assisted variable selection
Sanat K. Sarkar, Cheng Yong Tang

TL;DR
This paper adapts the Benjamini-Hochberg procedure for knockoff-based variable selection in high-dimensional regression, providing a more flexible and powerful false discovery rate control method.
Contribution
It introduces a modified BH procedure that is valid for knockoff-based variable selection, independent of variable correlation structure, and includes a data-adaptive version estimating the proportion of nulls.
Findings
The proposed method controls FDR effectively in simulations.
It outperforms existing knockoff methods in power.
The method is validated on real data applications.
Abstract
The knockoff-based multiple testing setup of Barber & Candes (2015) for variable selection in multiple regression where sample size is as large as the number of explanatory variables is considered. The method of Benjamini & Hochberg (1995) based on ordinary least squares estimates of the regression coefficients is adjusted to the setup, transforming it to a valid p-value based false discovery rate controlling method not relying on any specific correlation structure of the explanatory variables. Simulations and real data applications show that our proposed method that is agnostic to {\pi}0, the proportion of unimportant explanatory variables, and a data-adaptive version of it that uses an estimate of {\pi}0 are powerful competitors of the false discovery rate controlling method in Barber & Candes (2015).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
