Controlling the false discovery rate in high-dimensional linear models using model-X knockoffs and $p$-values
Jinyuan Chang, Chenlong Li, Cheng Yong Tang, Zhengtian Zhu

TL;DR
This paper introduces a new method combining model-X knockoffs and debiased regression to control FDR in high-dimensional linear models, improving power and accuracy over existing methods, especially with weak signals or small samples.
Contribution
It develops a novel framework for FDR control using paired test statistics and $p$-values, with a two-step procedure that enhances power and accommodates dependent designs.
Findings
Method achieves asymptotic FDR control.
Two-step procedure improves power over baseline.
Outperforms existing approaches in simulations.
Abstract
We propose a novel multiple testing methodology for controlling the false discovery rate (FDR) in high-dimensional linear models that integrates model-X knockoff techniques with debiased penalized regression estimators. At the foundation of our methodology, we construct and study two sets of naturally paired high-dimensional test statistics and the associated -values for evaluating the same null hypotheses. The first set is shown to be asymptotically mutually independent, justifying the use of the Benjamini-Hochberg procedure. We further exploit the pairing structure through a two-step procedure aimed at improving power. Our theoretical results establish the key properties of the framework with respect to asymptotic FDR control and formally characterize the associated power gains of the two-step procedure. Importantly, our framework accommodates general dependence in the design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
