Gold after Randomized Sand: Model-X Split Knockoffs for Controlled Transformation Selection
Yang Cao, Hangyu Lin, Xinwei Sun, Yuan Yao

TL;DR
This paper extends the Split Knockoff method to random designs, enabling controlled variable transformation selection with finite-sample FDR control in broader models, demonstrated through simulations and real-world data.
Contribution
We introduce Model-X Split Knockoffs, a novel approach that achieves FDR control in random designs by using an auxiliary randomized design to handle the interaction between randomness and deterministic transformations.
Findings
Achieves robust FDR control in simulations and real data.
Provides at least as much power as standard Model-X Knockoffs.
Demonstrates applicability to complex real-world problems like Alzheimer's imaging.
Abstract
Controlling the False Discovery Rate (FDR) is critical for reproducible variable selection, especially given the prevalence of complex predictive modeling. The recent Split Knockoff method, an extension of the canonical Knockoffs framework, offers finite-sample FDR control for selecting sparse transformations but is limited to linear models with fixed designs. Extending this framework to random designs, which would accommodate a much broader range of models, is challenged by the fundamental difficulty of reconciling a random covariate design with a deterministic linear transformation. To bridge this gap, we introduce Model-X Split Knockoffs. Our method achieves robust FDR control for transformation selection in random designs by introducing a novel auxiliary randomized design. This key innovation effectively mediates the interaction between the random design and the deterministic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Experimental Design Methods · Statistical Methods in Clinical Trials · Statistical Methods and Inference
