Knoop: Practical Enhancement of Knockoff with Over-Parameterization for Variable Selection
Xiaochen Zhang, Yunfeng Cai, Haoyi Xiong

TL;DR
Knoop enhances knockoff-based variable selection by generating multiple knockoffs and using over-parameterized models, leading to improved accuracy and robustness in high-dimensional datasets.
Contribution
Introduces Knoop, a novel method that over-parameterizes with multiple knockoffs and anomaly testing for more effective variable selection.
Findings
Higher AUC in ROC for variable identification
Improved predictive accuracy in regression and classification
Robust performance demonstrated in simulations and real data
Abstract
Variable selection plays a crucial role in enhancing modeling effectiveness across diverse fields, addressing the challenges posed by high-dimensional datasets of correlated variables. This work introduces a novel approach namely Knockoff with over-parameterization (Knoop) to enhance Knockoff filters for variable selection. Specifically, Knoop first generates multiple knockoff variables for each original variable and integrates them with the original variables into an over-parameterized Ridgeless regression model. For each original variable, Knoop evaluates the coefficient distribution of its knockoffs and compares these with the original coefficients to conduct an anomaly-based significance test, ensuring robust variable selection. Extensive experiments demonstrate superior performance compared to existing methods in both simulation and real-world datasets. Knoop achieves a notably…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimedia Communication and Technology · Video Analysis and Summarization
