Derandomized Truncated D-vine Copula Knockoffs with e-values to control the false discovery rate
Alejandro Rom\'an V\'asquez, Jos\'e Ulises M\'arquez Urbina, Graciela, Gonz\'alez Far\'ias, Gabriel Escarela

TL;DR
This paper introduces a novel derandomized method for variable selection using truncated D-vine copula knockoffs with e-values, improving statistical power and robustness in controlling the false discovery rate, especially in gene expression data.
Contribution
The paper proposes a new Truncated D-vine Copula Knockoffs algorithm with derandomization and non-parametric marginal transformations, enhancing existing methods for multivariate variable selection.
Findings
Improved statistical power through copula truncation.
Enhanced robustness and reliability in gene selection.
Superior performance compared to existing methods in simulations and real data.
Abstract
The Model-X knockoffs is a practical methodology for variable selection, which stands out from other selection strategies since it allows for the control of the false discovery rate (FDR), relying on finite-sample guarantees. In this article, we propose a Truncated D-vine Copula Knockoffs (TDCK) algorithm for sampling approximate knockoffs from complex multivariate distributions. Our algorithm enhances and improves features of previous attempts to sample knockoffs under the multivariate setting, with the three main contributions being: 1) the truncation of the D-vine copula, which reduces the dependence between the original variables and their corresponding knockoffs, improving the statistical power; 2) the employment of a straightforward non-parametric formulation for marginal transformations, eliminating the need for a specific parametric family or a kernel density estimator; 3) the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Advanced Malware Detection Techniques · Algorithms and Data Compression
