Knockoffs for exchangeable categorical covariates
Emanuela Dreassi, Luca Pratelli, Pietro Rigo

TL;DR
This paper develops explicit formulas for constructing knockoffs for exchangeable categorical variables supported on a finite set, addressing theoretical gaps and analyzing robustness, with applications in genetics.
Contribution
It provides the first explicit formulas for knockoffs of exchangeable categorical variables supported on finite sets, and investigates their robustness to the underlying de Finetti measure.
Findings
Knockoffs outperform alternatives in controlling false discovery rate.
Knockoffs are slightly weaker in statistical power.
Explicit formulas enable better theoretical understanding.
Abstract
Let be a -variate random vector and a fixed finite set. In a number of applications, mainly in genetics, it turns out that for each . Despite the latter fact, to obtain a knockoff (in the sense of \cite{CFJL18}), is usually modeled as an absolutely continuous random vector. While comprehensible from the point of view of applications, this approximate procedure does not make sense theoretically, since is supported by the finite set . In this paper, explicit formulae for the joint distribution of are provided when and is exchangeable or partially exchangeable. In fact, when for all , there seem to be various reasons for assuming exchangeable or partially exchangeable. The robustness of , with respect to the de Finetti's measure of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Rough Sets and Fuzzy Logic · Advanced Clustering Algorithms Research
