A Bayes-Factor-Guided Approach to Post-Double Selection with Bootstrapped Multiple Imputation
Johannes Bleher (1), Claudia Tarantola (2) ((1) Department of Econometrics, Empirical Economics & Computational Science Hub, University of Hohenheim, (2) Department of Economics, Management, Quantitative Methods, University of Milan)

TL;DR
This paper introduces a Bayesian-inspired method for aggregating variable selection results from bootstrapped and multiply imputed datasets, improving model sparsity and interpretability.
Contribution
It proposes a sequential evidence aggregation procedure based on Bayes factors that adaptively determines variable relevance without predefining iteration counts.
Findings
Outperforms existing aggregation methods in simulation studies.
Provides a variable inclusion criterion and stopping rule.
Demonstrates effectiveness in an empirical case study.
Abstract
When variable selection methods are applied to bootstrapped and multiply imputed datasets, the set of selected variables typically varies across iterations. Aggregating results via the union rule can lead to overly dense models. We propose a sequential evidence aggregation procedure that models detection outcomes across perturbation iterations as Bernoulli trials and accumulates evidence for variable relevance through a likelihood-ratio process admitting an approximate Bayes-factor interpretation. The procedure provides both a variable inclusion criterion and a stopping rule that eliminates the need to fix the number of bootstrap-imputation iterations ex ante. A Monte Carlo study across 126 scenarios and an empirical illustration demonstrate the method's performance relative to existing aggregation approaches.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
