A Bayes-Factor-Guided Approach to Post-Double Selection with Bootstrapped Multiple Imputation

Johannes Bleher (1); Claudia Tarantola (2) ((1) Department of Econometrics; Empirical Economics & Computational Science Hub; University of Hohenheim; (2) Department of Economics; Management; Quantitative Methods; University of Milan)

arXiv:2604.12783·stat.ME·April 23, 2026

A Bayes-Factor-Guided Approach to Post-Double Selection with Bootstrapped Multiple Imputation

Johannes Bleher (1), Claudia Tarantola (2) ((1) Department of Econometrics, Empirical Economics & Computational Science Hub, University of Hohenheim, (2) Department of Economics, Management, Quantitative Methods, University of Milan)

PDF

TL;DR

This paper introduces a Bayesian-inspired method for aggregating variable selection results from bootstrapped and multiply imputed datasets, improving model sparsity and interpretability.

Contribution

It proposes a sequential evidence aggregation procedure based on Bayes factors that adaptively determines variable relevance without predefining iteration counts.

Findings

01

Outperforms existing aggregation methods in simulation studies.

02

Provides a variable inclusion criterion and stopping rule.

03

Demonstrates effectiveness in an empirical case study.

Abstract

When variable selection methods are applied to bootstrapped and multiply imputed datasets, the set of selected variables typically varies across iterations. Aggregating results via the union rule can lead to overly dense models. We propose a sequential evidence aggregation procedure that models detection outcomes across perturbation iterations as Bernoulli trials and accumulates evidence for variable relevance through a likelihood-ratio process admitting an approximate Bayes-factor interpretation. The procedure provides both a variable inclusion criterion and a stopping rule that eliminates the need to fix the number of bootstrap-imputation iterations ex ante. A Monte Carlo study across 126 scenarios and an empirical illustration demonstrate the method's performance relative to existing aggregation approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.