Overcoming Selection Bias in Statistical Studies With Amortized Bayesian Inference

Jonas Arruda; Sophie Chervet; Paula Staudt; Andreas Wieser; Michael Hoelscher; Isabelle Sermet-Gaudelus; Nadine Binder; Lulla Opatowski; Jan Hasenauer

arXiv:2604.18319·stat.ML·April 21, 2026

Overcoming Selection Bias in Statistical Studies With Amortized Bayesian Inference

Jonas Arruda, Sophie Chervet, Paula Staudt, Andreas Wieser, Michael Hoelscher, Isabelle Sermet-Gaudelus, Nadine Binder, Lulla Opatowski, Jan Hasenauer

PDF

TL;DR

This paper introduces a simulation-based Bayesian inference method that explicitly models and corrects for selection bias, enabling unbiased estimation even in complex, high-dimensional models.

Contribution

It develops a bias-aware amortized Bayesian inference framework that incorporates the selection mechanism directly into the generative model, allowing for debiased estimates and bias testing.

Findings

01

The method recovers well-calibrated posterior distributions across diverse selection scenarios.

02

It outperforms traditional likelihood-based approaches in biased settings.

03

The framework includes diagnostics for bias detection and model validation.

Abstract

Selection bias arises when the probability that an observation enters a dataset depends on variables related to the quantities of interest, leading to systematic distortions in estimation and uncertainty quantification. For example, in epidemiological or survey settings, individuals with certain outcomes may be more likely to be included, resulting in biased prevalence estimates with potentially substantial downstream impact. Classical corrections, such as inverse-probability weighting or explicit likelihood-based models of the selection process, rely on tractable likelihoods, which limits their applicability in complex stochastic models with latent dynamics or high-dimensional structure. Simulation-based inference enables Bayesian analysis without tractable likelihoods but typically assumes missingness at random and thus fails when selection depends on unobserved outcomes or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.