Accounting for sample selection in Bayesian analyses
Samuel R. Hinton, Alex Kim, Tamara M. Davis

TL;DR
This paper introduces Bayesian methods to correct for sample selection biases in astronomical data, using analytic approximations and Monte Carlo integration, with broad applicability across fields facing similar issues.
Contribution
It provides a unified Bayesian framework with practical solutions for both tractable and intractable models, including sample correction techniques and implementation guidance.
Findings
Effective correction for sample selection biases demonstrated with toy models
Method applicable to data truncation and impure samples
Sample code provided for practical implementation
Abstract
Astronomers are often confronted with funky populations and distributions of objects: brighter objects are more likely to be detected; targets are selected based on colour cuts; imperfect classification yields impure samples. Failing to account for these effects leads to biased analyses. In this paper we present a simple overview of a Bayesian consideration of sample selection, giving solutions to both analytically tractable and intractable models. This is accomplished via a combination of analytic approximations and Monte Carlo integration, in which dataset simulation is efficiently used to correct for issues in the observed dataset. This methodology is also applicable for data truncation, such as requiring densities to be strictly positive. Toy models are included for demonstration, along with discussions of numerical considerations and how to optimise for implementation. We provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Gaussian Processes and Bayesian Inference · Advanced Statistical Methods and Models
