Robust Model-based Inference for Non-Probability Samples

Ali Rafei; Michael R. Elliott; Carol A. C. Flannagan

arXiv:2204.03215·stat.ME·April 8, 2022

Robust Model-based Inference for Non-Probability Samples

Ali Rafei, Michael R. Elliott, Carol A. C. Flannagan

PDF

Open Access

TL;DR

This paper introduces a fully model-based Bayesian bootstrap approach for inference in non-probability samples, effectively addressing selection bias and outliers while providing reliable uncertainty estimates.

Contribution

It proposes a novel Bayesian bootstrap method with Gaussian process regression for non-probability sample inference, handling model misspecification and complex survey designs.

Findings

01

Method performs well in simulations under various conditions.

02

Accurately estimates injury rates in real-world US data.

03

Provides valid confidence intervals despite complex sampling.

Abstract

With the ubiquitous availability of unstructured data, growing attention is paid as how to adjust for selection bias in such non-probability samples. The majority of the robust estimators proposed by prior literature are either fully or partially design-based, which may lead to inefficient estimates if outlying (pseudo-)weights are present. In addition, correctly reflecting the uncertainty of the adjusted estimator remains a challenge when the available reference survey is complex in the sample design. This article proposes a fully model-based method for inference using non-probability samples where the goal is to predict the outcome variable for the entire population units. We employ a Bayesian bootstrap method with Rubin's combing rules to derive the adjusted point and interval estimates. Using Gaussian process regression, our method allows for kernel matching between the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Bayesian Inference · Gaussian Processes and Bayesian Inference · Advanced Statistical Methods and Models