# Correction for Participation Bias in Nonprobability Samples Using Multiple Reference Surveys

**Authors:** Victoria Landsman, Lingxiao Wang, Ivan Carrillo‐Garcia, Aya A. Mitani, Peter M. Smith, Barry I. Graubard, Trang Bui, Nancy Carnide

PMC · DOI: 10.1002/sim.70403 · Statistics in Medicine · 2026-02-23

## TL;DR

This paper proposes a general framework for correcting participation bias in nonprobability samples using more than one reference survey, focusing on calibration estimators and on variance estimation that accounts for all sources of variability.

## Contribution

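A general framework for addressing participation bias in nonprobability samples using multiple reference surveys. Previously published single-reference methods are special cases of it, as are the calibration estimators the paper focuses on. Two variance estimators are described: Taylor linearization and the leave‐one‐out jackknife.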

## Key findings

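- In simulations, the raking ratio calibration estimator performed satisfactorily when participation probabilities in the nonprobability sample were highly dispersed, and it gave markedly smaller variance estimates for continuous outcomes.
- Calibration estimators are more flexible when access to survey microdata is limited, and they are straightforward to implement.
- The methods are illustrated using data from a real-world study of working adults in Canada.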
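As a rough illustration of what raking ratio calibration does, the sketch below applies generic iterative proportional fitting so that weighted sample margins match estimated population totals. It is not the authors' estimator. The function name `rake_weights`, the toy data, and the idea that each margin's totals come from a different reference survey are assumptions made for illustration.

```python
import numpy as np

def rake_weights(base_weights, margins, targets, max_iter=500, tol=1e-10):
    """Generic raking (iterative proportional fitting), illustrative only.

    base_weights : (n,) starting weights for the nonprobability sample
    margins      : list of (n,) integer-coded categorical variables
    targets      : list of arrays with the estimated population total
                   for each category of the matching variable
    Assumes every category is present in the sample and that all
    target vectors sum to the same population size.
    """
    w = np.asarray(base_weights, dtype=float).copy()
    for _ in range(max_iter):
        max_change = 0.0
        for codes, totals in zip(margins, targets):
            totals = np.asarray(totals, dtype=float)
            # Current weighted total in each category of this variable.
            current = np.bincount(codes, weights=w, minlength=len(totals))
            factors = totals / current
            max_change = max(max_change, np.max(np.abs(factors - 1.0)))
            # Scale each unit's weight by the factor of its category.
            w *= factors[codes]
        if max_change < tol:
            break
    return w

# Toy nonprobability sample (all values are made up).
sex = np.array([0, 0, 0, 0, 1, 1, 1, 1])
age = np.array([0, 0, 1, 1, 0, 1, 1, 1])
y = np.array([3.0, 5.0, 2.0, 4.0, 6.0, 1.0, 2.0, 3.0])

# Hypothetical totals, e.g. sex from one reference survey, age from another.
sex_totals = np.array([480.0, 520.0])
age_totals = np.array([300.0, 700.0])

w = rake_weights(np.ones(len(y)), [sex, age], [sex_totals, age_totals])
calibrated_mean = np.sum(w * y) / np.sum(w)
```

Only category totals are needed here, not unit-level reference data, which is the practical appeal of calibration when microdata access is limited.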

## Abstract

Health researchers are increasingly adopting nonprobability sampling strategies in survey studies. However, the participation mechanism in such samples is unknown and estimated target parameters and exposure‐outcome associations obtained from nonprobability samples can be biased. Current approaches developed to support statistical inference from nonprobability samples are unable to accommodate more than one reference sample. In this paper, we propose a general framework to address participation bias in nonprobability samples using multiple reference surveys. Previously published methods that use one reference survey are special cases within this framework. We focus primarily on the calibration estimators, another important special case in the proposed framework. These estimators have greater flexibility in situations with limited access to survey microdata and are straightforward for practical implementation. We describe two methods for variance estimation that account for all sources of variability of the proposed estimators: (1) the Taylor linearization method, which provides an analytic formula for the variance estimator, and (2) the leave‐one‐out jackknife method, a replication estimator. We assess the performance of the various methods through an extensive simulation study, which demonstrated satisfactory performance of the raking ratio calibration estimator in situations with highly dispersed participation probabilities in nonprobability samples and markedly smaller variance estimates for continuous outcomes. Finally, we illustrate the application of these methods using data from a real‐world study of working adults in Canada.
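The leave‐one‐out jackknife mentioned in the abstract can be sketched in its simplest textbook form: recompute the estimate with each unit dropped in turn and combine the replicates. The code below is a minimal sketch under strong assumptions (a single sample of independent units, fixed weights, no strata or clusters). It does not capture the variability contributed by the reference surveys, which the paper's variance estimators are designed to account for. The helper name `jackknife_variance` and the toy data are hypothetical.

```python
import numpy as np

def jackknife_variance(estimate_fn, n):
    """Delete-one jackknife variance of an estimator.

    estimate_fn : callable taking an array of retained unit indices
                  and returning a scalar estimate
    n           : number of units in the sample
    """
    idx = np.arange(n)
    # One replicate per unit: the estimate with that unit removed.
    replicates = np.array([estimate_fn(idx[idx != i]) for i in range(n)])
    return (n - 1) / n * np.sum((replicates - replicates.mean()) ** 2)

# Toy outcome and weights (made-up values).
y = np.array([3.0, 5.0, 2.0, 4.0, 6.0, 1.0, 2.0, 3.0])
w = np.array([120.0, 80.0, 150.0, 130.0, 90.0, 160.0, 140.0, 130.0])

def weighted_mean(keep):
    # Weighted mean over the retained units only.
    return np.sum(w[keep] * y[keep]) / np.sum(w[keep])

var_hat = jackknife_variance(weighted_mean, len(y))
se_hat = np.sqrt(var_hat)
```

In a fuller implementation the weights would be re-estimated inside `estimate_fn` for every replicate, so that the uncertainty from the weighting step is reflected in the variance.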

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12928181/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12928181/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/PMC12928181/full.md

---
Source: https://tomesphere.com/paper/PMC12928181