Sample Size Determination Under Selection Bias: Robust Tolerance Limits for Prevalent Cohort Data

James H. McVittie; Martin Lysy; Masoud Asgharian

arXiv:2605.19878·stat.ME·May 20, 2026

Sample Size Determination Under Selection Bias: Robust Tolerance Limits for Prevalent Cohort Data

James H. McVittie, Martin Lysy, Masoud Asgharian

PDF

TL;DR

This paper extends classical tolerance limit formulas to account for selection bias in cohort data, providing robust sample size calculations for biased sampling schemes in medical studies.

Contribution

It introduces modified formulas for sample size determination that accommodate bias, validated through simulations and real data application.

Findings

01

Modified formulas perform well under various bias conditions.

02

Simulation results show improved accuracy over traditional methods.

03

Application to dementia data demonstrates practical utility.

Abstract

Tolerance limits have received considerable attention in the statistical literature, with applications reaching far beyond their initial role in quality control. The well-known formula of Scheff\'e and Tukey (1944) establishes a simple, distribution-free relation between sample size and population coverage by two given order statistics and a given confidence level. A key requirement in applying this formula is the availability of an unbiased, representative sample from the population of interest. However, as it often happens in biological and medical applications, various logistical constraints may preclude the possibility of obtaining an unbiased sample. We derive extensions of this formula which accommodate a large class of biased sampling schemes including weight bias and censoring. The modified formulae are validated through a simulation study and compared to its unmodified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.