Finite Population Sampling as n to N: Empirical Evidence for the Transition from Inference to Accuracy
Mike Crowhurst

TL;DR
This study empirically investigates how estimators behave as the sampling fraction approaches one, revealing that sampling variability diminishes significantly before full population enumeration, impacting inference in large-scale data analysis.
Contribution
The paper provides empirical evidence on the transition from inference to accuracy as the sampling fraction nears one, challenging traditional assumptions in large-scale finite population sampling.
Findings
Sampling variability approaches zero as sampling fraction nears one.
Remaining deviations are due to numerical precision and computational factors.
Results support reassessment of inferential methods in high-coverage data environments.
Abstract
The Central Limit Theorem provides a foundation for inferential statistics and hypothesis testing. It describes how standardized statistics behave under repeated sampling from large populations. However, if the size of the sample (n) becomes so large that it approaches the size of the population (N), sampling variability becomes very small, and standard errors and margins of error both approach zero. The purpose of this project was to investigate the behavior of estimators as the sampling fraction (f = n/N) approaches 1, motivated by modern data streams from administrative records, transaction logs, sensor systems, and institutional databases that capture large portions of finite populations. We constructed two finite populations with known parameters and drew repeated samples across a range of sampling fractions. We then examined the resulting randomization distributions of the sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
