A bootstrap analysis for finite populations
Tina Nane, Kasper Kooijman

TL;DR
This paper examines the impact of ignoring finite population assumptions in bootstrap methods for confidence interval construction, demonstrating that finite population correction improves validity and accuracy in bibliometric data analysis.
Contribution
It introduces a finite population correction to bootstrap methods, enhancing confidence interval validity when analyzing finite datasets.
Findings
Standard bootstrap methods fail with finite populations.
Finite population correction improves confidence interval accuracy.
Variability in estimates does not vanish as sample size approaches population size.
Abstract
Bootstrap methods are increasingly accepted as one of the common approaches in constructing confidence intervals in bibliometric studies. Typical bootstrap methods assume that the statistical population is infinite. When the statistical population is finite, a correction needs to be applied in computing the estimated variance of the estimators and thus constructing confidence intervals. We investigate the effect of overlooking the finiteness assumption of the statistical population using a dataset containing all articles in Web of Science (WoS) for Delft University of Technology from 2006 until 2009. We regard the data as our finite statistical population and consider simple random samples of various sizes. Standard bootstrap methods are firstly employed in accounting for the variability of the estimates, as well as constructing the confidence intervals. The results unveil two issues,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Analysis with R · scientometrics and bibliometrics research · Advanced Text Analysis Techniques
