A bootstrap analysis for finite populations

Tina Nane; Kasper Kooijman

arXiv:1804.05274·stat.AP·April 17, 2018·ISSI·5 cites

A bootstrap analysis for finite populations

Tina Nane, Kasper Kooijman

PDF

Open Access

TL;DR

This paper examines the impact of ignoring finite population assumptions in bootstrap methods for confidence interval construction, demonstrating that finite population correction improves validity and accuracy in bibliometric data analysis.

Contribution

It introduces a finite population correction to bootstrap methods, enhancing confidence interval validity when analyzing finite datasets.

Findings

01

Standard bootstrap methods fail with finite populations.

02

Finite population correction improves confidence interval accuracy.

03

Variability in estimates does not vanish as sample size approaches population size.

Abstract

Bootstrap methods are increasingly accepted as one of the common approaches in constructing confidence intervals in bibliometric studies. Typical bootstrap methods assume that the statistical population is infinite. When the statistical population is finite, a correction needs to be applied in computing the estimated variance of the estimators and thus constructing confidence intervals. We investigate the effect of overlooking the finiteness assumption of the statistical population using a dataset containing all articles in Web of Science (WoS) for Delft University of Technology from 2006 until 2009. We regard the data as our finite statistical population and consider simple random samples of various sizes. Standard bootstrap methods are firstly employed in accounting for the variability of the estimates, as well as constructing the confidence intervals. The results unveil two issues,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Analysis with R · scientometrics and bibliometrics research · Advanced Text Analysis Techniques