Expected values in percentile indicators

Lutz Bornmann; Robin Haunschild

arXiv:1611.05206·cs.DL·November 17, 2016

Expected values in percentile indicators

Lutz Bornmann, Robin Haunschild

PDF

Open Access

TL;DR

This paper empirically tests whether the expected proportion of highly cited papers (PP(top x%)) matches the actual observed proportions when sampling from a citation database, revealing deviations from the theoretical expectation.

Contribution

It provides an empirical evaluation of the expected values of percentile-based citation indicators and quantifies deviations through random sampling.

Findings

01

Deviations from expected PP(top x%) values are observed in empirical samples.

02

Sampling variability affects the accuracy of percentile indicators.

03

Results inform the reliability of using PP(top x%) in research assessment.

Abstract

PP(top x%) is the proportion of papers of a unit (e.g. an institution or a group of researchers), which belongs to the x% most frequently cited papers in the corresponding fields and publication years. It has been proposed that x% of papers can be expected which belongs to the x% most frequently cited papers. In this Letter to the Editor we will present the results of an empirical test whether we can really have this expectation and how strong the deviations from the expected values are when many random samples are drawn from the database.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topicsscientometrics and bibliometrics research