Do we measure novelty when we analyze unusual combinations of cited references? A validation study of bibliometric novelty indicators based on F1000Prime data
Lutz Bornmann, Alexander Tekles, Helena H. Zhang, Fred Y. Ye

TL;DR
This study evaluates the validity of bibliometric novelty indicators based on cited references by comparing them with expert assessments in biomedical research, finding that one score correlates well while the other does not.
Contribution
It provides a quantitative validation of two bibliometric novelty scores against expert judgments, recommending the use of one score over the other.
Findings
Novelty score U correlates with expert assessments.
Novelty score W shows limited validity.
Score U is recommended for measuring novelty.
Abstract
Lee, Walsh, and Wang (2015) - based on Uzzi, Mukherjee, Stringer, and Jones (2013) - and Wang, Veugelers, and Stephan (2017) proposed scores based on cited references (cited journals) data which can be used to measure the novelty of papers (named as novelty scores U and W in this study). Although previous research has used novelty scores in various empirical analyses, no study has been published up to now - to the best of our knowledge - which quantitatively tested the convergent validity of novelty scores: do these scores measure what they propose to measure? Using novelty assessments by faculty members (FMs) at F1000Prime for comparison, we tested the convergent validity of the two novelty scores (U and W). FMs' assessments not only refer to the quality of biomedical papers, but also to their characteristics (by assigning certain tags to the papers): for example, are the presented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
