Investigating the originality of scientific papers across time and domain: A quantitative analysis
Jack H. Culbert, Yoed N. Kenett, Philipp Mayr

TL;DR
This paper introduces a computational measure called Divergent Semantic Integration (DSI), adapted with SciBERT, to quantify the originality of scientific texts and analyzes its relationship with citations across fields and time.
Contribution
It advances the DSI metric by integrating SciBERT for scientific text analysis and applies it to a large dataset to assess originality's impact on citations.
Findings
DSI correlates with early citations more for papers with fewer authors.
DSI varies significantly across scientific fields.
DSI's correlation with citation counts declines over time.
Abstract
The study of creativity in science has long sought quantitative metrics capable of capturing the originality of the scientific insights contained within articles and other scientific works. In recent years, the field has witnessed a substantial expansion of research activity, enabled by advances in natural language processing and network analysis, and has utilised both macro- and micro-scale approaches with success. However, they often do not examine the text itself for evidence of originality. In this paper, we apply a computational measure correlating with originality from creativity science, Divergent Semantic Integration (DSI), to a set of 51,200 scientific abstracts and titles sourced from the Web of Science. To adapt DSI for application to scientific texts, we advance the original BERT method by incorporating SciBERT (a model trained on scientific corpora) into the computation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsscientometrics and bibliometrics research · Biomedical Text Mining and Ontologies · Creativity in Education and Neuroscience
