Originality in scientific titles and abstracts can predict citation count
Jack H. Culbert, Yoed N. Kenett, Philipp Mayr

TL;DR
This study introduces a computational measure called Divergent Semantic Integration (DSI) to quantify originality in scientific titles and abstracts, demonstrating its correlation with citation counts across various research fields.
Contribution
The paper applies DSI to a large dataset of scientific abstracts and titles, revealing its potential to predict citation impact and its variation across disciplines and over time.
Findings
DSI varies significantly between research fields.
A positive correlation exists between DSI and 5-year citation counts.
DSI shows a slight increase over time.
Abstract
In this research-in-progress paper, we apply a computational measure correlating with originality from creativity science: Divergent Semantic Integration (DSI), to a selection of 99,557 scientific abstracts and titles selected from the Web of Science. We observe statistically significant differences in DSI between subject and field of research, and a slight rise in DSI over time. We model the base 10 logarithm of the citation count after 5 years with DSI and find a statistically significant positive correlation in all fields of research with an adjusted of 0.13.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · scientometrics and bibliometrics research
MethodsBalanced Selection
