Forecasting high-impact research topics via machine learning on evolving knowledge graphs
Xuemei Gu, Mario Krenn

TL;DR
This paper introduces a machine learning approach to predict the future impact of emerging research ideas by analyzing an evolving knowledge graph built from over 21 million scientific papers, enabling early impact assessment before publication.
Contribution
It presents a novel method combining semantic and impact networks in an evolving knowledge graph to forecast the impact of unpublished research ideas with high accuracy.
Findings
High prediction accuracy with AUC > 0.9 for most experiments
Effective forecasting of impact for new, unpublished research directions
Demonstrates the potential for early impact prediction in scientific discovery
Abstract
The exponential growth in scientific publications poses a severe challenge for human researchers. It forces attention to more narrow sub-fields, which makes it challenging to discover new impactful research ideas and collaborations outside one's own field. While there are ways to predict a scientific paper's future citation counts, they need the research to be finished and the paper written, usually assessing impact long after the idea was conceived. Here we show how to predict the impact of onsets of ideas that have never been published by researchers. For that, we developed a large evolving knowledge graph built from more than 21 million scientific papers. It combines a semantic network created from the content of the papers and an impact network created from the historic citations of papers. Using machine learning, we can predict the dynamic of the evolving network into the future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks
