Predicting Award Winning Research Papers at Publication Time

Riccardo Vella; Andrea Vitaletti; Fabrizio Silvestri

arXiv:2406.12535·cs.IR·June 19, 2024

Predicting Award Winning Research Papers at Publication Time

Riccardo Vella, Andrea Vitaletti, Fabrizio Silvestri

PDF

Open Access

TL;DR

This paper presents a method to predict whether a research paper will win an award using only information available at publication time, combining citation graph features and textual analysis.

Contribution

It introduces a novel approach that integrates citation subgraph features with textual features for early prediction of award-winning papers.

Findings

01

F1 score of 0.694 in prediction accuracy

02

High recall and low false negatives for identifying non-winning papers

03

Insights into topological and textual feature patterns

Abstract

In recent years, many studies have been focusing on predicting the scientific impact of research papers. Most of these predictions are based on citations count or rely on features obtainable only from already published papers. In this study, we predict the likelihood for a research paper of winning an award only relying on information available at publication time. For each paper, we build the citation subgraph induced from its bibliography. We initially consider some features of this subgraph, such as the density and the global clustering coefficient, to make our prediction. Then, we mix this information with textual features, extracted from the abstract and the title, to obtain a more accurate final prediction. We made our experiments considering the ArnetMiner citation graph, while the ground truth on award-winning papers has been obtained from a collection of best paper awards from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics · Big Data and Business Intelligence · Academic Writing and Publishing