A Potential Approach to Overcome Data Limitation in Scientific Publication Recommendation
Hung Nghiep Tran, Tin Huynh, Kiem Hoang

TL;DR
This paper explores using researcher-referenced publications as an alternative data source for scientific publication recommendation, combining theoretical and empirical analyses to assess its viability and advantages.
Contribution
It introduces a novel approach to overcome data limitations by leveraging referenced publications and provides a new dataset for the computer science domain.
Findings
The approach is generally reasonable and advantageous.
Empirical results show both positive and negative outcomes.
A new dataset has been published to support further research.
Abstract
Data are essential for the experiments of relevant scientific publication recommendation methods but it is difficult to build ground truth data. A naturally promising solution is using publications that are referenced by researchers to build their ground truth data. Unfortunately, this approach has not been explored in the literature, so its applicability is still a gap in our knowledge. In this research, we systematically study this approach by theoretical and empirical analyses. In general, the results show that this approach is reasonable and has many advantages. However, the empirical analysis shows both positive and negative results. We conclude that, in some situations, this is a useful alternative approach toward overcoming data limitation. Based on this approach, we build and publish a dataset in computer science domain to help advancing other researches.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
