Virtual Proximity Citation (VCP): A Supervised Deep Learning Method to Relate Uncited Papers On Grounds of Citation Proximity
Rohit Rawat

TL;DR
This paper introduces Virtual Citation Proximity (VCP), a supervised deep learning method using Siamese neural networks to estimate relatedness between research papers based on citation proximity, even when papers are uncited.
Contribution
The paper presents a novel supervised deep learning approach that predicts citation proximity between uncited papers using a Siamese neural network trained on Wikipedia data.
Findings
VCP outperforms basic neural networks in predicting citation proximity.
Using actual citation distances improves relatedness estimation.
Potential for further improvement with complex neural networks and hyperparameter tuning.
Abstract
Citation based approaches have seen good progress for recommending research papers using citations in the paper. Citation proximity analysis which uses the in-text citation proximity to find relatedness between two research papers is better than co-citation analysis and bibliographic analysis. However, one common problem which exists in each approach is that paper should be well cited. If documents are not cited properly or not cited at all, then using these approaches will not be helpful. To overcome the problem, this paper discusses the approach Virtual Citation Proximity (VCP) which uses Siamese Neural Network along with the notion of citation proximity analysis and content-based filtering. To train this model, the actual distance between the two citations in a document is used as ground truth, this distance is the word count between the two citations. VCP is trained on Wikipedia…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Text and Document Classification Technologies
