Paper2vec: Citation-Context Based Document Distributed Representation for Scholar Recommendation
Han Tian, Hankz Hankui Zhuo

TL;DR
This paper introduces Paper2vec, a novel method for scholar recommendation that uses citation-context based distributed representations of papers, outperforming existing citation and embedding methods by 25%.
Contribution
It proposes a new citation-context based distributed representation approach for measuring paper similarity, overcoming co-occurrence limitations of previous methods.
Findings
Outperforms state-of-the-art citation-based methods by 25%.
Better than other distributed representation methods.
Effective in capturing paper similarity from citation context.
Abstract
Due to the availability of references of research papers and the rich information contained in papers, various citation analysis approaches have been proposed to identify similar documents for scholar recommendation. Despite of the success of previous approaches, they are, however, based on co-occurrence of items. Once there are no co-occurrence items available in documents, they will not work well. Inspired by distributed representations of words in the literature of natural language processing, we propose a novel approach to measuring the similarity of papers based on distributed representations learned from the citation context of papers. We view the set of papers as the vocabulary, define the weighted citation context of papers, and convert it to weight matrix similar to the word-word cooccurrence matrix in natural language processing. After that we explore a variant of matrix…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Advanced Graph Neural Networks
