TL;DR
This paper investigates how various distributed representations of academic papers, especially contextualized embeddings, affect the effectiveness of academic expert search, highlighting the superiority of transformer-based embeddings.
Contribution
It demonstrates that transformer-based contextual embeddings significantly improve expert retrieval performance over other embedding methods.
Findings
Contextualized embeddings outperform traditional methods.
Retrofitting embeddings does not enhance retrieval.
Author contribution weighting strategies have limited impact.
Abstract
Expert search aims to find and rank experts based on a user's query. In academia, retrieving experts is an efficient way to navigate through a large amount of academic knowledge. Here, we study how different distributed representations of academic papers (i.e. embeddings) impact academic expert retrieval. We use the Microsoft Academic Graph dataset and experiment with different configurations of a document-centric voting model for retrieval. In particular, we explore the impact of the use of contextualized embeddings on search performance. We also present results for paper embeddings that incorporate citation information through retrofitting. Additionally, experiments are conducted using different techniques for assigning author weights based on author order. We observe that using contextual embeddings produced by a transformer model trained for sentence similarity tasks produces the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
