Effective Distributed Representations for Academic Expert Search

Mark Berger; Jakub Zavrel; Paul Groth

arXiv:2010.08269 · cs.IR · November 10, 2022

TL;DR

This paper investigates how various distributed representations of academic papers, especially contextualized embeddings, affect the effectiveness of academic expert search, highlighting the superiority of transformer-based embeddings.

Contribution

The paper demonstrates that transformer-based contextual embeddings significantly improve expert retrieval performance over the other embedding methods it evaluates.

Findings

01. Contextualized embeddings outperform traditional methods.

02. Retrofitting paper embeddings with citation information does not improve retrieval.

03. Author weighting strategies based on author order have limited impact.
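
For readers unfamiliar with retrofitting, the sketch below shows the commonly used retrofitting update applied to a citation graph: each paper vector is pulled toward the average of its neighbours while staying close to its original value. The function name `retrofit`, the treatment of citations as a symmetric neighbour list, and the weights (`alpha = 1`, neighbour weight `1 / degree`) are assumptions made for illustration and are not taken from the paper.

```python
def retrofit(embeddings, citations, iterations=10, alpha=1.0):
    """Pull each paper vector toward its citation neighbours.

    embeddings: dict paper_id -> list of floats (original vectors)
    citations:  dict paper_id -> list of neighbouring paper ids (assumed format)
    Returns a new dict; the input embeddings are left untouched.
    """
    new = {pid: list(vec) for pid, vec in embeddings.items()}
    for _ in range(iterations):
        for pid, neighbours in citations.items():
            # Ignore neighbours we have no vector for.
            neighbours = [n for n in neighbours if n in new]
            if pid not in embeddings or not neighbours:
                continue
            beta = 1.0 / len(neighbours)  # assumed neighbour weight
            original = embeddings[pid]
            updated = []
            for d in range(len(original)):
                neighbour_sum = sum(beta * new[n][d] for n in neighbours)
                # Weighted average of the original vector and the neighbours.
                updated.append(
                    (alpha * original[d] + neighbour_sum)
                    / (alpha + beta * len(neighbours))
                )
            new[pid] = updated
    return new


# Hypothetical toy data: papers "a" and "b" cite each other, "c" is isolated.
toy_embeddings = {"a": [0.0, 0.0], "b": [2.0, 0.0], "c": [10.0, 10.0]}
toy_citations = {"a": ["b"], "b": ["a"]}
retrofitted = retrofit(toy_embeddings, toy_citations)
```

In this toy example the two linked papers move toward each other and the isolated paper keeps its original vector.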

Abstract

Expert search aims to find and rank experts based on a user's query. In academia, retrieving experts is an efficient way to navigate through a large amount of academic knowledge. Here, we study how different distributed representations of academic papers (i.e. embeddings) impact academic expert retrieval. We use the Microsoft Academic Graph dataset and experiment with different configurations of a document-centric voting model for retrieval. In particular, we explore the impact of the use of contextualized embeddings on search performance. We also present results for paper embeddings that incorporate citation information through retrofitting. Additionally, experiments are conducted using different techniques for assigning author weights based on author order. We observe that using contextual embeddings produced by a transformer model trained for sentence similarity tasks produces the…
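
The document-centric voting idea described in the abstract can be sketched roughly as follows: papers are ranked by embedding similarity to the query, and each retrieved paper casts a vote for its authors, optionally scaled by author position. This is a minimal sketch under stated assumptions, not the paper's configuration. The function names, the summed-similarity aggregation, the `top_k` cutoff, and the two weighting schemes (`uniform`, `reciprocal`) are all hypothetical choices made for illustration.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def author_weights(n_authors, scheme="uniform"):
    """Per-position weights for a byline (both schemes are assumptions)."""
    if scheme == "uniform":
        return [1.0] * n_authors
    if scheme == "reciprocal":
        return [1.0 / (pos + 1) for pos in range(n_authors)]
    raise ValueError(f"unknown scheme: {scheme}")


def rank_experts(query_vec, papers, top_k=2, scheme="uniform"):
    """Rank authors by the summed similarity of their retrieved papers.

    papers: list of (embedding, authors_in_byline_order) pairs.
    """
    scored = sorted(
        ((cosine(query_vec, emb), authors) for emb, authors in papers),
        key=lambda item: item[0],
        reverse=True,
    )[:top_k]
    votes = defaultdict(float)
    for score, authors in scored:
        weights = author_weights(len(authors), scheme)
        for weight, author in zip(weights, authors):
            votes[author] += weight * score
    return sorted(votes.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy corpus: 2-d vectors stand in for real paper embeddings.
toy_papers = [
    ([1.0, 0.0], ["alice", "bob"]),
    ([0.8, 0.6], ["bob"]),
    ([0.0, 1.0], ["carol"]),
]
uniform_ranking = rank_experts([1.0, 0.0], toy_papers, scheme="uniform")
reciprocal_ranking = rank_experts([1.0, 0.0], toy_papers, scheme="reciprocal")
```

In a real setting the toy vectors would be replaced by paper embeddings from whichever model is being compared, and the query would be embedded with the same model.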

Code & Models

Repositories

mabergerx/SDP500_expert_search (official)
