TL;DR
ExpFinder is a new ensemble model that combines an N-gram vector space approach with a graph-based algorithm to improve expert finding accuracy in scientific data, outperforming existing methods significantly.
Contribution
The paper introduces ExpFinder, integrating a novel N-gram vector space model with a modified CO-HITS algorithm for enhanced expert identification.
Findings
ExpFinder outperforms six existing models by 19% to 160.2%.
The N-gram vector space model effectively captures recent inverse document frequency features.
The ensemble approach significantly improves expert finding accuracy.
Abstract
Finding an expert plays a crucial role in driving successful collaborations and speeding up high-quality research development and innovations. However, the rapid growth of scientific publications and digital expertise data makes identifying the right experts a challenging problem. Existing approaches for finding experts given a topic can be categorised into information retrieval techniques based on vector space models, document language models, and graph-based models. In this paper, we propose , a new ensemble model for expert finding, that integrates a novel -gram vector space model, denoted as VSM, and a graph-based model, denoted as \textit{\muCO-HITS}, that is a proposed variation of the CO-HITS algorithm. The key of VSM is to exploit recent inverse document frequency weighting method for -gram words and incorporates VSM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
