On the Biased Assessment of Expert Finding Systems
Jens-Joris Decorte, Jeroen Van Hautte, Chris Develder, Thomas, Demeester

TL;DR
This paper examines how automated annotation biases in expert finding system evaluations can lead to overestimated performance and unfair comparisons, proposing constraints to improve evaluation fairness.
Contribution
It reveals biases caused by automated annotations and synonyms in expert retrieval benchmarks, and proposes constraints to mitigate these biases.
Findings
Automated annotations inflate performance metrics for traditional models.
Synonym augmentation introduces bias towards literal mentions.
Constraints can reduce bias while maintaining annotation utility.
Abstract
In large organisations, identifying experts on a given topic is crucial in leveraging the internal knowledge spread across teams and departments. So-called enterprise expert retrieval systems automatically discover and structure employees' expertise based on the vast amount of heterogeneous data available about them and the work they perform. Evaluating these systems requires comprehensive ground truth expert annotations, which are hard to obtain. Therefore, the annotation process typically relies on automated recommendations of knowledge areas to validate. This case study provides an analysis of how these recommendations can impact the evaluation of expert finding systems. We demonstrate on a popular benchmark that system-validated annotations lead to overestimated performance of traditional term-based retrieval models and even invalidate comparisons with more recent neural methods. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Business Intelligence · Semantic Web and Ontologies
