An Analysis of Variations in the Effectiveness of Query Performance Prediction
Debasis Ganguly, Suchana Datta, Mandar Mitra, Derek Greene

TL;DR
This paper investigates how variations in ground truth metrics for query performance prediction (QPP) influence experimental outcomes, revealing significant variability in results and proposing optimal evaluation configurations.
Contribution
It analyzes the impact of ground truth variability on QPP evaluation outcomes and identifies configurations that minimize result fluctuations.
Findings
QPP outcomes vary significantly with different ground truth metrics.
Certain evaluation metric and setting combinations reduce variability.
Results inform more stable QPP evaluation practices.
Abstract
A query performance predictor estimates the retrieval effectiveness of an IR system for a given query. An important characteristic of QPP evaluation is that, since the ground truth retrieval effectiveness for QPP evaluation can be measured with different metrics, the ground truth itself is not absolute, which is in contrast to other retrieval tasks, such as that of ad-hoc retrieval. Motivated by this argument, the objective of this paper is to investigate how such variances in the ground truth for QPP evaluation can affect the outcomes of QPP experiments. We consider this not only in terms of the absolute values of the evaluation metrics being reported (e.g. Pearson's , Kendall's ), but also with respect to the changes in the ranks of different QPP systems when ordered by the QPP metric scores. Our experiments reveal that the observed QPP outcomes can vary considerably, both in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Information Retrieval and Search Behavior · Advanced Database Systems and Queries
